22:43
Hao Zhang - Chatbot Arena (UCSD / LMSys)
906 views • 4 weeks ago
26:15
Hanna Hajishirzi (AI2) - OLMo: Findings of Training an Open LM
998 views • 4 weeks ago
16:26
MambaByte: Token-Free Language Modeling
5.6K views • 1 month ago
12:05
Luca Soldaini - Curating Pretrain Data (AI2 / Dolma)
763 views • 1 month ago
33:50
Do we need Attention? A Mamba Primer
6.2K views • 1 month ago
10:38
Swabha Swayamdipta: Towards (Closed-Source) LLM Accountability via Logit Signatures (USC)
456 views • 1 month ago
10:32
Ying Sheng - Bridging human and LLM systems
372 views • 1 month ago
23:37
Tatsu Hashimoto - Lessons from the Alpaca Project (Stanford)
910 views • 1 month ago
7:39
Louis Castricato - RLAIF, User Autonomy, and Controllability (Eleuther / Synthlabs)
204 views • 1 month ago
12:50
Eugene Cheah - From idea to LLM (RWKV / Recursal)
1K views • 1 month ago
20:02
Daphne Ippolito (CMU / Google) - No One-Size Fits All Pre-Training Data
775 views • 1 month ago
24:11
Ludwig Schmidt - Open source AI for Multimodality
767 views • 1 month ago
8:40
Leshem Choshen - Wiki-models through Natural Feedback
246 views • 1 month ago
9:30
Irina Rish (Mila) - Continual Learning of Foundation Models
421 views • 1 month ago
5:40
Niklas Muennighoff - From GPU poor to poor GPU rich
513 views • 1 month ago
19:52
Graham Neubig (CMU) - Can we make building with open-source AI as simple as prompting ChatGPT?
687 views • 1 month ago
58:02
Large Language Models in Five Formulas
32K views • 3 months ago
12:45
RNNs for Diffusion? Generating Images with DiffuSSM
1.8K views • 4 months ago
18:50
Insanely Fast Speech Recognition: Sequence Distillation for Whisper
2.6K views • 5 months ago
15:07
AIF + DPO: Distilling Zephyr and friends
3.6K views • 5 months ago
25:08
Inverting Language Models: Raw Text from Vectors and LLM APIs
2K views • 6 months ago
17:47
MiniTorch - Backprop (1.3)
518 views • 7 months ago
16:53
MiniTorch - Neural Networks (2.0)
356 views • 7 months ago
14:02
MiniTorch - Fundamentals (0.1)
2.3K views • 7 months ago
16:12
MiniTorch - Autodifferentiation (1.2)
409 views • 7 months ago
19:07
MiniTorch - Tensors (2.1)
328 views • 7 months ago
11:11
MiniTorch - Module 0 Code Walkthrough
843 views • 7 months ago
13:22
MiniTorch - Models and Modules (0.2)
861 views • 7 months ago
15:08
MiniTorch - Mini-ML (1.0)
548 views • 7 months ago
17:56
MiniTorch - Gradients (2.4)
282 views • 7 months ago