Sign in Subscribe

Alex

🤗

19

Dec

[Notes] Designing ML Systems

16 min read

18

Dec

Gentle Intro to CUDA

5 min read

16

Dec

[Notes] The Smol Training Playbook

17 min read

12

Dec

LLM Decoder Architecture Explained

6 min read

04

Dec

Optimizing Retrieval Augmented Generation

1 min read

01

Dec

vLLM Server with AWS EKS

17 min read

15

Nov

vLLM Serve Optimizations

11 min read

12

Oct

[Notes] LLM Engineer's Handbook

23 min read

10

Sep

Cells Unpacked

6 min read

24

Jul

Questions from a Stanford HAI Discussion

1 min read