- Recent RL Infra Related Papers
- High Precision Used for Reasoning Recipes
- DeepSeek-V3's Hardware-Aware Design
- Summary on Llama-Nemotron
- Summary on StreamRL
- Disaggregate Prefill and Decoding
- Summary on DeepSeek R1 and Kimi k1.5
- Summary on MiniMax-01
- Summary on Zero Bubble
- CUDA H100 GEMM Optimization
- Summary of SemiAnalysis o1 Reasoning Report
- People Retrospective: Communication, Growth, Collaboration, and Challenges
- Summary on DeepSeek V3
- Scaling Law
- Understand Speculative Decoding for LLM Inference
- Notes on Reading Hunyuan Model
- Educational Materials for GEMM Optimizations on CPUs and GPUs