Jianyu Huang's Blog
  • May 4, 2025

    DeepSeek-V3's Hardware-Aware Design

  • May 4, 2025

    Summary on Llama-Nemotron

  • May 4, 2025

    Summary on StreamRL

  • Mar 30, 2025

    Disaggregate Prefill and Decoding

  • Jan 20, 2025

    Summary on DeepSeek R1 and Kimi k1.5

  • Jan 18, 2025

    Summary on MiniMax-01

  • Dec 30, 2024

    Summary on Zero Bubble

  • Dec 29, 2024

    CUDA H100 GEMM Optimization

  • Dec 28, 2024

    Summary of SemiAnalysis o1 Reasoning Report

  • Dec 27, 2024

    People Retrospective: Communication, Growth, Collaboration, and Challenges

  • Dec 26, 2024

    Summary on DeepSeek V3

  • Dec 23, 2024

    Scaling Law

  • Dec 15, 2024

    Understand Speculative Decoding for LLM Inference

  • Nov 12, 2024

    Notes on Reading Hunyuan Model

  • Nov 11, 2024

    Educational Materials for GEMM Optimizations on CPUs and GPUs

  • Jianyu Huang
  • jianyu0huang@gmail.com

A record of technical thoughts.