Posts

Getting Started With Training Infra

Distributed Training: mainstream techniques
  PyTorch official route -> FSDP https://docs.pytorch.org/docs/stable/fsdp.html
  DeepSpeed ZeRO Stage 1/2/3 -> the theory is a must-learn; the engineering comes second
  Tensor Parallel (Megatron-LM) -> for training very large models, 70B+
  Pipeline Parallel
  Sequence Parallel / Activation Parallel
  So for the near future, the main things to learn are FSDP, ZeRO Stage 3, and Megatron-LM's TP / PP.
High-Performance Kernels + Compilers
  CUDA kernel optimization, FlashAttention v1/v2/v3
  Triton kernels
  Fused kernels
  torch.compile
  XLA / PJIT / SPMD -> heavily used only at DeepMind
  Mixed Precision -> fundamentals, pick up quickly
Training Platform + Orchestration (engineering)
  Sharded Checkpointing, Streaming Dataset + Global Shuffling, Job Orchestration / Scheduler, Fault Tolerance, Scaling & Throughput Optimization, Monitoring / Profiling / Telemetry
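To make the ZeRO Stage 1/2/3 theory mentioned above a bit more concrete, here is a minimal sketch of the per-GPU memory arithmetic for model states. It assumes mixed-precision Adam, where each parameter costs 2 bytes for fp16 weights, 2 bytes for fp16 gradients, and roughly 12 bytes of optimizer state. The function name `zero_memory_gb` and its parameters are made up for illustration, and activations, buffers, and fragmentation are ignored.

```python
def zero_memory_gb(params_billion, num_gpus, stage, optimizer_bytes=12):
    """Approximate per-GPU model-state memory in GB (1 GB = 1e9 bytes).

    Assumes mixed-precision Adam: 2 bytes/param for fp16 weights,
    2 bytes/param for fp16 gradients, and `optimizer_bytes` per param for
    optimizer state (fp32 weights, momentum, variance). Activations and
    temporary buffers are NOT included.
    """
    if stage not in (0, 1, 2, 3):
        raise ValueError("stage must be 0 (plain data parallel), 1, 2, or 3")

    # With parameters counted in billions, bytes-per-param maps directly to GB.
    weights = 2 * params_billion
    grads = 2 * params_billion
    optim = optimizer_bytes * params_billion

    if stage >= 1:  # Stage 1 shards optimizer state across GPUs
        optim /= num_gpus
    if stage >= 2:  # Stage 2 also shards gradients
        grads /= num_gpus
    if stage >= 3:  # Stage 3 also shards the weights themselves
        weights /= num_gpus

    return weights + grads + optim


for stage in range(4):
    print(f"stage {stage}: {zero_memory_gb(7.5, 64, stage):.2f} GB per GPU")
```

For a 7.5B-parameter model on 64 GPUs this goes from 120 GB per GPU with plain data parallelism down to about 1.9 GB with Stage 3, which is why Stage 3 (and FSDP, which shards in a similar way) matters for large models.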

Block Ads

iPhone: Settings -> General -> VPN & Device Management -> DNS (https://dnsforge.de/, https://my.nextdns.io/)
PC/Mac: uBlock Origin Lite

Set Up Blog

I recommend this theme: https://github.com/adityatelange/hugo-PaperMod/
Azure Front Door (CDN)
  Purchase a Front Door service; you get the endpoint xuesong-cxa7edcca8a7cgc4.z01.azurefd.net
  In GitHub Pages, remove the custom domain.
  In Azure Front Door, add the custom domain xuesong.cc. Find "Validate custom domain ownership", add that record to Cloudflare, and delete all old A, AAAA, and CNAME records.
  Point Cloudflare DNS at Front Door by adding two CNAME records:
    @: xuesong-cxa7edcca8a7cgc4.z01.azurefd.net
    www: xuesong-cxa7edcca8a7cgc4.z01.azurefd.net
  Azure Front Door -> Front Door manager -> Update route -> Domains: change to xuesong.cc
  The traffic path becomes: User -> Cloudflare DNS resolution (xuesong.cc) -> Azure Front Door edge node (CDN) -> GitHub Pages (yxs.github.io) ...
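One way to sanity-check the traffic path described above is to look at the response headers: Azure Front Door adds an `X-Azure-Ref` header to responses it serves. The sketch below uses only the Python standard library. The helper names `served_by_front_door` and `fetch_headers` are hypothetical, and the check is a rough heuristic rather than an official verification method.

```python
from urllib.request import Request, urlopen


def served_by_front_door(headers):
    """Return True if the headers include Front Door's X-Azure-Ref header.

    `headers` is any mapping of header names to values; the name comparison
    is case-insensitive because HTTP header names are.
    """
    return any(name.lower() == "x-azure-ref" for name in headers)


def fetch_headers(url, timeout=10):
    """Send a HEAD request and return the response headers as a dict."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=timeout) as response:
        return dict(response.headers.items())
```

Calling `served_by_front_door(fetch_headers("https://xuesong.cc/"))` after the DNS change has propagated should return `True` if requests are going through Front Door rather than straight to GitHub Pages.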

Gym Training For Next 3 Months

6 Days On, 1 Day Rest. RPE 8. 202509 - ?
Day 1 – Quads + Chest + HIIT
  Smith Squat – 5 sets | 8–10 reps
  Leg Press – 3 sets | 12–15 reps
  Smith Bench Press – 5 sets | 8–10 reps
  Cable Chest Fly – 3 sets | 12–15 reps
  Plank – 3×60s
  HIIT Running – 10 rounds (30s sprint + 90s walk)
Day 2 – Core + Pull + LISS ...