LLM#
Large Language Models (LLM) training, inference, and optimization. Covers PyTorch for model development, distributed training across GPUs, vLLM and SGLang for high-performance LLM inference and serving, and benchmarking tools for measuring serving performance.