Date & Time
Aug. 13, 2025, 5 p.m. - Aug. 13, 2025, 8 p.m.
Cost
$0
Location
Austin, TX
Join PyTorch ATX this August for a hands-on look at the next generation of AI inference pipelines. We'll explore the full modern stack, from aggressive model-size reductions such as INT4/INT8 quantization and pruning to dynamic batching, paged-attention memory tricks, and multi-node scheduling. We'll also dive into vLLM, one of today's most popular open-source engines for high-throughput LLM inference, alongside other cutting-edge inference stacks.
Expect deeply technical talks, live demos, and open Q&A with the engineers building and running these systems.
Presentations
When: Aug. 13, 2025, 5 p.m. - 8 p.m.
Where: Austin, TX - exact location TBD
Food and beverages will be provided.