Paper Reading: Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism 02-26
Paper Reading: From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers 11-20