Ai

Published on
December 23, 2024
A brief history of LLM Scaling Laws and what to expect in 2025
ai llm classical
A brief history of LLM Scaling Laws from compute-optimal training and inference to scaling test-time compute and whether Scaling Laws are coming to an end.
Published on
December 18, 2024
Scaling Laws for LLM Pretraining
ai llm classical
A comparison of Scaling Laws for LLM Pretraining, from Kaplan, to Chinchilla, the Chinchilla Trap, covering compute-optimal training and inference.
Published on
November 15, 2024
Generating Synthetic Data for LLM Post-Training
ai llm classical
An overview of the motivations and techniques used for generating synthetic data for LLM post-training, as seen in the Llama 3.1, AFM, Qwen2 and Hunyuan-Large papers.
Published on
October 20, 2024
Scaling LLM Test Time Compute
ai llm classical
An overview of recent research on scaling test-time compute in large language models (LLMs) including CoT, STaR, ReST, RISE, ORMs, PRMs and more.

A brief history of LLM Scaling Laws and what to expect in 2025