Published onNovember 15, 2024Generating Synthetic Data for LLM Post-TrainingaillmclassicalAn overview of the motivations and techniques used for generating synthetic data for LLM post-training, as seen in the Llama 3.1, AFM, Qwen2 and Hunyuan-Large papers.
Published onOctober 20, 2024Scaling LLM Test Time ComputeaillmclassicalAn overview of recent research on scaling test-time compute in large language models (LLMs) including CoT, STaR, ReST, RISE, ORMs, PRMs and more.