This post reviews 12 influential research papers on large language models (LLMs) published throughout 2024. It covers significant advances and methods, including mixture-of-experts models, improvements to low-rank adaptation (LoRA), effective continual-pretraining strategies, and new scaling laws. The reviews highlight developments in LLM architectures, optimization techniques, and the use of synthetic data, emphasizing their implications for future LLM research and applications.
Table of contents
1. January: Mixtral’s Mixture of Experts Approach
2. February: Weight-decomposed LoRA
3. March: Tips for Continually Pretraining LLMs
4. April: DPO or PPO for LLM alignment, or both?
5. May: LoRA learns less and forgets less
6. June: The 15 Trillion Token FineWeb Dataset
7. July: The Llama 3 Herd of Models
8. August: Improving LLMs by scaling inference-time compute
9. September: Comparing multimodal LLM paradigms
10. October: Replicating OpenAI O1’s reasoning capabilities
11. November: LLM scaling laws for precision
12. December: Phi-4 and Learning from Synthetic Data

Conclusion & Outlook