Research paper proposing a scaling law for optimizing domain knowledge injection during LLM pretraining. The study identifies critical collapse points at which excessive domain-specific data causes catastrophic forgetting, and demonstrates that these thresholds scale predictably with model size. The proposed law, validated across multiple model sizes and token budgets, makes it possible to predict the optimal amount of knowledge infusion for a large model by analyzing its smaller counterparts.
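To make the extrapolation idea concrete, here is a minimal sketch of how such a scaling law might be fit and used in practice: observe collapse thresholds on small models, fit a power law in log-log space, and extrapolate to a larger target size. All data values, the assumed power-law form, and the function names below are hypothetical illustrations, not figures or methods taken from the paper.

```python
import numpy as np

# Hypothetical measurements from small-scale runs: model size (parameters)
# and the domain-data fraction at which catastrophic forgetting set in.
model_sizes = np.array([1e8, 3e8, 1e9, 3e9])              # parameters
collapse_fractions = np.array([0.08, 0.12, 0.17, 0.24])   # domain share of the mix

# Assume a power-law relationship: fraction = a * N^b.
# In log-log space this is linear, so an ordinary least-squares fit suffices.
b, log_a = np.polyfit(np.log(model_sizes), np.log(collapse_fractions), deg=1)
a = np.exp(log_a)

def predicted_collapse_fraction(n_params: float) -> float:
    """Extrapolate the hypothetical collapse threshold to a larger model."""
    return a * n_params ** b

# Predict the safe domain-data budget for a 70B-parameter model.
target = 70e9
print(f"fit: fraction ≈ {a:.3g} * N^{b:.3f}")
print(f"predicted collapse threshold at 70B params: {predicted_collapse_fraction(target):.3f}")
```

Under this (assumed) power-law form, the fit requires only a handful of small-model runs, which is what makes predicting the threshold for a large model cheap relative to sweeping domain-data ratios at full scale.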