Including code data during pre-training of large language models can significantly improve their performance on a variety of non-coding tasks, such as natural language reasoning and generative tasks. The study found that a balanced mix of code and text data during initial pre-training, followed by text-centric continued pre-training, produced the strongest overall results.
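The two-phase recipe described above can be sketched as a simple data-mixing schedule. This is a minimal, hypothetical illustration: the function names, the 50% initial code fraction, the 10% continued-training fraction, and the 80% phase-switch point are all assumptions for demonstration, not values reported by the study.

```python
import random

def code_fraction(step, total_steps, switch_frac=0.8,
                  initial_code_frac=0.5, continued_code_frac=0.1):
    """Return the probability of drawing a code document at this step.

    Phase 1 (initial pre-training): balanced code/text mix.
    Phase 2 (text-centric continued training): mostly text.
    All fractions here are illustrative placeholders.
    """
    if step < switch_frac * total_steps:
        return initial_code_frac
    return continued_code_frac

def sample_batch(step, total_steps, code_docs, text_docs,
                 batch_size=4, seed=0):
    """Sample a training batch according to the phase's code fraction."""
    rng = random.Random(seed + step)
    p_code = code_fraction(step, total_steps)
    return [rng.choice(code_docs) if rng.random() < p_code
            else rng.choice(text_docs)
            for _ in range(batch_size)]

if __name__ == "__main__":
    code = ["def f(): pass", "int main() { return 0; }"]
    text = ["The cat sat on the mat.", "Paris is in France."]
    # Early step: balanced mix; late step: text-heavy mix.
    print(code_fraction(0, 100), code_fraction(90, 100))
    print(sample_batch(0, 100, code, text))
```

In a real pre-training pipeline the same idea would be expressed as corpus sampling weights in the data-loader config rather than per-example coin flips, but the schedule structure is the same.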
From notes.aimodels.fyi