In-Context Learning (ICL) allows pre-trained models to make predictions on new data without any retraining. The TabPFN R package brings this capability to tabular data by applying a Transformer architecture — the same architecture that powers LLMs — to spreadsheet rows instead of text tokens. Rather than being trained on real-world datasets, TabPFN was trained on millions of synthetically generated mathematical dependency structures, giving it broad pattern recognition for tabular data. A demo on the iris dataset achieves 97.8% accuracy with no hyperparameter tuning, positioning TabPFN as a compelling alternative to Random Forests or XGBoost for small to medium datasets.
Table of contents
- The Transformer: From Text to Tables
- The Training Matrix: Learning the Shape of Maths
- Let's see it in action
- Conclusion