Microsoft researchers introduce Table-GPT, a model fine-tuned specifically to handle tabular data tasks where standard LLMs like ChatGPT underperform. The core issue is that LLMs pre-trained on text and code struggle with two-dimensional table reasoning, especially vertical column-based queries. ChatGPT achieves only 42.2% accuracy on column identification and 51.2% on table question answering. Table-GPT addresses this via 'table-tuning', an additional fine-tuning step using a large synthesized dataset of (instruction, table, response) triplets. The dataset is built from 2.9M Wikipedia tables and 188K database tables using a synthesis-then-augment pipeline that includes instruction paraphrasing, row/column reordering, and LLM-generated reasoning labels. Results show significant performance gains over both GPT-3.5 and ChatGPT across 8 task types, including tasks the model was never trained on.
Sort: