In this video we dive into the Orca 2 model, presented in a recent research by Microsoft, titled: "Orca 2: Teaching Small Language Models How to Reason". 

We first provide a background for the previous Orca paper which was released earlier this year, so previous knowledge about the first Orca model is not required to follow this video. We discuss about what is imitation learning, and how explanation tuning helps to boost the knowledge gained with imitation learning, as shown in Orca 1.

Then, we explain the two key improvements that Orca 2 brings to the table. One improves the quality of the data used for training by Orca 2 by using prompts selectively for different language tasks, and the second is Cautious Reasoning, a new term introduced in this paper, which is about teaching the model to be able to choose the proper solution strategy to use in order to answer a given user instruction. 
Orca 2 gains this capability thanks to Prompt Erasing, a novel technique introduced in the paper, which we also cover in this video.

-----------------------------------------------------------------------------------------------
Paper page - https://arxiv.org/abs/2311.11045

Blog post - https://aipapersacademy.com/orca-2/

Model page - https://huggingface.co/microsoft/Orca-2-13b

Orca 1 video - https://youtu.be/D8eZugu63vI
-----------------------------------------------------------------------------------------------
✉️ Join the newsletter - https://aipapersacademy.com/newsletter/

👍 Please like & subscribe if you enjoy this content

We use VideoScribe to edit our videos - https://tidd.ly/44TZEiX (affiliate)
-----------------------------------------------------------------------------------------------

Chapters:
0:00 Introducing Orca 2
0:48 Orca 1 Recap
3:00 What's New With Orca 2
5:48 Orca 2 Results

AI Papers Academy

Microsoft's Orca 2 research paper introduces two key improvements over Orca 1 for training small language models. First, it maps specific solution strategies (step-by-step, recall-then-generate, direct-answer, etc.) to appropriate task types, ensuring training data is more accurate. Second, it introduces 'cautious reasoning' via a technique called Prompt Erasing, where system instructions are replaced with a generic prompt during training, teaching the model to autonomously select the right reasoning strategy. Built on LLaMA-2 (7B and 13B), Orca 2 outperforms or matches much larger models like LLaMA-2-Chat 70B on most benchmarks, with model weights publicly released.

Orca 2 by Microsoft: Teaching Small Language Models How to Reason