Can LoReFT rival LoRA? According to the ReFT paper, it has the potential to replace LoRA in various cases.

In this video we dive into the research paper that presents ReFT and LoReFT. We'll explain what representation fine-tuning (ReFT) is, and how it differs from previous parameter-efficient fine-tuning (PEFT) methods, such as LoRA.
ReFT is a family of methods for adapting pre-trained transformer models to specific tasks, and we'll go over the concept behind how such methods work. Specifically, the paper presents a concrete ReFT method called LoReFT, which stands for Low-rank Linear Subspace ReFT. We'll explain how it works, and look at results from the paper that show the great potential of this method compared to previous PEFT methods.

Paper page - https://arxiv.org/abs/2404.03592
GitHub repo - https://github.com/stanfordnlp/pyreft
Code for LoReFT - https://github.com/stanfordnlp/pyreft/blob/main/pyreft/interventions.py
Blog post - https://aipapersacademy.com/reft/

-----------------------------------------------------------------------------------------------
✉️ Join the newsletter - https://aipapersacademy.com/newsletter/

👍 Please like & subscribe if you enjoy this content
-----------------------------------------------------------------------------------------------

Chapters:
0:00 Introduction & Motivation
1:49 What is ReFT?
3:47 ReFT & LoReFT Details
6:07 LoReFT Results

AI Papers Academy

ReFT (Representation Finetuning) is a parameter-efficient finetuning approach for large language models, introduced in a Stanford research paper. Unlike LoRA, which modifies model weights, ReFT edits hidden representations via small trainable intervention components inserted between transformer layers. The specific method introduced, LoReFT (Low-rank Linear Subspace ReFT), uses 10-50x fewer parameters than LoRA while achieving competitive or superior results on commonsense reasoning, arithmetic reasoning, and instruction-following benchmarks. LoReFT applies interventions to prefix and suffix tokens at selected layers, training the matrices R and W and the vector b that define the edit to each representation. One LoReFT variant achieved the best win-rate among evaluated open-source models on instruction following, and was trained in just 18 minutes on a single A100 GPU.
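To make the roles of R, W and b concrete, here is a minimal NumPy sketch of the LoReFT edit as the paper defines it, Φ(h) = h + Rᵀ(Wh + b − Rh), where R is a low-rank projection with orthonormal rows. The function names, the tiny dimensions, and the random initialization are illustrative assumptions; this is not the pyreft implementation, where R, W and b are learned by gradient descent while the base model stays frozen.

```python
import numpy as np


def make_loreft_params(d, r, seed=0):
    """Create illustrative (untrained) LoReFT parameters for hidden size d, rank r."""
    rng = np.random.default_rng(seed)
    # R has shape (r, d) with orthonormal rows; built here from a reduced QR
    # decomposition of a random (d, r) matrix. This init is an assumption.
    q, _ = np.linalg.qr(rng.standard_normal((d, r)))
    R = q.T
    W = 0.02 * rng.standard_normal((r, d))  # linear map into the r-dim subspace
    b = np.zeros(r)                         # bias in the r-dim subspace
    return R, W, b


def loreft(h, R, W, b):
    """Apply Phi(h) = h + R^T (W h + b - R h) to hidden states of shape (..., d)."""
    target = h @ W.T + b    # where the subspace coordinates should end up
    current = h @ R.T       # where the subspace coordinates are now
    return h + (target - current) @ R


d, r = 16, 4
R, W, b = make_loreft_params(d, r)
h = np.random.default_rng(1).standard_normal((3, d))  # 3 token positions
h_new = loreft(h, R, W, b)

# Trainable parameters per intervention: R and W are each r*d, b is r.
n_params = R.size + W.size + b.size
```

Because the rows of R are orthonormal, the edit only changes the r-dimensional subspace spanned by R: projecting the edited state back gives exactly Wh + b, and everything orthogonal to that subspace is left untouched. Each intervention costs 2·r·d + r trainable parameters, which is why a small rank keeps the method so light.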

ReFT: Representation Finetuning for Language Models | AI Paper Explained