EmoNet: Speaker-Aware Transformers for Emotion Recognition — and What I’d Build Differently in 2026

A retrospective on EmoNet, an MS thesis project that achieved competitive results on the EmoryNLP emotion recognition benchmark using speaker-aware transformers. The system introduced global speaker identity tracking across dialogues, a GRU-based speaker behaviour module, and weighted cross-entropy loss for class imbalance. Two years later, the author reflects on how the field shifted to LLM-based approaches (InstructERC, BiosERC, LaERC-S) and notes that the core architectural intuitions — speaker biography and historical behaviour — survived the paradigm shift, just reimplemented via instruction tuning and retrieval-augmented prompting. The post concludes with how the author would rebuild EmoNet today using LoRA fine-tuning on a small open-source LLM.

#llm

#nlp

#transformers

#lora

Yesterday•10m read time•From towardsdatascience.com

Table of contents

What ERC is, and why text-only is hard The 2024 landscape Three contributions, with intuition Results: what worked, and what surprised me Reflection (2026): the field moved, and so should we Where this leaves me

Comment

Bookmark

Copy

Sort: