Making an LLM Miserable About Boston Weather
This title could be clearer and more informative.Try out Clickbait Shieldfor free (5 uses left this month).
An experiment replicating Anthropic's emotion vector research on a smaller scale using Llama 3.1 8B. The author synthesized happy and sad emotion vectors by running ~1000 inferences, then connected the model to a weather API so that Boston weather conditions dynamically adjust the model's emotional state via activation steering. The results were surprisingly effective, and the author plans to extend the work with physical sensors, more layers, and more nuanced emotions.
Sort: