A good meta-learning model is expected to generalize to new tasks or new environments. The adaptation process happens at test with limited exposure to the new configurations. Even without any explicit fine-tuning (no gradient backpropagation on trainable variables) the model autonomously adjusts internal hidden states to learn.

20m read timeFrom lilianweng.github.io
Post cover image
Table of contents
On the Origin of Meta-RL #Define Meta-RL #Meta-Learning Algorithms for Meta-RL #Training Task Acquisition #References #

Sort: