This post introduces the concept of Unsolvable Problem Detection (UPD) for Vision Language Models (VLMs), which aims to improve their reliability and trustworthiness. The researchers propose three problem types within UPD: Absent Answer Detection (AAD), Incompatible Answer Set Detection (IASD), and Incompatible Visual Question Detection (IVQD). They explore the performance of various VLMs on these problem types and discuss strategies to improve UPD accuracy. The research emphasizes the complexity of the UPD challenge and the need for innovative approaches to enhance VLMs.
Sort: