GPT-4 has recently demonstrated extraordinary multi-modal abilities, such as directly generating websites from handwritten text and identifying humorous elements within images. We are currently preparing a lighter model that runs on a single 3090 GPU, so you will be able to run it on your own machine.