Type: Multimodal chat (text and images mixed within a chat session)
Website:
Enhancing Vision-language Understanding with Advanced Large Language Models