MMBench-Chinese
Paper: MMBench: Is Your Multi-modal Model an All-around Player? Project Link Publisher: Arxiv Author Affiliation: Shanghai AI Laboratory
Paper: MMBench: Is Your Multi-modal Model an All-around Player? Project Link Publisher: Arxiv Author Affiliation: Shanghai AI Laboratory
Paper: Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models GitHub Link Publisher: Arxiv Author Affiliation: Sun Yat-sen University Functional Div...
Paper: Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models GitHub Link Publisher: Arxiv Author Affiliation: Sun Yat-sen University Multi-turn ...
Paper: Chinese-LLaVA GitHub Link Publisher: Website Author Affiliation: LinkSoul-AI Functional Division Understanding Generation Design Division Too...
Paper: The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World GitHub Link Publisher: Arxiv Author Affiliation: Shanghai AI Laboratory Functional ...
Paper: OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models GitHub Link Publisher: Arxiv Author Affiliation: University of Washington Functiona...
Paper: LISA: Reasoning Segmentation via Large Language Model GitHub Link Publisher: Arxiv Author Affiliation: The Chinese University of Hong Kong Functional Division Understan...
Paper: 3D-LLM: Injecting the 3D World into Large Language Models GitHub Link Publisher: Arxiv Author Affiliation: University of California, Los Angeles Functional Division Und...
Paper: ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning GitHub Link Publisher: Arxiv Author Affiliation: MEGVII Type SFT RLHF M...
Paper: ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning GitHub Link Publisher: Arxiv Author Affiliation: MEGVII Functional Division Understandi...