InternLM-XComposer
Paper: InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition (GitHub Link)
Publisher: Arxiv
Author Affiliation: Shanghai Artificial Intellig...

Paper: Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning (GitHub Link)
Publisher: Arxiv
Author Affiliation: FAIR
Functional Division: Understanding ...

Paper: PointLLM: Empowering Large Language Models to Understand Point Clouds (GitHub Link)
Publisher: Arxiv
Author Affiliation: The Chinese University of Hong Kong
Functional Division: ...

Paper: Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond (GitHub Link)
Publisher: Arxiv
Author Affiliation: Alibaba Group
Functional Division: ...

Paper: Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages (GitHub Link)
Publisher: Arxiv
Author Affiliation: Tsinghua University
Functional Division: ...

Paper: Introducing IDEFICS: An Open Reproduction of State-of-the-Art Visual Language Model (GitHub Link)
Publisher: Website
Functional Division: Understanding, Generation ...

Paper: StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data (GitHub Link)
Publisher: Arxiv
Author Affiliation: University of Technology Sydney
Type: ...

Paper: BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions (GitHub Link)
Publisher: AAAI 2024
Author Affiliation: UC San Diego
Functional Division: U...

Paper: Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes (GitHub Link)
Publisher: Arxiv
Author Affiliation: Zhejiang University
Functional Division: ...

Paper: MMBench: Is Your Multi-modal Model an All-around Player? (Project Link)
Publisher: Arxiv
Author Affiliation: Shanghai AI Laboratory