VITRON

Paper: VITRON: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Publisher: NeurIPS 2024
Author Affiliation: National University of Singapore...

Sep 26, 2024 NeurIPS 2024

TroL

Paper: TroL: Traversal of Layers for Large Language and Vision Models
Publisher: EMNLP 2024
Author Affiliation: KAIST
Functional Division: Understanding Gene...

Sep 25, 2024 EMNLP 2024

VITA

Paper: VITA: Towards Open-Source Interactive Omni Multimodal LLM
Publisher: arXiv
Author Affiliation: Tencent Youtu Lab
Functional Division: Understanding Ge...

Sep 10, 2024 arXiv

EAGLE

Paper: EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Publisher: arXiv
Author Affiliation: NVIDIA
Functional Division: Understandin...

Aug 28, 2024 arXiv

mPLUG-Owl3

Paper: mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
Publisher: arXiv
Author Affiliation: Alibaba Group
Functional Division: ...

Aug 13, 2024 arXiv

Parrot

Paper: Parrot: Multilingual Visual Instruction Tuning
Publisher: arXiv
Author Affiliation: Nanjing University
Functional Division: Understanding, Generation ...

Aug 11, 2024 arXiv

video-SALMONN

Paper: video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
Publisher: ICML 2024
Author Affiliation: Tsinghua University
Functional Division: Understa...

Jun 22, 2024 ICML 2024

VideoLLM-online

Paper: VideoLLM-online: Online Video Large Language Model for Streaming Video
Publisher: CVPR 2024
Author Affiliation: National University of Singapore
Functional Division: ...

Jun 17, 2024 CVPR 2024

Libra

Paper: Libra: Building Decoupled Vision System on Large Language Models
Publisher: ICML 2024
Author Affiliation: Chinese Academy of Sciences
Functional Division: [...

May 16, 2024 ICML 2024

CuMo

Paper: CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Publisher: arXiv
Author Affiliation: Georgia Tech & UIUC
Functional Division: Understan...

May 9, 2024 arXiv