LanguageBind

Paper: LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment GitHub Link Publisher: ICLR 2024 Author Affiliation: Peking University Functi...

Oct 3, 2023 ICLR 2024

JAM

Paper: Jointly Training Large Autoregressive Multimodal Models GitHub Link: None Publisher: Arxiv Author Affiliation: Meta AI Functional Division Understanding Generatio...

Sep 27, 2023 Arxiv

AnyMAL

Paper: AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model GitHub Link: None Author Affiliation: FAIR, Meta & Meta Reality Labs Functional Division Under...

Sep 27, 2023 Arxiv

QBench

Paper: Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision Project Link Publisher: Arxiv Author Affiliation: Nanyang Technological University

Sep 25, 2023 Arxiv

LLaVA-RLHF

Paper: Aligning Large Multimodal Models with Factually Augmented RLHF GitHub Link Publisher: Arxiv Author Affiliation: UC Berkeley Type SFT RLHF Multi-turn ...

Sep 25, 2023 Arxiv

Kosmos-2.5

Paper: Kosmos-2.5: A Multimodal Literate Model GitHub Link Publisher: Arxiv Author Affiliation: Microsoft Functional Division Understanding Generation Design D...

Sep 20, 2023 Arxiv

DreamLLM

Paper: DreamLLM: Synergistic Multimodal Comprehension and Creation GitHub Link Publisher: ICLR 2024 Author Affiliation: MEGVII Functional Division Understanding Generati...

Sep 20, 2023 ICLR 2024

T2M

Paper: NExT-GPT: Any-to-Any Multimodal LLM GitHub Link Publisher: ICLR 2024 Author Affiliation: National University of Singapore Type SFT RLHF Multi-turn ...

Sep 11, 2023 ICLR 2024

NExT-GPT

Paper: NExT-GPT: Any-to-Any Multimodal LLM GitHub Link Publisher: ICLR 2024 Author Affiliation: National University of Singapore Functional Division Understanding Genera...

Sep 11, 2023 ICLR 2024

MosIT

Paper: NExT-GPT: Any-to-Any Multimodal LLM GitHub Link Publisher: ICLR 2024 Author Affiliation: National University of Singapore Type SFT RLHF Multi-turn ...

Sep 11, 2023 ICLR 2024

LanguageBind

JAM

AnyMAL

QBench

LLaVA-RLHF

Kosmos-2.5

DreamLLM

T2M

NExT-GPT

MosIT

Trending Tags