BenchLMM

Paper: BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models Project Link Publisher: Arxiv Author Affiliation: Nanyang Technological University

Dec 5, 2023 Arxiv

PixelLM

Paper: PixelLM: Pixel Reasoning with Large Multimodal Model GitHub Link Publisher: Arxiv Author Affiliation: Beijing Jiaotong University Functional Division Understanding ...

Dec 4, 2023 Arxiv

RLHF-V

Paper: RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback GitHub Link Publisher: Arxiv Author Affiliation: Tsinghua University Functio...

Dec 1, 2023 Arxiv

RLHF-V’s IT

Paper: RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback GitHub Link Publisher: Arxiv Author Affiliation: Tsinghua University Type ...

Dec 1, 2023 Arxiv

Dolphins

Paper: Dolphins: Multimodal Language Model for Driving GitHub Link Publisher: Arxiv Author Affiliation: University of Wisconsin-Madison Functional Division Understanding ...

Dec 1, 2023 Arxiv

mPLUG-PaperOwl

Paper: mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model GitHub Link Publisher: Arxiv Author Affiliation: Alibaba Group Functional Division ...

Nov 30, 2023 Arxiv

X-InstructBLIP

Paper: X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning GitHub Link Publisher: Arxiv Author Affiliation: Univer...

Nov 30, 2023 Arxiv

X-InstructBLIP’s IT

Paper: X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning GitHub Link Publisher: Arxiv Author Affiliation: Univer...

Nov 30, 2023 Arxiv

CoDi-2

Paper: CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation GitHub Link Publisher: Arxiv Author Affiliation: UC Berkeley Functional Division Understanding ...

Nov 30, 2023 Arxiv

VIM

Paper: VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following GitHub Link Publisher: Arxiv Author Affiliation: University of California, Santa Barbara Fu...

Nov 29, 2023 Arxiv

BenchLMM

PixelLM

RLHF-V

RLHF-V’s IT

Dolphins

mPLUG-PaperOwl

X-InstructBLIP

X-InstructBLIP’s IT

CoDi-2

VIM

Trending Tags