EmbodiedGPT

Paper: EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought GitHub Link Publisher: NeurIPS 2023 Author Affiliation: The University of Hong Kong, Functional Division ...

May 24, 2023 NeurIPS 2023

DetGPT

Paper: DetGPT: Detect What You Need via Reasoning GitHub Link Publisher: arXiv Author Affiliation: The Hong Kong University of Science and Technology Functional Division Under...

May 23, 2023 arXiv

SpeechGPT

Paper: SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities GitHub Link Publisher: EMNLP 2023 Author Affiliation: Fudan University Functional D...

May 18, 2023 EMNLP 2023

InstructBLIP

Paper: InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning GitHub Link Publisher: NeurIPS 2023 Author Affiliation: Salesforce Research Functional Divisio...

May 11, 2023 NeurIPS 2023

InstructBLIP’s IT

Paper: InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning GitHub Link Publisher: NeurIPS 2023 Author Affiliation: Salesforce Research Type SF...

May 11, 2023 NeurIPS 2023

VideoChat

Paper: VideoChat: Chat-Centric Video Understanding GitHub Link Publisher: arXiv Author Affiliation: Shanghai AI Laboratory & Nanjing University & The University of Hong Kong & ...

May 10, 2023 arXiv

VideoChat’s IT

Paper: VideoChat: Chat-Centric Video Understanding GitHub Link Publisher: arXiv Author Affiliation: Shanghai AI Laboratory & Nanjing University & The University of Hong Kong & ...

May 10, 2023 arXiv

MultiModal-GPT

Paper: MultiModal-GPT: A Vision and Language Model for Dialogue with Humans GitHub Link Publisher: arXiv Author Affiliation: Shanghai AI Laboratory & The University of Hong Kong & ...

May 8, 2023 arXiv

X-LLM

Paper: X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages GitHub Link Publisher: arXiv Author Affiliation: Chinese Academy of Sciences F...

May 7, 2023 arXiv

Otter

Paper: Otter: A Multi-Modal Model with In-Context Instruction Tuning GitHub Link Publisher: arXiv Author Affiliation: Nanyang Technological University Functional Division Unde...

May 5, 2023 arXiv