DetGPT
Paper: DetGPT: Detect What You Need via Reasoning GitHub Link Publisher: Arxiv Author Affiliation: The Hong Kong University of Science and Technology Functional Division Under...
Paper: DetGPT: Detect What You Need via Reasoning GitHub Link Publisher: Arxiv Author Affiliation: The Hong Kong University of Science and Technology Functional Division Under...
Paper: SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities GitHub Link Publisher: EMNLP 2023 Author Affiliation: Fudan University Functional D...
Paper: InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning GitHub Link Publisher: NeurIPS 2023 Author Affiliation: Salesforce Research Functional Divisio...
Paper: InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning GitHub Link Publisher: NeurIPS 2023 Author Affiliation: Salesforce Research Type SF...
Paper: VideoChat: Chat-Centric Video Understanding GitHub Link Publisher: Arxiv Author Affiliation: Shanghai AI Laboratory & Nanjing University & The University of Hong Kong & ...
Paper: VideoChat: Chat-Centric Video Understanding GitHub Link Publisher: Arxiv Author Affiliation: Shanghai AI Laboratory & Nanjing University & The University of Hong Kong & ...
Paper: MultiModal-GPT: A Vision and Language Model for Dialogue with Humans GitHub Link Publisher: Arxiv Author Affiliation: Shanghai AI Laboratory & The University of Hong Kong & ...
Paper: X-LLM:Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages GitHub Link Publisher: Arxiv Author Affiliation: Chinese Academy of Sciences F...
Paper: Otter: A Multi-Modal Model with In-Context Instruction Tuning GitHub Link Publisher: Arxiv Author Affiliation: Nanyang Technological University Functional Division Unde...
Paper: mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality GitHub Link Publisher: Arxiv Author Affiliation: DAMO Academy, Alibaba Group Functional Division ...