Picture for Xiaohan Ding

Xiaohan Ding

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation

Add code
Apr 22, 2024
Viaarxiv icon

Leveraging Prompt-Based Large Language Models: Predicting Pandemic Health Decisions and Outcomes Through Social Media Language

Add code
Mar 01, 2024
Figure 1 for Leveraging Prompt-Based Large Language Models: Predicting Pandemic Health Decisions and Outcomes Through Social Media Language
Figure 2 for Leveraging Prompt-Based Large Language Models: Predicting Pandemic Health Decisions and Outcomes Through Social Media Language
Figure 3 for Leveraging Prompt-Based Large Language Models: Predicting Pandemic Health Decisions and Outcomes Through Social Media Language
Figure 4 for Leveraging Prompt-Based Large Language Models: Predicting Pandemic Health Decisions and Outcomes Through Social Media Language
Viaarxiv icon

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Add code
Feb 05, 2024
Viaarxiv icon

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Add code
Jan 25, 2024
Viaarxiv icon

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Add code
Dec 14, 2023
Figure 1 for VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
Figure 2 for VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
Figure 3 for VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
Figure 4 for VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
Viaarxiv icon

Online Vectorized HD Map Construction using Geometry

Add code
Dec 06, 2023
Viaarxiv icon

UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Add code
Nov 27, 2023
Viaarxiv icon

Advancing Vision Transformers with Group-Mix Attention

Add code
Nov 26, 2023
Viaarxiv icon

RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets

Add code
Oct 16, 2023
Figure 1 for RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets
Figure 2 for RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets
Figure 3 for RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets
Figure 4 for RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets
Viaarxiv icon

Towards Unified and Effective Domain Generalization

Add code
Oct 16, 2023
Figure 1 for Towards Unified and Effective Domain Generalization
Figure 2 for Towards Unified and Effective Domain Generalization
Figure 3 for Towards Unified and Effective Domain Generalization
Figure 4 for Towards Unified and Effective Domain Generalization
Viaarxiv icon