
Xiaohan Wang

Why are Visually-Grounded Language Models Bad at Image Classification?

May 28, 2024
Via arXiv

Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models

Mar 19, 2024
Via arXiv

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Mar 15, 2024
Via arXiv

Editing Conceptual Knowledge for Large Language Models

Mar 10, 2024
Via arXiv

DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval

Jan 19, 2024
Via arXiv

Describing Differences in Image Sets with Natural Language

Dec 05, 2023
Via arXiv

Exploring Large Language Models for Human Mobility Prediction under Public Events

Nov 29, 2023
Via arXiv

Editing Personality for LLMs

Oct 03, 2023
Via arXiv

DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion

Sep 04, 2023
Via arXiv

JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery

Aug 17, 2023
Via arXiv