Zhaojiang Lin

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Mar 07, 2024
Via arXiv

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Feb 16, 2024
Via arXiv

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Sep 27, 2023
Via arXiv

Continual Dialogue State Tracking via Example-Guided Question Answering

May 23, 2023
Via arXiv

Introducing Semantics into Speech Encoders

Nov 15, 2022
Via arXiv

IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text

Oct 26, 2022
Via arXiv

Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values

Oct 14, 2022
Via arXiv

FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Dec 28, 2021
Via arXiv

Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation

Dec 07, 2021
Via arXiv

Few-Shot Bot: Prompt-Based Learning for Dialogue Systems

Oct 15, 2021
Via arXiv