Picture for Yuhao Zhang

Yuhao Zhang

A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness

Add code
May 27, 2024
Viaarxiv icon

NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU

Add code
May 12, 2024
Viaarxiv icon

Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models

Add code
Mar 23, 2024
Figure 1 for Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
Figure 2 for Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
Figure 3 for Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
Figure 4 for Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
Viaarxiv icon

Verified Training for Counterfactual Explanation Robustness under Data Shift

Add code
Mar 06, 2024
Figure 1 for Verified Training for Counterfactual Explanation Robustness under Data Shift
Figure 2 for Verified Training for Counterfactual Explanation Robustness under Data Shift
Figure 3 for Verified Training for Counterfactual Explanation Robustness under Data Shift
Figure 4 for Verified Training for Counterfactual Explanation Robustness under Data Shift
Viaarxiv icon

Soft Alignment of Modality Space for End-to-end Speech Translation

Add code
Dec 18, 2023
Viaarxiv icon

DragVideo: Interactive Drag-style Video Editing

Add code
Dec 03, 2023
Viaarxiv icon

Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models

Add code
Nov 30, 2023
Viaarxiv icon

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

Add code
Nov 23, 2023
Viaarxiv icon

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

Add code
Nov 07, 2023
Viaarxiv icon

Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition

Add code
Sep 21, 2023
Figure 1 for Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition
Figure 2 for Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition
Figure 3 for Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition
Figure 4 for Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition
Viaarxiv icon