Picture for Damai Dai

Damai Dai

Exploring Activation Patterns of Parameters in Language Models

Add code
May 28, 2024
Viaarxiv icon

Large Language Models Are Unconscious of Unreasonability in Math Problems

Add code
Mar 28, 2024
Figure 1 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Figure 2 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Figure 3 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Figure 4 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Viaarxiv icon

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

Add code
Feb 25, 2024
Viaarxiv icon

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Add code
Jan 11, 2024
Viaarxiv icon

Language Models Understand Numbers, at Least Partially

Add code
Jan 08, 2024
Viaarxiv icon

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Add code
Jan 05, 2024
Viaarxiv icon

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Add code
Dec 28, 2023
Viaarxiv icon

Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning

Add code
Oct 12, 2023
Figure 1 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Figure 2 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Figure 3 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Figure 4 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Viaarxiv icon

Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion

Add code
May 25, 2023
Figure 1 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 2 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 3 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 4 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Viaarxiv icon

Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization

Add code
May 24, 2023
Figure 1 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 2 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 3 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 4 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Viaarxiv icon