Picture for Xiang Yue

Xiang Yue

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Add code
May 27, 2024
Viaarxiv icon

Long Context Alignment with Short Instructions and Synthesized Positions

Add code
May 07, 2024
Viaarxiv icon

MAmmoTH2: Scaling Instructions from the Web

Add code
May 06, 2024
Figure 1 for MAmmoTH2: Scaling Instructions from the Web
Figure 2 for MAmmoTH2: Scaling Instructions from the Web
Figure 3 for MAmmoTH2: Scaling Instructions from the Web
Figure 4 for MAmmoTH2: Scaling Instructions from the Web
Viaarxiv icon

MuPT: A Generative Symbolic Music Pretrained Transformer

Add code
Apr 10, 2024
Viaarxiv icon

VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?

Add code
Apr 09, 2024
Figure 1 for VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
Figure 2 for VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
Figure 3 for VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
Figure 4 for VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
Viaarxiv icon

CodeEditorBench: Evaluating Code Editing Capability of Large Language Models

Add code
Apr 06, 2024
Viaarxiv icon

Long-context LLMs Struggle with Long In-context Learning

Add code
Apr 04, 2024
Viaarxiv icon

Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents

Add code
Mar 04, 2024
Figure 1 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Figure 2 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Figure 3 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Figure 4 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Viaarxiv icon

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Add code
Feb 28, 2024
Viaarxiv icon

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Add code
Feb 28, 2024
Viaarxiv icon