Picture for Fanzhuang Meng

Fanzhuang Meng

AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster

Add code
Apr 15, 2024
Viaarxiv icon

Rethinking Memory and Communication Cost for Efficient Large Language Model Training

Add code
Oct 09, 2023
Figure 1 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 2 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 3 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 4 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Viaarxiv icon