Cheng Li
Connor Holmes
Latest
DeepSpeed data efficiency: Improving deep learning model quality and training efficiency via efficient data sampling and routing
Random-LTD: Random and Layerwise Token Dropping Brings Efficient Training for Large-scale Transformers