GTC 2019 - TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference


Date
Mar 22, 2019 3:30 PM
Location
San Jose, CA
Cheng Li
Cheng Li
Senior Researcher

My work focus on optimizing inference/training of Deep Learning models, particularly on Transformers (LLMs).