GTC 2019 - TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference


Date
Mar 22, 2019 3:30 PM
Location
San Jose, CA
Cheng Li
Cheng Li
Senior Software Engineer

My work focus on optimizing training/inference of Deep Learning models, particularly on LLM/LMM.