GTC 2019 - TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference


Date
Mar 22, 2019 3:30 PM
Location
San Jose, CA
Cheng Li
Member of Technical Staff

I specialize in building efficient AI training and inference systems using GPUs, with a focus on optimizing performance for Large Language Models (LLMs) and Large Vision Models (LVMs).