GTC 2019 - TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference

Date: Mar 22, 2019 3:30 PM

Event: NVIDIA GPU Technology Conference 2019

Location: San Jose, CA

Cheng Li

Member of Technical Staff

I specialize in building efficient AI training and inference systems using GPUs, with a focus on optimizing performance for Large Language Models (LLMs) and Large Vision Models (LVMs).
