My work focus on optimizing inference/training of Deep Learning models, particularly on Transformers (LLMs).