
I am a Staff Research Scientist @ Meta SuperIntelligence Lab.
I enjoy pushing the frontier of Large Language Models in both performance and compute efficiency where I have contributed towards:
My daily work involves balancing research exploration and engineering execution to drive projects from prototype to product impact.
In my previous life, I designed and studied efficient attention architectures to scale up visual perception during my PhD with Prof. Ehsan Elhamifar. I received my B.Sc. from the Advanced Program in Computer Science at the University of Sciences (Viet Nam).
If you are interested in my research or in collaborating, I can be reached via: