Research
Research Experience
-
Sep 2025 – Present
MIT Media Lab
Multisensory Intelligence Group
Graduate Researcher
- Working on a new research direction in Vision-Language-Action (VLA) models and robotics.
-
Jun 2025 – Sep 2025
MIT Media Lab
Multisensory Intelligence Group
Graduate Visiting Researcher
- Co-developed PAGE-4D, a feedforward 4D perception framework extending VGGT with a dynamics-aware aggregator for static–dynamic disentanglement.
- Introduced mask-guided attention to suppress motion for pose tokens while exploiting dynamics for geometry tokens.
- Applied selective fine-tuning on the middle 10 VGGT layers (~30% parameters), matching full fine-tuning performance with no runtime or memory overhead.
- Achieved state-of-the-art results on Sintel, DyCheck, and TUM benchmarks, improving depth, pose accuracy, and rendering quality (PSNR/SSIM, LPIPS).
-
Jul 2024 – Oct 2024
Imperial College
Department of Mechanical Engineering
Undergraduate Researcher, Full-Stack Developer
- Designed and developed a modern, data-driven web platform called Smart-Forming that enables engineers to discover, evaluate, and share manufacturing knowledge modules.
- Focused on intuitive UX, modular architecture, and seamless integration of metadata analytics (heatmap, word cloud supported by Python and MATLAB) to support industrial R&D.
-
Jan 2022 – Dec 2022
UESTC
School of Computer Science and Engineering
High School Researcher
- Research on Open World Object Detection for classifying known and unknown objects.
- Improved a Detectron2-based model ORE for incremental object detection using contrastive clustering and auto-labeling RPN.