About Me
I'm currently a second-year PhD student at Dartmouth College, advised by Prof. Yu-Wing Tai. My current research interests lie in multimodal generative models, efficient diffusion models, and multimodal agentic systems.
I received my B.S. in Data Science and Technology from the Hong Kong University of Science and Technology. During my undergraduate studies, I was fortunate to work with Prof. Xiaomeng Li, Prof. Yu-Wing Tai, and Prof. Chi Keung Tang, and I also had the chance to be an exchange student at EPFL.
I'm also a music producer and a photographer.
Selected Publications
Proposed a framework that decouples global and local modeling for ultra-high-resolution long video generation. Uses temporally scaled RoPE for global semantic proxy and hierarchical locality-preserving attention for high-res details, achieving 60.9× speedup over native 4K generators with resolution-agnostic training.
Developed an efficient, scalable ultra-high resolution (8K) image generation framework. Eliminates the need for hi-res training data, achieving >10× inference speedup and significantly lower memory usage compared to FLUX baselines.
Developed an efficient, scalable ultra-high resolution (4K) image editing framework without any specialized high-resolution training data. Achieves >5× faster inference speed and enables ultra-high-res editing that other models failed.
Proposed an automated framework using LLMs for structured text-to-image generation and editing. Introduced ChainArchitect for CoT-based 3D-aware layout generation and Object-Integration Network (OIN) for seamless and efficient subject-driven inpainting.
Education
PhD Student in Computer Science · Hanover, NH, USA
Advisor: Prof. Yu-Wing Tai
B.Sc. in Data Science and Technology, First Class Honor (Top 10%) · Hong Kong, China
Advisors: Prof. Xiaomeng Li, Prof. Yu-Wing Tai, Prof. Chi-Keung Tang
Regular Term Exchange · Lausanne, Switzerland
Experience & Teaching
- COSC 89/189 (Generative AI), ENGS 106 (Machine Learning), COSC 83/183 (Computer Vision)