I acknowledge that information asymmetry can significantly hinder research opportunities for junior students. If you’re interested in chatting about life, research, or potential collaborations, feel free to email me.
For undergrad and master's students at Oxford ONLY: If you are interested in an internship or 4YP at TVG working with me and Phil, please email me your current CV to discuss further.
[11.2024] Two papers (3D-GPT and DreamBeast) were accepted to 3DV 2025. Congratulations to Chunyi and Runjia! 3D-GPT is pioneering work in large-scale 3D scene generation, and DreamBeast is a fascinating project generating fantastical 3D animals.
[07.2024] VFusion3D and Unicorns were accepted to ECCV 2024! VFusion3D is the first work exploring scalable 3D generative/reconstruction models as a step towards a 3D foundation model. Check it out if you are interested in 3D generation.
[07.2023] Hyperbolic Audio-visual Zero-shot Learning was accepted to ICCV 2023!
[07.2023] Graduated from ANU with First-Class Honours.
[10.2022] Recognized as a top reviewer at NeurIPS 2022!
[07.2022] Blind Image Decomposition (BID) was accepted to ECCV 2022! BID is a new low-level vision task that better adapts to complex real-world scenarios. Check our project page for more.
[05.2022] You Only Cut Once (YOCO) was accepted to ICML 2022! Check here for our work on how to perform data augmentation.
My research focuses on multimodal foundation models, including multimodal large language models and world generation models, as well as their interactions and integration.
Transactions on Pattern Analysis and Machine Intelligence (TPAMI), International Journal of Computer Vision (IJCV), Transactions on Image Processing (TIP), Transactions on Geoscience and Remote Sensing (TGRS)