I am a recent PhD graduate from the School of Computing at the Australian National University, supervised by Prof. Hongdong Li. I worked on a wide range of topics in computer vision including:

  • 3D pose tracking and extended reality (with Pan Ji at Tencent XR Vision Labs)
  • Personalized skinned avatar reconstruction and animation (with Fabian Prada and Jason Saragih at Meta Codec Avatars Lab)
  • 3D asset generation and orbital video creation (with Pulak Purkait at Amazon IML)
  • Motion transfer for stylized characters (with Takashi Shibuya and Kengo Uchida at Sony AI).

Prior to my PhD, I obtained my Bachelor’s degree in the College of Engineering and Computer Science at the Australian National University, and was awarded with University Medal and Erin Brent Computer Science Prize. I was also awarded with First Class Honours under the supervision of Dongxu Li and Hongdong Li.

Please also find more details in my CV.

πŸ’» Work Experience

  • 2022.07 - 2023.04, XR Vision Labs, Tencent, Research Intern.
  • 2024.06 - 2024.11, Codec Avatars Lab, Meta, Research Intern.
  • 2025.01 - 2025.07, International Machine Learning, Amazon, Research Intern.
  • 2025.07 - 2025.11, Music Foundation Modeling Team, Sony AI, Research Intern.

πŸ“ Publications

ECCV 2026
sym

GKDT: General Keypoint Detection Transformer

Changsheng Lu, Yuxin Chen, Haokun GUI, Rong Wang, Jie Yang, Harry Yang, Anton van den Hengel, Jiaya Jia

  • We present MegaKPT, a large-scale unified keypoint dataset, and GKDT, a flexible DINOv3-based Transformer for general keypoint detection, supporting visual and text prompts to detect keypoints across broad seen and unseen object categories.
CVPR 2026
sym

Towards Realistic and Consistent Orbital Video Generation via 3D Foundation Priors

Rong Wang, Ruyi Zha, Ziang Cheng, Jiayu Yang, Pulak Purkait, Hongdong Li

Video

  • We present a novel method for generating geometrically realistic and consistent orbital videos from a single image, leveraging rich shape priors from a 3D foundation model to improve structural coherence, multi-view consistency, and generalization to complex camera trajectories.
3DV 2026
sym

Learning High-Fidelity Garment Deformation via Skinning-Free Image Transfer

Rong Wang, Wei Mao, Changsheng Lu, Hongdong Li

  • We present a skinning-free image transfer framework for high-fidelity 3D garment deformation, decoupling low-frequency posed shapes and high-frequency wrinkle details to generate realistic animation across garments with diverse topologies.
CVPR 2025
sym

FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images

Rong Wang, Fabian Prada, Ziyan Wang, Zhongshi Jiang, Chengxiang Yin, Junxuan Li, Shunsuke Saito, Igor Santesteban, Javier Romero, Rohan Joshi, Hongdong Li, Jason Saragih, Yaser Sheikh

GitHub, Project

  • We learn a universal prior from over a thousand clothed humans to achieve instant feedforward generation and zero-shot generalization, aiming for reconstructing personalized 3D human avatars with realistic animation from only a few images.
ECCV 2024
sym

Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

Rong Wang, Wei Mao, Changsheng Lu, Hongdong Li

GitHub

  • We present a novel method aiming for high-quality motion transfer with realistic apparel animation, and also build a new dataset named MMDMC combining stylized characters from the MikuMikuDance community with real-world Motion Capture data.
NeurIPS 2023
sym

DeepSimHO: Stable Pose Estimation for Hand-Object Interaction via Physics Simulation

Rong Wang, Wei Mao, Hongdong Li

GitHub

  • We present DeepSimHO: a novel deep-learning pipeline that combines forward physics simulation and backward gradient approximation with a neural network.
WACV 2023
sym

Interacting hand-object pose estimation via dense mutual attention

Rong Wang, Wei Mao, Hongdong Li

GitHub

  • We propose a novel dense mutual attention mechanism that is able to model fine-grained dependencies between the hand and the object.
AAAI 2021
sym

Transcribing natural languages for the deaf via neural editing programs

Dongxu Li, Chenchen Xu, Liu Liu, Yiran Zhong, Rong Wang, Lars Petersson, Hongdong Li

  • We design a new neural agent that learns to synthesize and execute editing programs, conditioned on sentence contexts and partial editing results.
WSDM 2021
sym

AttentionFlow: Visualising Influence in Networks of Time Series

Minjeong Shin, Alasdair Tran, Siqi Wu, Alexander Mathews, Rong Wang, Georgiana Lyall, Lexing Xie

GitHub

  • We present AttentionFlow, a new system to visualise networks of time series and the dynamic influence they have on one another.

πŸŽ– Honors and Awards

  • 2018, ANU Excellence Scholarship.
  • 2019 – 2021, ANU Summer Research Scholarship.
  • 2019 – 2021, Terrell International Undergraduate Scholarships.
  • 2021, University Medal.
  • 2021, Erin Brent Computer Science Prize.
  • 2022, Australian Government Research Training Program International Scholarship.
  • 2026, Research Award for Doctoral Thesis.

πŸ“– Educations

  • 2018.02 - 2021.12, Bachelor of Advanced Computing (Research & Development).
  • 2022.02 - 2026.04, Doctor of Philosophy, School of Computing.