I am currently a doctoral student at the Graduate School of Information Science and Technology (IST), The University of Tokyo, with Prof. Yinqiang Zheng.

Prior to this, I obtained my M.Sc. degree at the School of Computer Science, Peking University, under the supervision of Prof. Yisong Chen. I worked at Graphics and Interaction Lab (GIL) led by Prof. Guoping Wang on a scalable 3D reconstruction system (i23D) especially for modeling urban scenes from massive UAV imagery.

My current research interest is to achieve generalized reconstruction of various types of data by leveraging explicit, implicit, or hybrid representations. More specifically, I study the algorithms of 3D reconstruction (photogrammetry), neural rendering, implicit representations, and computational photography. I also have experience in remote sensing, visual localization (VPS), geometry processing, and graph signal processing.

I am able to speak Chinese Mandarin (native), English (IELTS 8.0), Spanish (DELE B1), Portuguese (CAPLE CIPLE), and Japanese (JLPT N2).

Should you have any questions, please feel free to reach out!

🔥 News

2025.06: 🎉🎉 Two papers are accepted by ICCV 2025.
2025.06: 🥇 Our team ranks 1^st in ScanNet++ Novel View Synthesis Challenge of CVPR 2025.
2025.05: 🎉 One paper is accepted by ICML 2025.
2024.09: 🥇 Our team ranks 1^st in ECCV 2024 GigaRendering Challenge.
2024.08: 🎉 One paper is accepted by ECCV 2024 as an oral presentation. See you at Milan.

See Previous

2024.05: 🥇 Our team ranks 1^st in ScanNet++ Novel View Synthesis Challenge of CVPR 2024.
2023.10: 🇯🇵 I start pursuing my Ph.D. degree at UTokyo as a Todai Fellowship student.
2023.06: 🥇 Our team ranks 1^st in GAIIC 2023 GigaRendering Challenge.
2023.05: I have passed the defense of my master thesis, titled Semantic Reconstruction of 3D Models for Urban Scenes.
2023.02: 🥇 Our team ranks 1^st in GigaReconstruction Challenge of GigaVision 2022.
2022.10: I receive Benz Scholarship offered by Daimler Greater China Ltd.
2022.09: 🎉 One paper is accepted by BMVC 2022.
2022.06: 🎉🎉 Two papers are accepted by ECCV 2022.
2022.04: 🎉 One paper is accepted by TVCG.

📃 Publications

^* = equal contributions

🌟 Selected Publications

ICML 2025

SUICA: Learning Super-high Dimensional Implicit Neural Representations for Spatial Transcriptomics
Qingtian Zhu^*, Yumin Zheng^*, Yuling Sang, Yifan Zhan, Ziyan Zhu, Jun Ding, Yinqiang Zheng
[arxiv] [paper] [code]

We employ INRs to model ST in a continuous way, making it possible to sample at arbitrary locations.
SUICA achieves restored authenticity and enriched bio-conservation with Stereo-seq, 10x Visium, Slide-seqV2, and MERFISH.

ECCV 2024 Oral

RPBG: Towards Robust Neural Point-based Graphics in the Wild
Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng
[arxiv] [paper] [code]

We propose a robust alternative for NVS that performs neural re-rendering of triangulated points from posed images.
It stably outperforms radiance-field-based methods especially in convergence time, perceptual quality, and generalizability across diverse scene types.

📁 Full List

2025

Adversarial Attacks on Event-based Pedestrian Detectors: A Physical Approach
Guixu Lin, Muyao Niu, Qingtian Zhu, Zhuoxiao Li, Zhengwei Yin, Shengfeng He, Yinqiang Zheng
AAAI 2025 [arxiv] [paper]
SUICA: Learning Super-high Dimensional Sparse Implicit Neural Representations for Spatial Transcriptomics
Qingtian Zhu^*, Yumin Zheng^*, Yuling Sang, Yifan Zhan, Ziyan Zhu, Jun Ding, Yinqiang Zheng
ICML 2025 [arxiv] [paper] [code]
ToMiE: Towards Explicit Exoskeleton for the Reconstruction of Complicated 3D Human Avatars
Yifan Zhan, Qingtian Zhu, Muyao Niu, Mingze Ma, Jiancheng Zhao, Zhihang Zhong, Xiao Sun, Yu Qiao, Yinqiang Zheng
ICCV 2025 [arxiv] [code]
Tree-NeRV: Efficient Non-Uniform Sampling for Neural Video Representation via Tree-Structured Feature Grids
Jiancheng Zhao, Yifan Zhan, Qingtian Zhu, Mingze Ma, Muyao Niu, Zunian Wan, Xiang Ji, Yinqiang Zheng
ICCV 2025 [arxiv]
R3-Avatar: Record and Retrieve Canonical Temporal Codebook for Reconstructing Photorealistic Human Avatars
Yifan Zhan, Wangze Xu, Qingtian Zhu, Muyao Niu, Mingze Ma, Yifei Liu, Zhihang Zhong, Xiao Sun, Yinqiang Zheng
[arxiv] [code]
Robustifying Fourier Features Embeddings for Implicit Neural Representations
Mingze Ma, Qingtian Zhu, Yifan Zhan, Zhengwei Yin, Hongjun Wang, Jiancheng Zhao, Yinqiang Zheng
[arxiv]
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models
Muyao Niu, Mingdeng Cao, Yifan Zhan, Qingtian Zhu, Mingze Ma, Jiancheng Zhao, Yanhong Zeng, Zhihang Zhong, Xiao Sun, Yinqiang Zheng
[arxiv] [code]

2024

RPBG: Towards Robust Neural Point-based Graphics in the Wild
Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng
ECCV 2024 (Oral) [arxiv] [paper] [code]
Learning-based Multi-view Stereo: A Survey
Fangjinhua Wang^*, Qingtian Zhu^*, Di Chang^*, Quankai Gao, Junlin Han, Tong Zhang, Richard Hartley, Marc Pollefeys
[arxiv]
Bundle Adjusted Gaussian Avatars Deblurring
Muyao Niu, Yifan Zhan, Qingtian Zhu, Zhuoxiao Li, Wei Wang, Zhihang Zhong, Xiao Sun, Yinqiang Zheng
[arxiv] [code]

2022

Bidirectional Hybrid LSTM Based Recurrent Neural Network for Multi-view Stereo
Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang
TVCG [paper]
TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers
Yikang Ding^*, Wentao Yuan^*, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu
CVPR 2022 [arxiv] [paper] [code]
Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives
Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang
ECCV 2022 [arxiv] [paper] [code]
KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo
Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang
ECCV 2022 [arxiv] [paper] [code]
Hybrid Cost Volume Regularization for Memory-efficient Multi-view Stereo Networks
Qingtian Zhu, Zizhuang Wei, Zhongtao Wang, Yisong Chen, Guoping Wang
BMVC 2022 [paper]

2021

MegLoc: A Robust and Accurate Visual Localization Pipeline
Shuxue Peng^*, Zihang He^*, Haotian Zhang^*, Ran Yan^*, Chuting Wang^*, Qingtian Zhu^*, Xiao Liu
[arxiv]
AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network
Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang
ICCV 2021 [arxiv] [paper] [code]
Deep Learning for Multi-view Stereo via Plane Sweep: A Survey
Qingtian Zhu, Chen Min, Zizhuang Wei, Yisong Chen, Guoping Wang
[arxiv]

2019

Efficient Multi-class Semantic Segmentation of High Resolution Aerial Imagery with Dilated LinkNet
Qingtian Zhu, Yumin Zheng, Yulai Jiang, Junli Yang
IGARSS 2019 (Oral) [paper] [code]

🏆 Awards

2025.06: Winner in Novel View Synthesis Challenge of ScanNet++ CVPR 2025 Workshop [website][video]
2024.09: Winner in ECCV 2024 GigaRendering Challenge
2024.05: Winner in Novel View Synthesis Challenge of ScanNet++ CVPR 2024 Workshop [website]
2023.09: The University of Tokyo Fellowship (Todai Fellowship)
2023.06: Winner in GAIIC 2023 GigaRendering Challenge [report]
2023.02: Winner in GigaReconstruction Challenge of GigaVision 2022 [report]
2023.02: Runner-up in GigaRendering Challenge of GigaVision 2022 [report]
2022.10: Benz Scholarship
2021.10: Winner in Indoor and Outdoor Visual Localization Challenge of ICCV 2021 Workshop on Long-term Visual Localization under Changing Conditions [website] [video] [report]
2021.09: PKU Special Academic Scholarship
2020.06: Outstanding Graduate and Outstanding Thesis
2019.06: Winner in Visual Localization Challenge of CVPR 2019 Workshop on Long-term Visual Localization under Changing Conditions [website]
2017.11: National Scholarship

🎓 Educations

2023.10 - present: Ph.D. in Mechano-Informatics, The University of Tokyo
2020.09 - 2023.07: M.Sc. in Computer Software and Theory, Peking University
2016.09 - 2020.06: B.Eng. in Internet of Things Engineering, Beijing University of Posts and Telecommunications
2016.09 - 2020.06: B.S.E. (First Class Honour) in Internet of Things Engineering, Queen Mary University of London

💻 Internships

2024.12 - present: CyberAgent AI Lab, supervised by Xu Cao and Takafumi Taketomi
2023.05 - 2024.03: XREAL (Nreal), supervised by Zhuyu Yao, Jiawang Zhang, and Kejian Wu
2022.02 - 2022.07: XR Lab, Alibaba DAMO Academy, supervised by Lingzhi Li and Li Shen
2021.08 - 2022.02: MEGVII Research, supervised by Ran Yan, Haotian Zhang, and Xiao Liu
2019.03 - 2019.06: NLPR, CASIA, supervised by Shuhan Shen