I am currently a doctoral student at the Graduate School of Information Science and Technology (IST), The University of Tokyo, supervised by Prof. Yinqiang Zheng.
Before that, I obtained my M.Sc. degree at the School of Computer Science, Peking University, under the supervision of Prof. Yisong Chen. I worked at Graphics and Interaction Lab (GIL) led by Prof. Guoping Wang on a scalable 3D reconstruction system (i23D) especially for modeling urban scenes from massive UAV imagery.
My research interests are the relevant techniques involved in AR/MR applications, which mainly lie in the scope of computer vision and computer graphics. More specifically, I study the algorithms of 3D reconstruction (photogrammetry), neural rendering, and computational photography. I also have experience in remote sensing, visual localization (VPS), geometry processing, and graph signal processing.
I am able to speak Mandarin (native), English (IELTS 8.0), Spanish (DELE B1), Portugues (CAPLE CIPLE), and struggling with Japanese.
Should you have any questions, please feel free to reach out!
🔥 News
- 2023.10: 🇯🇵 I start pursuing my Ph.D. degree at UTokyo as a Todai Fellowship student.
- 2023.06: 🥇 Our team ranks 1st in GAIIC 2023 GigaRendering Challenge.
- 2023.05: I have passed the defense of my master thesis, titled Semantic Reconstruction of 3D Models for Urban Scenes.
- 2023.02: 🥇 Our team ranks 1st in GigaReconstruction Challenge of GigaVision 2022.
- 2022.10: I receive Benz Scholarship offered by Daimler Greater China Ltd.
- 2022.09: 🎉 One paper is accepted by BMVC 2022.
- 2022.06: 🎉🎉 Two papers are accepted by ECCV 2022.
- 2022.04: 🎉 One paper is accepted by TVCG.
📃 Publications
🌟 Selected Publications
Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives
Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang
- We propose and evaluate a new training paradigm for INRs where numerical derivatives are used to supervise analytical derivatives of INRs in addition to signal values.
- Approximation performance is improved with a notable margin on tasks such as image regression, inverse rendering and audio regression.
TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers
Yikang Ding*, Wentao Yuan*, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu
- Transformers are leveraged to extract deep features within and across images and capture global contexts.
- TransMVSNet ranks 1st on all MVS datasets, including DTU, BlendedMVS and Tanks and Temples.
📁 Full List
2022
-
Bidirectional Hybrid LSTM Based Recurrent Neural Network for Multi-view Stereo
Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang
TVCG [paper] -
TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers
Yikang Ding*, Wentao Yuan*, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu
CVPR 2022 [paper] [code] -
Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives
Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang
ECCV 2022 [paper] [code] -
KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo
Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang
ECCV 2022 [paper] [code] -
Hybrid Cost Volume Regularization for Memory-efficient Multi-view Stereo Networks
Qingtian Zhu, Zizhuang Wei, Zhongtao Wang, Yisong Chen, Guoping Wang
BMVC 2022 [paper]
2021
-
AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network
Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang
ICCV 2021 [paper] [code] -
Deep Learning for Multi-view Stereo via Plane Sweep: A Survey
Qingtian Zhu, Chen Min, Zizhuang Wei, Yisong Chen, Guoping Wang
[paper]
2019
- Efficient Multi-class Semantic Segmentation of High Resolution Aerial Imagery with Dilated LinkNet
Qingtian Zhu, Yumin Zheng, Yulai Jiang, Junli Yang
IGARSS 2019 (Oral) [paper] [code]
🏆 Awards
- 2023.09: The University of Tokyo Fellowship (Todai Fellowship)
- 2023.06: Winner in GAIIC 2023 GigaRendering Challenge [report]
- 2023.02: Winner in GigaReconstruction Challenge of GigaVision 2022 [report]
- 2023.02: Runner-up in GigaRendering Challenge of GigaVision 2022 [report]
- 2022.10: Benz Scholarship
- 2021.10: Winner in Indoor and Outdoor Visual Localization Challenge of ICCV 2021 Workshop on Long-term Visual Localization under Changing Conditions [website] [talk] [report]
- 2021.09: PKU Special Academic Scholarship
- 2020.06: Outstanding Graduate and Outstanding Thesis
- 2019.06: Winner in Visual Localization Challenge of CVPR 2019 Workshop on Long-term Visual Localization under Changing Conditions [website]
- 2017.11: National Scholarship
🎓 Educations
- 2023.10 - now: Ph.D. in Mechano-Informatics, The University of Tokyo
- 2020.09 - 2023.07: M.Sc. in Computer Software and Theory, Peking University
- 2016.09 - 2020.06: B.Eng. in Internet of Things Engineering, Beijing University of Posts and Telecommunications
- 2016.09 - 2020.06: B.S.E. (First Class Honour) in Internet of Things Engineering, Queen Mary University of London
💻 Internships
- 2023.05 - now: XREAL (Nreal), supervised by Zhuyu Yao and Jiawang Zhang
- 2022.02 - 2022.07: XR Lab, Alibaba DAMO Academy, supervised by Lingzhi Li and Li Shen
- 2021.08 - 2022.02: MEGVII Research, supervised by Ran Yan and Xiao Liu
- 2019.03 - 2019.06: NLPR, CASIA, supervised by Shuhan Shen
👨🏫 Teaching
- Fall 2021: T.A. of Image- and Video-based 3D Reconstruction for graduate students, Peking University
- Spring 2021: T.A. of Digital Image Processing for undergraduate students, Peking University