I am currently a doctoral student at the Graduate School of Information Science and Technology (IST), The University of Tokyo, supervised by Prof. Yinqiang Zheng.

Before that, I obtained my M.Sc. degree at the School of Computer Science, Peking University, under the supervision of Prof. Yisong Chen. I worked at Graphics and Interaction Lab (GIL) led by Prof. Guoping Wang on a scalable 3D reconstruction system (i23D) especially for modeling urban scenes from massive UAV imagery.

My research interests are the relevant techniques involved in AR/MR applications, which mainly lie in the scope of computer vision and computer graphics. More specifically, I study the algorithms of 3D reconstruction (photogrammetry), neural rendering, and computational photography. I also have experience in remote sensing, visual localization (VPS), geometry processing, and graph signal processing.

I am able to speak Mandarin (native), English (IELTS 8.0), Spanish (DELE B1), Portugues (CAPLE CIPLE), and struggling with Japanese.

Should you have any questions, please feel free to reach out!

🔥 News

  • 2023.10: 🇯🇵 I start pursuing my Ph.D. degree at UTokyo as a Todai Fellowship student.
  • 2023.06: 🥇 Our team ranks 1st in GAIIC 2023 GigaRendering Challenge.
  • 2023.05: I have passed the defense of my master thesis, titled Semantic Reconstruction of 3D Models for Urban Scenes.
  • 2023.02: 🥇 Our team ranks 1st in GigaReconstruction Challenge of GigaVision 2022.
  • 2022.10: I receive Benz Scholarship offered by Daimler Greater China Ltd.
  • 2022.09: 🎉 One paper is accepted by BMVC 2022.
  • 2022.06: 🎉🎉 Two papers are accepted by ECCV 2022.
  • 2022.04: 🎉 One paper is accepted by TVCG.

📃 Publications

🌟 Selected Publications

ECCV 2022
sym

Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives

Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang

[paper] [code]

  • We propose and evaluate a new training paradigm for INRs where numerical derivatives are used to supervise analytical derivatives of INRs in addition to signal values.
  • Approximation performance is improved with a notable margin on tasks such as image regression, inverse rendering and audio regression.
CVPR 2022
sym

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers

Yikang Ding*, Wentao Yuan*, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu

[paper] [code]

  • Transformers are leveraged to extract deep features within and across images and capture global contexts.
  • TransMVSNet ranks 1st on all MVS datasets, including DTU, BlendedMVS and Tanks and Temples.

📁 Full List

2022

  • Bidirectional Hybrid LSTM Based Recurrent Neural Network for Multi-view Stereo
    Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang
    TVCG [paper]

  • TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers
    Yikang Ding*, Wentao Yuan*, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu
    CVPR 2022 [paper] [code]

  • Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives
    Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang
    ECCV 2022 [paper] [code]

  • KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo
    Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang
    ECCV 2022 [paper] [code]

  • Hybrid Cost Volume Regularization for Memory-efficient Multi-view Stereo Networks
    Qingtian Zhu, Zizhuang Wei, Zhongtao Wang, Yisong Chen, Guoping Wang
    BMVC 2022 [paper]

2021

  • AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network
    Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang
    ICCV 2021 [paper] [code]

  • Deep Learning for Multi-view Stereo via Plane Sweep: A Survey
    Qingtian Zhu, Chen Min, Zizhuang Wei, Yisong Chen, Guoping Wang
    [paper]

2019

  • Efficient Multi-class Semantic Segmentation of High Resolution Aerial Imagery with Dilated LinkNet
    Qingtian Zhu, Yumin Zheng, Yulai Jiang, Junli Yang
    IGARSS 2019 (Oral) [paper] [code]

🏆 Awards

  • 2023.09: The University of Tokyo Fellowship (Todai Fellowship)
  • 2023.06: Winner in GAIIC 2023 GigaRendering Challenge [report]
  • 2023.02: Winner in GigaReconstruction Challenge of GigaVision 2022 [report]
  • 2023.02: Runner-up in GigaRendering Challenge of GigaVision 2022 [report]
  • 2022.10: Benz Scholarship
  • 2021.10: Winner in Indoor and Outdoor Visual Localization Challenge of ICCV 2021 Workshop on Long-term Visual Localization under Changing Conditions [website] [talk] [report]
  • 2021.09: PKU Special Academic Scholarship
  • 2020.06: Outstanding Graduate and Outstanding Thesis
  • 2019.06: Winner in Visual Localization Challenge of CVPR 2019 Workshop on Long-term Visual Localization under Changing Conditions [website]
  • 2017.11: National Scholarship

🎓 Educations

  • 2023.10 - now: Ph.D. in Mechano-Informatics, The University of Tokyo
  • 2020.09 - 2023.07: M.Sc. in Computer Software and Theory, Peking University
  • 2016.09 - 2020.06: B.Eng. in Internet of Things Engineering, Beijing University of Posts and Telecommunications
  • 2016.09 - 2020.06: B.S.E. (First Class Honour) in Internet of Things Engineering, Queen Mary University of London

💻 Internships

  • 2023.05 - now: XREAL (Nreal), supervised by Zhuyu Yao and Jiawang Zhang
  • 2022.02 - 2022.07: XR Lab, Alibaba DAMO Academy, supervised by Lingzhi Li and Li Shen
  • 2021.08 - 2022.02: MEGVII Research, supervised by Ran Yan and Xiao Liu
  • 2019.03 - 2019.06: NLPR, CASIA, supervised by Shuhan Shen

👨‍🏫 Teaching

  • Fall 2021: T.A. of Image- and Video-based 3D Reconstruction for graduate students, Peking University
  • Spring 2021: T.A. of Digital Image Processing for undergraduate students, Peking University