Zhu Yu

I'm a final-year Ph.D. student at Zhejiang University (ZJU), supervised by Prof. Hui-Liang Shen and Dr. Si-Yuan Cao. Previously, I earned my bachelor's degree from Zhejiang University in 2021.

My current research interests lie in 3D computer vision, especially 3D geometry reconstruction and generation.

Email / Scholar / GitHub

Experiences

2025.1 - now
Research Intern
Alibaba Tongyi Lab
2024.6 - 2024.12
Research Intern
Alibaba CAINIAO Autonomous Driving Team
  • Hosted by Lizhe Liu.
  • Topic: Open Vocabulary Perception.

Publications

(* equal contribution, † corresponding author)

Large Depth Completion Model from Sparse Observations
Zhu Yu, Zhengyi Zhao, Runmin Zhang, Lingteng Qiu, Kejie Qiu, Yisheng He, Siyu Zhu, Zilong Dong†, Si-Yuan Cao†, Hui-Liang Shen
ICLR, 2026
project page / arXiv / code

Rethinking Unsupervised Cross-modal Flow Estimation: Learning from Decoupled Optimization and Consistency Constraint
Runmin Zhang, Jialiang Wang, Si-Yuan Cao†, Zhu Yu, Junchen Yu, Guangyi Zhang, Hui-Liang Shen
ICLR, 2026
arXiv / code

BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles
Lun Luo, Si-Yuan Cao, Xiaorui Li, Jintao Xu, Rui Ai, Zhu Yu, Xieyuanli Chen†
TRO, 2025
arXiv / code

Language Driven Occupancy Prediction
Zhu Yu*, Bowen Pang*, Lizhe Liu†, Runmin Zhang, Qiang Li, Si-Yuan Cao, Maochun Luo, Mingxia Chen, Sheng Yang, Hui-Liang Shen†
ICCV, 2025
arXiv / code

Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
Runmin Zhang, Zhu Yu†, Si-Yuan Cao†, Lingyu Zhu, Guangyi Zhang, Xiaokai Bai, Hui-Liang Shen
ICCV, 2025
arXiv / code

SGDFormer: One-stage Transformer-based Architecture for Cross-Spectral Stereo Image Guided Denoising
Runmin Zhang, Zhu Yu, Zehua Sheng, Jiacheng Ying†, Si-Yuan Cao, Shu-Jie Chen, Bailin Yang, Junwei Li, Hui-Liang Shen†
Information Fusion, 2025
arXiv / code

VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection
Wuyang Li, Zhu Yu, Alexandre Alahi
NeurIPS, 2025 (Spotlight)
project page / arXiv / code

Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
Zhu Yu, Runmin Zhang, Jiacheng Ying, Junchen Yu, Xiaohai Hu, Lun Luo, Si-Yuan Cao†, Hui-Liang Shen†
NeurIPS, 2024 (Spotlight)
arXiv / code

Aggregating Feature Point Cloud for Depth Completion
Zhu Yu, Zehua Sheng, Zili Zhou, Lun Luo, Si-Yuan Cao†, Hong Gu, Huaqi Zhang, Hui-Liang Shen†
ICCV, 2023
arXiv

Structure Aggregation for Cross-Spectral Stereo Image Guided Denoising
Zehua Sheng, Zhu Yu, Xiongwei Liu, Si-Yuan Cao, Yuqi Liu, Hui-Liang Shen†, Huaqi Zhang
CVPR, 2023
arXiv / code

Services

  • Conference Reviewer: NeurIPS, ICLR, CVPR, ECCV, ICCV, BMVC
  • Journal Reviewer: TIP, RAL

Template from Jon Barron.