Hi there! I am a postdoc at UC Berkeley and was a visiting scholar at MIT, working at the intersection of computer vision, robotics, and machine learning. I received my PhD from the University of Hong Kong advised by Prof. Ping Luo, and my B.S. from Renmin University of China under the supervision of Prof. Zhiwu Lu. During my studies, I spent wonderful times collaborating with industry labs, e.g. Google, Microsoft, IBM, TikTok, and Baidu. I am open to research discussions and collaborations, please feel free to get in touch!
myding at berkeley dot edu [Google Scholar]
Research Highlights
My long-term research goal is to build machines that can jointly perceive, understand, and reason about the physical world.
- General-purpose Models: visual perception, foundation models, self-supervised learning
- Common Sense Reasoning: embodied AI, physical simulation, robot learning
CVPR20: Depth-Guided 3D Detection
ICRA20: SegVoxelNet
NeurIPS21: Reasoning with DiffPhysics
CoRL22: Embodied Concept Learner
NeurIPS22: ComPhy Benchmark
ICML22: CtrlFormer
- Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following
- Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
CoRL 2022
[paper]
- DaViT: Dual Attention Vision Transformers
- Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan
ECCV 2022
[paper] [code]
- Learning Versatile Neural Architectures by Propagating Network Codes
- Mingyu Ding, Yuqi Huo, Haoyu Lu, Linjie Yang, Zhe Wang, Zhiwu Lu, Jingdong Wang, Ping Luo
ICLR 2022
[paper] [code] [project]
- Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
- Mingyu Ding, Zhenfang Chen, Tao Du, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
NeurIPS 2021
[paper] [code] [project]
- HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers
- Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo
CVPR 2021
(Oral) [paper] [code]
- Learning Depth-Guided Convolutions for Monocular 3D Object Detection
- Mingyu Ding, Yuqi Huo, Hongwei Yi, Zhe Wang, Jianping Shi, Zhiwu Lu, Ping Luo
CVPR 2020
[paper] [code]
- Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow
- Mingyu Ding, Zhe Wang, Bolei Zhou, Jianping Shi, Zhiwu Lu, Ping Luo
AAAI 2020
[paper]
- CamNet: Coarse-to-Fine Retrieval for Camera Re-localization
- Mingyu Ding, Zhe Wang, Jiankai Sun, Jianping Shi, Ping Luo
ICCV 2019
[paper] [code]
Activities
- Conference Reviewer for ICML, ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI, IROS, WACV, ACCV, ACMMM, IV
- Journal Reviewer for TPAMI, IJCV, TIP, TCSVT, TMM, TOMM, TITS, TIV, RA-L, Neurocomputing
Selected Honors