Hi there! I am a postdoc at UC Berkeley working with Prof. Masayoshi Tomizuka, and I was a visiting scholar at MIT working with Prof. Joshua Tenenbaum. Before that, I received my PhD from the University of Hong Kong, advised by Prof. Ping Luo, and my B.S. from Renmin University of China under the supervision of Prof. Zhiwu Lu. My research interests lie at the intersection of vision, embodied AI, and robotics. I am open to research discussions and collaborations; feel free to get in touch!
myding at berkeley dot edu [Google Scholar]
Research Highlights
My long-term research goal is to build machines that can jointly perceive, understand, and reason about the physical world.
- General-purpose Models: vision-language foundation models, self-supervised learning
- Common Sense Reasoning: embodied AI, physical simulation, robot learning
CVPR20: Depth-Guided 3D Det
NeurIPS21: Visual Reasoning with DiffPhysics
ICML22: CtrlFormer
NeurIPS22: ComPhy Benchmark
CoRL22: Embodied CL
CogSci23: Physion++ Benchmark
CVPR23: EC for Embodied Control
ICML23: AdaptDiffuser
ECCV22: DaViT
EmbodiedGPT
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
- Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
CVPR 2023
[paper] [code]
- Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following
- Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
CoRL 2022
[paper] [code] [project]
- DaViT: Dual Attention Vision Transformers
- Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan
ECCV 2022
[paper] [code]
- Learning Versatile Neural Architectures by Propagating Network Codes
- Mingyu Ding, Yuqi Huo, Haoyu Lu, Linjie Yang, Zhe Wang, Zhiwu Lu, Jingdong Wang, Ping Luo
ICLR 2022
[paper] [code] [project]
- Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
- Mingyu Ding, Zhenfang Chen, Tao Du, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
NeurIPS 2021
[paper] [code] [project]
- HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers
- Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo
CVPR 2021 (Oral)
[paper] [code]
- Learning Depth-Guided Convolutions for Monocular 3D Object Detection
- Mingyu Ding, Yuqi Huo, Hongwei Yi, Zhe Wang, Jianping Shi, Zhiwu Lu, Ping Luo
CVPR 2020
[paper] [code]
- Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow
- Mingyu Ding, Zhe Wang, Bolei Zhou, Jianping Shi, Zhiwu Lu, Ping Luo
AAAI 2020
[paper]
- CamNet: Coarse-to-Fine Retrieval for Camera Re-localization
- Mingyu Ding, Zhe Wang, Jiankai Sun, Jianping Shi, Ping Luo
ICCV 2019
[paper] [code]
Selected Honors
Activities
- Conference Reviewer for ICML, ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI, IROS, WACV, ACCV, ACMMM, IV
- Journal Reviewer for TPAMI, IJCV, TIP, TCSVT, TMM, TOMM, TITS, TIV, RA-L, ACM CSUR, Neurocomputing