Full list on Google Scholar Profile. ☆ denotes visiting undergraduate / graduate mentees.
GenEx: Generating an Explorable World.
TaiMing Lu☆, Tianmin Shu, Alan Yuille, Daniel Khashabi, Jieneng Chen.
ICLR, 2025.
Turn a single image into a 3D world adventure.
Embodied agents refine their beliefs by predicting unseen parts of the physical world.
Paper (OpenReview) | Blog | Project Website
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning.
Yijun Yang☆, Zhao-Yang Wang, Qiuping Liu, Shuwen Sun, Kang Wang, Rama Chellappa, Zongwei Zhou, Alan Yuille, Lei Zhu, Yu-Dong Zhang, Jieneng Chen.
ICCV, 2025.
Envision precision medicine via generative world modeling.
Paper | Code | Project
4D-Animal: Freely Reconstructing Animatable 3D Animals from Videos.
Shanshan Zhong☆, Jiawei Peng, Zehan Zheng, Zhongzhan Huang, Wufei Ma, Guofeng Zhang, Qihao Liu, Alan Yuille, Jieneng Chen.
Technical report, 2025.
Paper | Code
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models.
Tiezheng Zhang, Yitong Li, Yu-Cheng Chou, Jieneng Chen, Alan Yuille, Chen Wei, Junfei Xiao.
Technical report, 2025.
Paper | Project | Code | HuggingFace Data Card
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models.
Xingrui Wang, Wufei Ma, Tiezheng Zhang, Celso Miguel de Melo, Jieneng Chen†, Alan Yuille†.
CVPR, Highlight, 2025.
Paper | Code | HuggingFace Data Card
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models.
Wufei Ma, Luoxin Ye☆, Nessa McWeeney, Celso Miguel de Melo, Jieneng Chen, Alan Yuille.
CVPR, Highlight, 2025.
arXiv
LLaVolta: Efficient Large Multi-modal Models via Visual Context Compression.
Jieneng Chen, Luoxin Ye, Ju He, Zhaoyang Wang, Daniel Khashabi, Alan Yuille.
NeurIPS, 2024.
Paper | Code | Project
ViTamin: Designing Scalable Vision Models in the Vision-Language Era.
Jieneng Chen, Qihang Yu, Xiaohui Shen, Alan Yuille, Liang-Chieh Chen.
CVPR, 2024.
The first vision-centric design for LMM encoders, with SoTA performance on 60+ multimodal tasks in 2024.
Paper | Code | 🤗 HuggingFace | timm | open_clip
TransUNet: Rethinking the U-Net Architecture Design for Medical Image Segmentation through the Lens of Transformers.
Jieneng Chen, Jieru Mei, Xianhang Li, Yongyi Lu, Qihang Yu, Qingyue Wei, Xiangde Luo, Yutong Xie, Ehsan Adeli, Yan Wang, Matthew P Lungren, Shaoting Zhang, Lei Xing, Le Lu, Alan Yuille, Yuyin Zhou.
Medical Image Analysis (MedIA), 2024.
ICML-W 2021 | Journal | Code
Top downloaded ScienceDirect article of all time 🏆.
Top 15 most-cited 2021 paper across all AI fields, with 6000 citations.
Instructor: I designed and taught the undergraduate course Machine Imagination at JHU in 2025.
Invited reviewer across communities: computer vision (CVPR, ICCV, ECCV, WACV), deep learning (NeurIPS, ICML, ICLR, AAAI, TPAMI), medical AI (TMI, MICCAI), and CogSci.
Workshop co-organizer for CVPR and MICCAI.
I am fortunate to have mentored super talented undergraduate, master's, and visiting students at JHU.
Students from the 2024-2025 cohort will pursue top CS PhD programs at institutions including CMU, JHU, Princeton, Northwestern and Oxford.
TaiMing Lu, a JHU undergraduate on the GenEx project, has received the Michael J. Muuss Research Award and been named a finalist for the CRA Outstanding Undergraduate Researcher Award.