Jieneng Chen

I'm a final-year Ph.D. candidate in Computer Science at Johns Hopkins University, advised by Dist. Prof. Alan L. Yuille. I am co-advised by Prof. Daniel Khashabi and Dist. Prof. Rama Chellappa. I am awarded as a Siebel Scholar Class 2025.

My PhD work on neural architectures has become influential, with over 14,000 citations.

I am now interested in developing generative world models to push the frontier of AI+X across embodied AI, healthcare, and science.

I am on the job market for 2025! Would love to chat more if you are interested. I am very happy to give talks on my research in related seminars.


               

profile photo
News
Recent Projects

Full list on Google Scholar Profile (over 14,000 citations). ☆ denotes visiting undergraduate / graduate mentees.

GenEx: Generating an Explorable World.

TaiMing Lu ☆, Tianmin Shu, Alan Yuille, Daniel Khashabi, Jieneng Chen.

ICLR, 2025

Turn a single image into a 3D world adventure.

Embodied agents refine their beliefs by predicting unseen parts of the physical world.

Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning.

Yijun Yang ☆, Zhao-Yang Wang, Qiuping Liu, Shuwen Sun, Kang Wang, Rama Chellappa, Zongwei Zhou, Alan Yuille, Lei Zhu, Yu-Dong Zhang, Jieneng Chen.

Technical Report, 2025.

Envision precision medicine via generative world modeling.

Paper | Code | Project
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models.
Wufei Ma, Luoxin Ye ☆, Nessa McWeeney, Celso Miguel de Melo, Jieneng Chen, Alan Yuille.

CVPR, Highlight, 2025.
arXiv
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models.
Xingrui Wang, Wufei Ma, Tiezheng Zhang, Celso Miguel de Melo, Jieneng Chen†, Alan Yuille†.

CVPR, Highlight, 2025.
Paper | Code | HuggingFace Data Card
LLaVolta: Efficient Large Multi-modal Models via Visual Context Compression.
Jieneng Chen, Luoxin Ye, Ju He, Zhaoyang Wang, Daniel Khashabi, Alan Yuille.

NeurIPS, 2024.
Paper | Code | Project
ViTamin: Designing Scalable Vision Models in the Vision-Language Era.
Jieneng Chen, Qihang Yu, Xiaohui Shen, Alan Yuille, Liang-Chieh Chen.

CVPR, 2024.
The first vision-centric design for LMM encoder, with SoTA performance on 60+ multimodal tasks in 2024.
Paper | Code | 🤗 HuggingFace | timm GitHub Stars Badge | open_clip GitHub Stars Badge
TransUNet: Rethinking the U-Net Architecture Design for Medical Image Segmentation through the Lens of Transformers.
Jieneng Chen, Jieru Mei, Xianhang Li, Yongyi Lu, Qihang Yu, Qingyue Wei, Xiangde Luo, Yutong Xie, Ehsan Adeli, Yan Wang, Matthew P Lungren, Shaoting Zhang, Lei Xing, Le Lu, Alan Yuille, Yuyin Zhou.

Medical Image Analysis (MedIA), 2024.

ICML-W 2021 | Journal | Code | GitHub Stars Badge
Top ScienceDirect downloaded article 🏆, published all time.
Top 15 cited 2021 paper in all AI fields, 6000 citations.
Teaching
  • Instructor: I designed and taught the undergraduate course Machine Imagination at JHU in 2025.
Service
  • Invited reviewer and PC member for CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, AAAI, TPAMI, TMI, MICCAI and CogSci.
  • Workshop organizer for CVPR and MICCAI.
Mentoring
  • I am fortunate to have mentored super talented undergraduate, master and visiting students at JHU.
  • Students from the 2024-2025 cohort will pursue top CS PhD programs at institutions including CMU, JHU, Princeton, Northwestern and Oxford.