Jieneng Chen
I'm a final-year Ph.D. candidate in Computer Science at Johns Hopkins University, advised by Dist. Prof. Alan L. Yuille and co-advised by Prof. Daniel Khashabi and Dist. Prof. Rama Chellappa.
I was named a Siebel Scholar (Class of 2025).
My Ph.D. work on neural architectures has become influential, with over 14,000 citations.
I am now interested in developing generative world models to push the frontier of AI+X across embodied AI, healthcare, and science.
I am on the job market for 2025! I would love to chat if you are interested, and I am happy to give talks on my research at related seminars.
|
|
Full list on my Google Scholar profile (over 14,000 citations). ☆ denotes visiting undergraduate / graduate mentees.
|
GenEx: Generating an Explorable World.
TaiMing Lu ☆,
Tianmin Shu,
Alan Yuille,
Daniel Khashabi,
Jieneng Chen.
ICLR, 2025
Turn a single image into a 3D world adventure.
Embodied agents refine their beliefs by predicting unseen parts of the physical world.
Paper (OpenReview) |
Blog |
Project Website
|
|
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning.
Yijun Yang ☆,
Zhao-Yang Wang,
Qiuping Liu,
Shuwen Sun,
Kang Wang,
Rama Chellappa,
Zongwei Zhou,
Alan Yuille,
Lei Zhu,
Yu-Dong Zhang,
Jieneng Chen.
Technical Report, 2025.
Envision precision medicine via generative world modeling.
Paper | Code |
Project
|
|
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models.
Wufei Ma,
Luoxin Ye ☆,
Nessa McWeeney,
Celso Miguel de Melo,
Jieneng Chen,
Alan Yuille.
CVPR (Highlight), 2025.
arXiv
|
|
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models.
Xingrui Wang,
Wufei Ma,
Tiezheng Zhang,
Celso Miguel de Melo,
Jieneng Chen†,
Alan Yuille†.
CVPR (Highlight), 2025.
Paper | Code | HuggingFace Data Card
|
|
LLaVolta: Efficient Large Multi-modal Models via Visual Context Compression.
Jieneng Chen,
Luoxin Ye,
Ju He,
Zhaoyang Wang,
Daniel Khashabi,
Alan Yuille.
NeurIPS, 2024.
Paper |
Code |
Project
|
|
ViTamin: Designing Scalable Vision Models in the Vision-Language Era.
Jieneng Chen,
Qihang Yu,
Xiaohui Shen,
Alan Yuille,
Liang-Chieh Chen.
CVPR, 2024.
The first vision-centric design for an LMM encoder, with state-of-the-art performance on 60+ multimodal tasks in 2024.
Paper |
Code |
🤗 HuggingFace | timm | open_clip
|
|
TransUNet: Rethinking the U-Net Architecture Design for Medical Image Segmentation through the Lens of Transformers.
Jieneng Chen,
Jieru Mei,
Xianhang Li,
Yongyi Lu,
Qihang Yu,
Qingyue Wei,
Xiangde Luo,
Yutong Xie,
Ehsan Adeli,
Yan Wang,
Matthew P Lungren,
Shaoting Zhang,
Lei Xing,
Le Lu,
Alan Yuille,
Yuyin Zhou.
Medical Image Analysis (MedIA), 2024.
ICML-W 2021 |
Journal |
Code |
A top downloaded article on ScienceDirect, all time 🏆.
Among the 15 most-cited 2021 papers across all AI fields, with 6,000 citations.
|
- Instructor: I designed and taught the undergraduate course Machine Imagination at JHU in 2025.
- Invited reviewer and PC member for CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, AAAI, TPAMI, TMI, MICCAI and CogSci.
- Workshop organizer for CVPR and MICCAI.
- I am fortunate to have mentored very talented undergraduate, master's, and visiting students at JHU.
- Students from the 2024-2025 cohort will pursue top CS PhD programs at institutions including CMU, JHU, Princeton, Northwestern and Oxford.
|