I am a Ph.D. candidate in Computer Science at The Chinese University of Hong Kong, Shenzhen, advised by Prof. Benyou Wang. Prior to CUHKSZ, I earned my master's from Harbin Institute of Technology (Shenzhen), supervised by Prof. Qingcai Chen, and my bachelor's from Jinan University.
Find me on Google Scholar (2.5k+ citations), and GitHub (6K stars)!
Email: junying.chen.cs@gmail.com
✦ I expect to graduate in 2027 and am exploring opportunities in both industry and academia.
OnePO: Direct One-stage Policy Optimization for SFT-free Domain Adaptation
ICML 2026 (HuatuoGPT-3)
Incentivizing Medical Vision Capabilities from Large-Scale Multimodal Pre-training
Under Review 2026 (HuatuoGPT-Vision2)
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
ACL Findings 2025
[pdf]
[code]
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at
Scale
EMNLP 2024
[pdf]
[code]
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs
COLM 2024
[pdf]
[code]
HuatuoGPT, Towards Taming Language Model to Be a Doctor
EMNLP Findings 2023
[pdf]
[code]
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis
ACL Findings 2025
[pdf]
[code]
On the Compositional Generalization of Multimodal LLMs for Medical Imaging
ACL 2025
[pdf]
[code]
Benchmarking LLMs on Authentic Cases from Medical Journals
ACL Findings 2026
LLMs Could Autonomously Learn Without External Supervision
ACL Findings 2025
[pdf]
[code]
RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions
EMNLP 2025
[pdf]
[code]
Diaformer: Automatic Diagnosis via Symptoms Sequence Generation
AAAI 2022
[pdf]
[code]
ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Preprint 2025
[pdf]
[code]
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation
Preprint 2025
[pdf]
[code]
MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos
Preprint 2025
[pdf]
[code]
Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B
People
Preprint 2024
[pdf]
[code]
SeDR: Segment Representation Learning for Long Documents Dense Retrieval
Preprint 2022
[pdf]
[code]
I am motivated by the potential of LLMs to support broad applications and create real-world value. My research focuses on making LLMs more usable in real-world practice:
Toward useful and trustworthy AI on the path to AGI.
This website is adapted from Gregory Gunderson and Tianyu Gao.