About Me

I am Yudong Zhang (张宇东), a PhD candidate in the NICS-EFC Laboratory, Department of Electronic Engineering, Tsinghua University, under the supervision of Professor Yu Wang (汪玉). I completed my bachelor’s degree in the Department of Electronic Engineering at Tsinghua University in 2020, advised by Professor Jiansheng Chen (陈健生).

My research focuses on enhancing the safety and efficiency of vision-language models. To date, I have authored or co-authored 11 peer-reviewed papers, including 6 first-author publications at conferences such as AAAI, ACM Multimedia (ACMMM), EMNLP, and NAACL.

Currently, I am an intern at Tencent’s Hunyuan team, mentored by Xingwu Sun (孙兴武) and Ruobing Xie (谢若冰). I focus on pre-training of large language models.

If you are interested in academic collaboration or would like to discuss potential research opportunities, please feel free to reach out via email.

I will be graduating in June 2026 and I am looking for job opportunities.

🔥 News

2025.08: 🎉 One first-author paper (TCP) was accepted to EMNLP 2025 Findings.
2025.07: 🎉 Two first-author papers, DHCP and F3, were accepted to ACMMM 2025; F3 was accepted as an Oral.
2025.05: I was featured by Tsinghua University on Twitter and Facebook; see Twitter 1, Twitter 2, Twitter 3, Facebook.
2025.04: I received the Machine Learning Platform Department Outstanding Intern Award, Technology and Engineering Group (TEG), Tencent.
2025.01: 🎉 One first-author paper (QAVA) was accepted to NAACL 2025.
2024.12: 🎉 One first-author paper (JointAugmentation) was accepted to AAAI 2025.
2024.07: 🎉 One first-author paper (PIP) was accepted to ACMMM 2024 as an Oral.
2024.04: I joined the Tencent Hunyuan team as an intern in Beijing!

📝 Publications

(* indicates equal contribution, † indicates corresponding author.)

Safety of Large Models

ACM-MM 2024 (Oral) PIP: Detecting Adversarial Examples in Large Vision-Language Models via Attention Patterns of Irrelevant Probe Questions, Yudong Zhang, Ruobing Xie†, Jiansheng Chen†, Xingwu Sun, Yu Wang† | Paper | Code | Video
NAACL 2025 QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models, Yudong Zhang, Ruobing Xie†, Jiansheng Chen†, Xingwu Sun, Zhanhui Kang, Yu Wang† | Paper | Code | Video
ACM-MM 2025 DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models, Yudong Zhang, Ruobing Xie†, Xingwu Sun, Yiqing Huang, Jiansheng Chen†, Zhanhui Kang, Di Wang, Yu Wang† | Paper | Code | Video
ACM-MM 2025 (Oral) Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs, Yudong Zhang, Ruobing Xie†, Yiqing Huang, Jiansheng Chen†, Xingwu Sun, Zhanhui Kang, Di Wang, Yu Wang† | Paper | Code | Video
EMNLP 2025 (Findings) The Security Threat of Compressed Projectors in Large Vision-Language Models, Yudong Zhang, Ruobing Xie, Xingwu Sun, Jiansheng Chen, Zhanhui Kang, Di Wang, Yu Wang | Paper | Code | Video

Efficiency of Pre-training

AAAI 2025 Enhancing Contrastive Learning Inspired by the Philosophy of “the Blind Men and the Elephant”, Yudong Zhang, Ruobing Xie†, Jiansheng Chen†, Xingwu Sun, Zhanhui Kang, Yu Wang† | Paper | Code | Video

Others (non-first author)

ACM-MM 2022 3D Human Mesh Reconstruction by Learning to Sample Joint Adaptive Tokens for Transformers, Youze Xue, Jiansheng Chen†, Yudong Zhang, Cheng Yu, Huimin Ma, Hongbing Ma | Paper
CIKM 2023 Transferable Structure-based Adversarial Attack of Heterogeneous Graph Neural Network, Yu Shang, Yudong Zhang, Jiansheng Chen†, Depeng Jin, Yong Li | Paper
AAAI 2024 Step Vulnerability Guided Mean Fluctuation Adversarial Attack against Conditional Diffusion Models, Hongwei Yu, Jiansheng Chen†, Xinlong Ding, Yudong Zhang, Ting Tang, Huimin Ma | Paper
Knowledge-Based Systems (KBS) Image paragraph captioning with topic clustering and topic shift prediction, Ting Tang, Jiansheng Chen†, Yiqing Huang, Huimin Ma, Yudong Zhang, Hongwei Yu | Paper
ICCV 2025 DADet: Safeguarding Image Conditional Diffusion Models against Adversarial and Backdoor Attacks via Diffusion Anomaly Detection, Hongwei Yu, Xinlong Ding, Jiawei Li, Jinlong Wang, Yudong Zhang, Rongquan Wang, Huimin Ma, Jiansheng Chen

🎖 Honors and Awards

Highlighted Honors

2025.04 Tencent TEG Machine Learning Platform Department Outstanding Intern Award
2024.12 Tsinghua University 2024 Student of the Year (10 students in Tsinghua each year)
2023.12 Tsinghua University Lin Feng Counselor Award (The highest honor for Tsinghua University counselors)
2018.12 Tsinghua University Jiang Nanxiang Scholarship (The highest scholarship for third-year undergraduate students at Tsinghua University)
2022.10 First Prize, Lu Dajin and Liu Runsheng Outstanding Teaching Assistant Award, Department of Electronic Engineering, Tsinghua University
2023.10 Excellent Student Leader of Tsinghua University
2019.11 Top Ten Volunteers at Tsinghua University (10 volunteers in Tsinghua every two years)
2024.12 Feng Chongxi Research Excellence Scholarship, Department of Electronic Engineering, Tsinghua University
2025.04 As primary contributor, my work won the first and third prizes of the 35th Student Laboratory Construction Contribution Award of Tsinghua University

Others

2022.12 Tsinghua University December Ninth Counselor Award
2024.12 Graduate Student “Star of Electronics”, Department of Electronic Engineering, Tsinghua University (no more than 5 graduate students per year)
2019.12 Undergraduate Student “Star of Electronics”, Department of Electronic Engineering, Tsinghua University (no more than 5 undergraduate students per year)
Tsinghua University General Excellence Scholarship (twice as an undergraduate and four times as a graduate student)
Tsinghua University Science and Technology Innovation Excellence Award, Volunteer Excellence Award, Social Work Excellence Award (2 times), Friends of Tsinghua-Changfei Scholarship First Prize and Second Prize
Outstanding Student Leader of Tsinghua University (3 times)
2020.09 As captain, led the Tsinghua University Science and Technology Service Team to win the Tan Haoqiang Computer Education Fund Scholarship Outstanding Student Group Award

📖 Education

2020.09 - now, PhD student, Department of Electronic Engineering, Tsinghua University, Beijing.
2016.09 - 2020.06, Undergraduate, Department of Electronic Engineering, Tsinghua University, Beijing.
2013.09 - 2016.06, Baotou No. 95 Middle School (Baogang No. 1 Middle School), Baotou, Inner Mongolia.

💻 Internships

2024.04 - now, Tencent, Hunyuan team, Beijing.
2023.07 - 2024.01, Infinigence, Infra group, Beijing.
2021.07 - 2021.08, Ministry of Education of the People’s Republic of China, China Education Television, Beijing.
2019.07 - 2019.08, Megvii, Research institute, Beijing.

🔧 Patents

Chinese patents (granted)

Kubernetes container access methods, devices, computing devices, and storage media (2024104387144), Yu Wang, Yudong Zhang.
Distributed Task Dynamic Service Discovery Method, Device, and Task Training System (2024104387341), Yu Wang, Yudong Zhang.
Methods and apparatus for multi-user collaborative use of GPU computing capabilities (2024104384428), Yu Wang, Yudong Zhang (inventor registration error; correction in progress).
Sample purification methods, apparatus, equipment, and media (2024109947320), Yudong Zhang, Ruobing Xie, Xingwu Sun, Zhanhui Kang.
Model hallucination detection method, apparatus, device, storage medium, and program product (2024110474366), Yudong Zhang, Ruobing Xie, Xingwu Sun, Zhanhui Kang.
Text generation method, apparatus, device, and readable storage medium (2024109116223), Yudong Zhang, Ruobing Xie, Xingwu Sun, Zhanhui Kang.

Chinese patents (pending)

Adversarial example image generation method, apparatus, computer device, and storage medium (2024110458537), Yudong Zhang, Ruobing Xie, Xingwu Sun, Zhanhui Kang.
An image processing method and related apparatus (2024110598759), Yudong Zhang, Ruobing Xie, Xingwu Sun, Zhanhui Kang.
Image processing methods, devices, equipment, readable storage media, and program products (2024111072007), Yudong Zhang, Ruobing Xie, Xingwu Sun, Zhanhui Kang.

✍️ Academic Service

Reviewer

CVPR
ICCV
ACM-MM
ICLR
ARR (ACL/EMNLP/NAACL)
AAAI