About Me
I am a final-year master’s student at Tianjin University, under the mentorship of Prof. Gang Pan. Previously, I worked as a research intern at Baidu and explored multimodal document understanding. I also received guidance from Prof. Jianfei Yu during my research journey.
My interests lie broadly in Multimodal Learning, Natural Language Processing, and Computer Vision. My recent series of multimodal research (MNER->GMNER->SMNER) explores how to unlock the potential of vision-language models in complex multimodal scenarios, how to build harmonious interaction and collaboration between multiple models, and how to construct image-text knowledge augmentation methods in open-world scenarios. I also maintain an interest in several vision tasks (e.g., Blind Image Inpainting, Infrared and Visible Image Fusion).
I care more about problems that are worth solving than about staying within a specific field. Feel free to contact me so we can explore possibilities together.
Research Interests
Vision and Language:
News🔥
- [Nov. 2024] I am honored to have received three top honors at Tianjin University (Ranked #1 in CS Major): the China National Scholarship, the First-Class Academic Excellence Scholarship, and the Outstanding (Sanhao) Student Award.
- [Jun. 2024] We released a new study that proposes the Segmented Multimodal Named Entity Recognition (SMNER) task and constructs the corresponding Twitter-SMNER dataset. The dataset and code will be released here.
- [May 2024] One paper was accepted to ACL 2024. See you in Bangkok!
- [Apr. 2024] Delighted to be onboard at Baidu as a research intern.
- [Feb. 2024] A new study on Grounded Multimodal Named Entity Recognition (GMNER) and Large Language Models has been released! See here.
- [Oct. 2023] One paper on Multimodal Named Entity Recognition (MNER) was accepted to EMNLP 2023.
Publications
-
ACL
Jinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Pan
Findings of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand
-
EMNLP
Jinyuan Li, Han Li, Zhuo Pan, Di Sun, Jiahao Wang, Wenkun Zhang, Gang Pan
Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore
-
arXiv
Jinyuan Li, Ziyan Li, Han Li, Jianfei Yu, Rui Xia, Di Sun, Gang Pan
arXiv, 2024
-
arXiv
Jiahao Wang, Jinyuan Li, Gang Pan, Di Sun, Jiawan Zhang
arXiv, 2024
Under Review (IEEE TMM)
-
arXiv
Gang Pan, Yonglu Liu, Jinyuan Li, Zhenjun Han, Jiahao Wang, Di Sun
arXiv, 2024
Experience
- [Apr. 2024 - Jun. 2024] Baidu, Research Intern
Services
Conference Reviewer/Program Committee
Journal Reviewer
Teaching Assistant
- Advanced Computer Vision (Postgraduate), Tianjin University, Fall 2023
Awards and Honors
- [2024] China National Scholarship (Top 1%, Ranked #1 in CS Major)
- [2024] (First-class) Academic Excellence Scholarship of Tianjin University (Ranked #1 in CS Major)
- [2024] Outstanding Student of Tianjin University (Ranked #1 in CS Major)
- [2022 & 2023] (Second-class) Academic Excellence Scholarship of Tianjin University
- [2021] Outstanding Student of Taiyuan University of Technology (Top 2%, 3/128)
- [2020 & 2021] Academic Excellence Scholarship of Taiyuan University of Technology
- [2020] Excellent Academic Progress Student of Taiyuan University of Technology
- [2019] Outstanding Student Cadre of Taiyuan University of Technology
Miscellaneous
When I’m not in research mode, I enjoy swimming (I can swim one kilometer freestyle in under 18 minutes) and playing the violin (I sight-read fluently). Both have been with me for nearly twenty years. My favorite virtuoso is Ray Chen. His interpretation of the music always carries a distinct personal touch. I hope I can approach my research in the same way.
I used to be a capable road cyclist (able to maintain 36 km/h for two hours), but I no longer pursue the sport.