About Me
I am a final-year master’s student at Tianjin University, under the mentorship of Prof. Gang Pan. Previously, I worked as a research intern at Baidu and explored multimodal document understanding.
My enthusiasm broadly lies in exploring Multimodal Learning, Natural Language Processing and Computer Vision. My recent series of multimodal research (MNER->GMNER->SMNER) explored how to unleash the potential capabilities of visual-language models in complex multimodal scenarios, how to build harmonious interaction and collaboration between multiple models, and how to construct image-text based knowledge augmentation methods in open-world scenarios. Additionally, I also maintain interest in several visual tasks (e.g., Blind Image Inpainting, Infrared and Visible Image Fusion).
I am more concerned about the problems that are worth solving rather than limiting myself to a specific field. Feel free to contact me and explore possibilities together.
Research Interests
Vision and Language:
News🔥
- [Nov. 2024] I am honored to have received three top honors at Tianjin University (Ranked #1 in CS Major): the China National Scholarship, the First-Class Academic Excellence Scholarship, and the Outstanding (Sanhao) Student Award.
- [Jun. 2024] We release a new study proposing the Segmented Multimodal Named Entity Recognition (SMNER) task and constructing the corresponding Twitter-SMNER dataset. Datasets and Code will be released at here.
- [May. 2024] One paper is accepted to ACL 2024. See you in Bangkok!
- [Apr. 2024] Delighted to be onboard at Baidu as a research intern.
- [Feb. 2024] A new research about Grounded Multimodal Named Entity Recognition (GMNER) and Large Language Models has been released! see here.
- [Oct. 2023] One paper about Multimodal Named Entity Recognition (MNER) is accepted to EMNLP 2023.
Publications
-
ACL
Jinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Pan
Findings of the Association for Computational Linguistics (ACL) 2024, Bangkok, Thailand
-
EMNLP
Jinyuan Li, Han Li, Zhuo Pan, Di Sun, Jiahao Wang, Wenkun Zhang, Gang Pan
Findings of the Association for Computational Linguistics (EMNLP) 2023, Singapore
-
arxiv
Jinyuan Li, Ziyan Li, Han Li, Jianfei Yu, Rui Xia, Di Sun, Gang Pan
arxiv, 2024
-
arxiv
Jiahao Wang, Jinyuan Li, Gang Pan, Di Sun, Jiawan Zhang
arxiv, 2024
Under Review (IEEE TMM)
-
arxiv
Gang Pan, Yonglu Liu, Jinyuan Li, Zhenjun Han, Jiahao Wang, Di Sun
arxiv, 2024
Experience
- [Apr. 2024 - Jun. 2024] Baidu, Research Intern
Services
Conference Reviewer/Program Committee
Journal Reviewer
Teaching Assistant
- Advanced Computer Vision (Postgraduate), Tianjin University, Fall 2023
Awards and Honors
- [2024] China National Scholarship (Top 1%, Ranked #1 in CS Major)
- [2024] First-class Academic Excellence Scholarship of Tianjin University (Ranked #1 in CS Major)
- [2024] Outstanding Student of Tianjin University (Ranked #1 in CS Major)
- [2021] Outstanding Student of Taiyuan University of Technology (Top 2%, 3/128)
- [2020 & 2021] Academic Excellence Scholarship of Taiyuan University of Technology
Miscellaneous
When I’m not in research mode, I enjoy swimming (Swim one kilometer freestyle in less than 18 minutes) and violin (Fluent sight-reading skills). They have been with me for nearly twenty years. My favorite virtuoso is Ray Chen. His interpretation of the music always carries a distinct personal touch. I hope I can approach my research in the same way.
In the past, I demonstrated good road cycling ability (being able to maintain a speed of 36 km/h for two hours), but I no longer continue this sport.