I am the Research Fellow in National University of Singapore (NUS). Prior to that, I received the PhD and Master Degree from NUS in 2023 and 2019, supervised by Prof. Li Haizhou, Bachelor Degree from Soochow University in 2018.

My research interest includes (audio-only or audio-visual) speech processing: enhancement, extraction and seperation; speaker processing: recognition, diarization, active speaker detection and anti-spoofing. I also work in self-supervised learning. I have published more than 20 papers at the top international AI conferences and journals such as TASLP, TMM, ACM MM, ICASSP, INTERSPEECH.

📜 Research Area

Research Area Tasks
Speech processing (Audio-visual) speech enhancement, extraction and separation
Speaker processing (Audio-visual) speaker recognition, verification, diarization and anti-spoofing
Multi-modal speech processing Active speaker detection, cross-modal speaker recognition
Algoirthm Self-supervised learning, fundamental model

🏫 Education

  • 2019.08 - 2023.08, Ph.D. in Speech Processing and Computer Vision, National University of Singapore (NUS), Singapore.
  • 2018.08 - 2019.06, M.Sc. in Electronic and Computer Engineer, National University of Singapore (NUS), Singapore.
  • 2014.09 - 2018.06, B.Eng. in Electronic Engineer, Soochow University, Suzhou, China.

Working Experience

  • 2023.08 - Now, Research Fellow, National University of Singapore (NUS), Singapore.

📝 Publication

2024

2023

2022

2021

2020

💻 Open Source Code

  • Speaker Recognition Framework
  • Active Speaker Detection Framework
  • Self-supervised Speaker Recognition Framework
  • Audio-visual Speaker Recognition Framework
  • Cross-modal Speaker Recognition Framework
  • Ego4d Benchmark

👔 Internship and Visiting Experience

  • 2022.02 - 2022.08, Visiting Student, Chinese University of Hong Kong (CUHKSZ), Shenzhen, China.
  • 2015.07 - 2015.08, Visiting Student, University of Cambridge, Cambridge, UK.

🎖 Others

Award

  • The 1st place winner in FAME Challenge, ACM-Multimedia, 2024
  • Egocentric Vision (EgoVis) 2022/2023 Distinguished Paper Award, 2024
  • IEEE SLP Student Travel Grant, ICASSP Best Paper Nominee (Corresponding author), 2024
  • Nanyang Speech Technology Forum, Best Student Paper Award, 2023
  • PREMIA, Best Student Paper Award, 2022
  • CVPR Best Paper Nominee, 2022
  • The 2nd place winner in NIST Speaker Recognition Evaluation (SRE), 2021
  • The 3rd place winner in the ActivityNet Challenge (Speaker), CVPR Workshop, 2021
  • NUS Research Scholarship, 2019

Reviewer

  • Computer Vision and Pattern Recognition Conference (CVPR),
  • IEEE Transactions on Audio, Speech, and Language Processing (TASLP),
  • IEEE The International Conference on Acoustics, Speech, & Signal Processing (ICASSP),
  • IEEE Spoken Language Technology Workshop(SLT),
  • The International Speech Communication Association (INTERSPEECH),
  • Signal Processing Letters (SPL),
  • Digital Signal Processing (DSP),
  • Computer Speech & Language (CSL)
  • IEEE Open Journal of Signal Processing (OJSP)
  • International Symposium on Chinese Spoken Language Processing (ISCSLP)

Teaching

  • EE3801 Data Engineering Principles, NUS undergraduate course
  • EE5132 Wireless and Sensor Networks, NUS graduate course