Yuxia Wang
Hi, I am Yuxia Wang (王宇侠 in Chinese).
Floor 13, Room 2
INSAIT
Sofia, Bulgaria
I am currently a tenure-track Assistant Professor at INSAIT in Sofia. Prior to this, I was a postdoctoral researcher at MBZUAI NLP department, working with Prof. Preslav Nakov. I completed my PhD at The University of Melbourne in January 2023, under the guidance of Prof. Tim Baldwin and Prof. Karin Verspoor. I earned both my Bachelor’s (2016) and Master’s (2018) degrees from the Beijing Institute of Technology.
My research interests lie in natural language processing, with a particular goal to enable models to advance safe, factual, and empathetic human-AI interactions. My current work mainly focuses on LLM/LRM optimization in reasoning, safety, factuality and empathy, machine-generated content detection, and LLM applications in financial and medical domains. I have published papers in top-tier NLP conferences and journals such as ACL, TACL, EMNLP, NAACL and so on.
I am looking for motivated PhD students. We offer competitive scholarship (€40,000 per year), ample GPU resources (GB200), and strong academic ties with ETH Zurich, MIT, and DeepMind. We have co-supervision programs with ETH Zurich and DeepMind. Under our DeepMind Co-Supervision Program, PhD students can work jointly with world-leading mentors from Deepmind, such as Kristina Toutanova and Fei Liu. If you’re passionate about these topics, feel free to contact me with your CV and a brief introduction of your research interests.
news
| May 19, 2025 | One paper (SpeechDialogueFactory) accepted to Interspeech 2025! | 
|---|---|
| May 15, 2025 | Seven papers (5 Main and 2 Findings) accepted to ACL 2025! | 
| Apr 28, 2025 | Three papers (Arabic Safeguard Evaluation, Libra-leaderboard, and FIRE) accepted to NAACL 2025! See you in Albuquerque, New Mexico! | 
selected publications
-  M4M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text DetectionIn Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2024
 -  M4GTM4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text DetectionIn Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
 -  OpenFactCheckOpenFactCheck: A Unified Framework for Factuality Evaluation of LLMsIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Nov 2024
 -  Factcheck-BenchFactcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkersIn Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
 -  DNADo-Not-Answer: Evaluating Safeguards in LLMsIn Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
 -  OpenFactCheckOpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMsIn Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025
 -  HumanEval MGTIs Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AIarXiv preprint arXiv:2502.11614, Jan 2025
 -  KazLLMLlama-3.1-Sherkala-8B-Chat: An Open Large Language Model for KazakharXiv preprint arXiv:2503.01493, Jan 2025
 -  SpeechDialogueSpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM DevelopmentInterspeech 2025, Jan 2025
 -  FinChainFinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial ReasoningJan 2025
 -  HD-NDEsHD-NDEs: Neural Differential Equations for Hallucination Detection in LLMsACL 2025, Jan 2025