Yuxia Wang
Hi, I am Yuxia Wang (王宇侠 in Chinese).
NLP Department
MBZUAI, 1B, Block C
Masdar City, Abu Dhabi, UAE
I am currently a postdoctoral researcher at MBZUAI NLP department, working with Prof. Preslav Nakov. I will be joining INSAIT in Sofia as a tenure-track Assistant Professor starting in Fall 2025. Prior to this, I completed my PhD at The University of Melbourne in January 2023, under the guidance of Prof. Tim Baldwin and Prof. Karin Verspoor. I earned both my Bachelor’s (2016) and Master’s (2018) degrees from the Beijing Institute of Technology.
My research interests lie in natural language processing, with a particular goal to advance safe, factual, and empathetic human-AI interactions. My current work mainly focuses on LLM/LRM optimization in reasoning, safety, factuality and empathy, low-resource language model development, and machine-generated content detection. I have published papers in top-tier NLP conferences and journals such as ACL, TACL, EMNLP, NAACL and so on.
I am looking for motivated PhD students. If you’re passionate about these topics, feel free to contact me with your CV and a brief introduction of your research interests.
news
May 19, 2025 | One paper (SpeechDialogueFactory) accepted to Interspeech 2025! |
---|---|
May 15, 2025 | Seven papers (5 Main and 2 Findings) accepted to ACL 2025! |
Apr 28, 2025 | Three papers (Arabic Safeguard Evaluation, Libra-leaderboard, and FIRE) accepted to NAACL 2025! See you in Albuquerque, New Mexico! |
selected publications
- M4M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text DetectionIn Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2024
- M4GTM4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text DetectionIn Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
- OpenFactCheckOpenFactCheck: A Unified Framework for Factuality Evaluation of LLMsIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Nov 2024
- Factcheck-BenchFactcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkersIn Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
- DNADo-Not-Answer: Evaluating Safeguards in LLMsIn Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
- OpenFactCheckOpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMsIn Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025
- HumanEval MGTIs Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AIarXiv preprint arXiv:2502.11614, Jan 2025
- KazLLMLlama-3.1-Sherkala-8B-Chat: An Open Large Language Model for KazakharXiv preprint arXiv:2503.01493, Jan 2025
- SpeechDialogueSpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM DevelopmentInterspeech 2025, Jan 2025
- FinChainFinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial ReasoningJan 2025
- HD-NDEsHD-NDEs: Neural Differential Equations for Hallucination Detection in LLMsACL 2025, Jan 2025