Yuxia Wang

Hi, I am Yuxia Wang (王宇侠 in Chinese).

yuxia_bubble.JPG

NLP Department

MBZUAI, 1B, Block C

Masdar City, Abu Dhabi, UAE

I am currently a postdoctoral researcher at MBZUAI NLP department, working with Prof. Preslav Nakov. I will be joining INSAIT in Sofia as a tenure-track Assistant Professor starting in Fall 2025. Prior to this, I completed my PhD at The University of Melbourne in January 2023, under the guidance of Prof. Tim Baldwin and Prof. Karin Verspoor. I earned both my Bachelor’s (2016) and Master’s (2018) degrees from the Beijing Institute of Technology.

My research interests lie in natural language processing, with a particular goal to advance safe, factual, and empathetic human-AI interactions. My current work mainly focuses on LLM/LRM optimization in reasoning, safety, factuality and empathy, low-resource language model development, and machine-generated content detection. I have published papers in top-tier NLP conferences and journals such as ACL, TACL, EMNLP, NAACL and so on.

I am looking for motivated PhD students. If you’re passionate about these topics, feel free to contact me with your CV and a brief introduction of your research interests.

news

May 19, 2025 One paper (SpeechDialogueFactory) accepted to Interspeech 2025!
May 15, 2025 Seven papers (5 Main and 2 Findings) accepted to ACL 2025!
Apr 28, 2025 Three papers (Arabic Safeguard Evaluation, Libra-leaderboard, and FIRE) accepted to NAACL 2025! See you in Albuquerque, New Mexico!

selected publications

  1. M4
    M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
    Yuxia Wang, Jonibek Mansurov, Petar Ivanov, and 12 more authors
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2024
  2. M4GT
    M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection
    Yuxia Wang, Jonibek Mansurov, Petar Ivanov, and 11 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  3. OpenFactCheck
    OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
    Hasan Iqbal*, Yuxia Wang*, Minghan Wang, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Nov 2024
  4. Factcheck-Bench
    Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers
    Yuxia Wang, Revanth Gangi Reddy, Zain Muhammad Mujahid, and 10 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  5. CDNA
    A Chinese Dataset for Evaluating the Safeguards in Large Language Models
    Yuxia Wang, Zenan Zhai, Haonan Li, and 6 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024, Aug 2024
  6. DNA
    Do-Not-Answer: Evaluating Safeguards in LLMs
    Yuxia Wang, Haonan Li, Xudong Han, and 2 more authors
    In Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
  7. SemEval2024MGT
    SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
    Yuxia Wang, Jonibek Mansurov, Petar Ivanov, and 7 more authors
    In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), Jun 2024
  8. Empathy
    Can Machines Resonate with Humans? Evaluating the Emotional and Empathic Comprehension of LMs
    Muhammad Arslan Manzoor, Yuxia Wang, Minghan Wang, and 1 more author
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  9. OpenFactCheck
    OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
    Yuxia Wang, Minghan Wang, Hasan Iqbal, and 4 more authors
    In Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025
  10. HumanEval MGT
    Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI
    Yuxia Wang, Rui Xing, Jonibek Mansurov, and 8 more authors
    arXiv preprint arXiv:2502.11614, Jan 2025
  11. KazLLM
    Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
    Fajri Koto, Rituraj Joshi, Nurdaulet Mukhituly, and 8 more authors
    arXiv preprint arXiv:2503.01493, Jan 2025
  12. SpeechDialogue
    SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development
    Minghan Wang, Ye Bai, Yuxia Wang, and 3 more authors
    Interspeech 2025, Jan 2025
  13. FinChain
    FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning
    Zhuohan Xie, Dhruv Sahnan, Debopriyo Banerjee, and 14 more authors
    Jan 2025
  14. HD-NDEs
    HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs
    Qing Li, Jiahui Geng, Zongxiong Chen, and 5 more authors
    ACL 2025, Jan 2025