Publications

Publications sorted by year.

2025

2025

  1. CLEF2025
    Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-Author Writing Style Analysis, and Generative Plagiarism Detection
    Janek Bevendorff, Daryna Dementieva, Maik Fröbe, and 8 more authors
    In ECIR, 2025
  2. Loki
    Loki: An Open-Source Tool for Fact Verification
    Haonan Li, Xudong Han, Hao Wang, and 7 more authors
    In Proceedings of the 31st International Conference on Computational Linguistics: System Demonstrations, Jan 2025
  3. Libra-leaderboard
    Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
    Haonan Li, Xudong Han, Zenan Zhai, and 32 more authors
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (System Demonstrations), Apr 2025
  4. FIRE
    FIRE: Fact-checking with Iterative Retrieval and Verification
    Zhuohan Xie, Rui Xing, Yuxia Wang, and 5 more authors
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  5. ADNA
    Arabic Dataset for LLM Safeguard Evaluation
    Yasser Ashraf, Yuxia Wang, Bin Gu, and 2 more authors
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
  6. Jailbreak survey
    Against The Achilles’ Heel: A Survey on Red Teaming for Generative Models
    Lizhi Lin, Honglin Mu, Zenan Zhai, and 8 more authors
    Journal of Artificial Intelligence Research, Apr 2025
  7. GenAI-MGT
    GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human
    Yuxia Wang, Artem Shelmanov, Jonibek Mansurov, and 23 more authors
    In Proceedings of the 1stWorkshop on GenAI Content Detection (GenAIDetect), Jan 2025
  8. OpenFactCheck
    OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
    Yuxia Wang, Minghan Wang, Hasan Iqbal, and 4 more authors
    In Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025
  9. HumanEval MGT
    Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI
    Yuxia Wang, Rui Xing, Jonibek Mansurov, and 8 more authors
    arXiv preprint arXiv:2502.11614, Jan 2025
  10. Kazakh Safety
    Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts
    Maiya Goloburda*, Nurkhan Laiyk*, Diana Turmakhan*, and 8 more authors
    ACL 2025 (Findings), Jan 2025
  11. Kazakh SFT
    Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh
    Nurkhan Laiyk, Daniil Orel, Rituraj Joshi, and 4 more authors
    ACL 2025, Jan 2025
  12. KazMMLU
    KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan
    Mukhammed Togmanov, Nurdaulet Mukhituly, Diana Turmakhan, and 8 more authors
    ACL 2025, Jan 2025
  13. KazLLM
    Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
    Fajri Koto, Rituraj Joshi, Nurdaulet Mukhituly, and 8 more authors
    arXiv preprint arXiv:2503.01493, Jan 2025
  14. Unlearning survey
    A comprehensive survey of machine unlearning techniques for large language models
    Jiahui Geng, Qing Li, Herbert Woisetschlaeger, and 6 more authors
    arXiv preprint arXiv:2503.01854, Jan 2025
  15. SpeechDialogue
    SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development
    Minghan Wang, Ye Bai, Yuxia Wang, and 3 more authors
    Interspeech 2025, Jan 2025
  16. FAID
    FAID: Fine-grained AI-generated Text Detection using Multi-task Auxiliary and Multi-level Contrastive Learning
    Minh Ngoc Ta, Dong Cao Van, Duc-Anh Hoang, and 6 more authors
    arXiv preprint arXiv:2505.14271, Jan 2025
  17. UrduFactCheck
    UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking
    Sarfraz Ahmad, Hasan Iqbal, Momina Ahsan, and 6 more authors
    arXiv preprint arXiv:2505.15063, Jan 2025
  18. VSCBench
    VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration
    Jiahui Geng, Qing Li, Zongxiong Chen, and 7 more authors
    ACL 2025 (Findings), Jan 2025
  19. FinChain
    FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning
    Zhuohan Xie, Dhruv Sahnan, Debopriyo Banerjee, and 14 more authors
    Jan 2025
  20. HD-NDEs
    HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs
    Qing Li, Jiahui Geng, Zongxiong Chen, and 5 more authors
    ACL 2025, Jan 2025

2024

2024

  1. M4
    M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
    Yuxia Wang, Jonibek Mansurov, Petar Ivanov, and 12 more authors
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2024
  2. M4GT
    M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection
    Yuxia Wang, Jonibek Mansurov, Petar Ivanov, and 11 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  3. UrduNewsDetection
    Detection of Human and Machine-Authored Fake News in Urdu
    Muhammad Zain Ali, Yuxia Wang, Bernhard Pfahringer, and 1 more author
    ACL 2025, Aug 2024
  4. RethinkSTS
    Rethinking STS and NLI in Large Language Models
    Yuxia Wang, Minghan Wang, and Preslav Nakov
    In Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
  5. OpenFactCheck
    OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
    Hasan Iqbal*, Yuxia Wang*, Minghan Wang, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Nov 2024
  6. Factcheck-Bench
    Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers
    Yuxia Wang, Revanth Gangi Reddy, Zain Muhammad Mujahid, and 10 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  7. ASR
    Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR
    Minghan Wang, Yuxia Wang, Thuy-Trang Vu, and 2 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  8. SFT
    Demystifying Instruction Mixing for Fine-tuning Large Language Models
    Renxi Wang, Haonan Li, Minghao Wu, and 4 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), Aug 2024
  9. Factuality survey
    Factuality of Large Language Models: A Survey
    Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  10. SimulMT
    Conversational simulmt: Efficient simultaneous translation with large language models
    Minghan Wang, Thuy-Trang Vu, Yuxia Wang, and 2 more authors
    arXiv preprint arXiv:2402.10552, Nov 2024
  11. Uncertainty srvey
    A Survey of Confidence Estimation and Calibration in Large Language Models
    Jiahui Geng, Fengyu Cai, Yuxia Wang, and 3 more authors
    In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Jun 2024
  12. CDNA
    A Chinese Dataset for Evaluating the Safeguards in Large Language Models
    Yuxia Wang, Zenan Zhai, Haonan Li, and 6 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024, Aug 2024
  13. DNA
    Do-Not-Answer: Evaluating Safeguards in LLMs
    Yuxia Wang, Haonan Li, Xudong Han, and 2 more authors
    In Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
  14. SemEval2024MGT
    SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
    Yuxia Wang, Jonibek Mansurov, Petar Ivanov, and 7 more authors
    In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), Jun 2024
  15. Empathy
    Can Machines Resonate with Humans? Evaluating the Emotional and Empathic Comprehension of LMs
    Muhammad Arslan Manzoor, Yuxia Wang, Minghan Wang, and 1 more author
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  16. Llm-detectaive
    LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection
    Mervat Abassy, Kareem Elozeiri, Alexander Aziz, and 21 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Nov 2024

2023

2023

  1. Collective STS
    Collective Human Opinions in Semantic Textual Similarity
    Yuxia Wang, Shimin Tao, Ning Xie, and 3 more authors
    Transactions of the Association for Computational Linguistics, Nov 2023
  2. PhD Thesis
    Towards Accurate and Reliable Modelling for Semantic Textual Similarity
    Yuxia Wang
    The University of Melbourne, Nov 2023

2022

2022

  1. NoisyRegulation
    Noisy Label Regularisation for Textual Regression
    Yuxia Wang, Timothy Baldwin, and Karin Verspoor
    In Proceedings of the 29th International Conference on Computational Linguistics, Oct 2022
  2. NLI
    Capture Human Disagreement Distributions by Calibrated Networks for Natural Language Inference
    Yuxia Wang, Minghan Wang, Yimeng Chen, and 5 more authors
    In Findings of the Association for Computational Linguistics: ACL 2022, May 2022
  3. Diformer
    Diformer: Directional Transformer for Neural Machine Translation
    Minghan Wang, Jiaxin Guo, Yuxia Wang, and 8 more authors
    In Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, Jun 2022
  4. MT
    The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation
    Jiaxin Guo, Yinglu Li, Minghan Wang, and 9 more authors
    In Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022), May 2022
  5. Uncertainty
    Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression
    Yuxia Wang, Daniel Beck, Timothy Baldwin, and 1 more author
    Transactions of the Association for Computational Linguistics, May 2022
  6. Speech
    The HW-TSC’s Offline Speech Translation System for IWSLT 2022 Evaluation
    Yinglu Li, Minghan Wang, Jiaxin Guo, and 9 more authors
    In Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022), May 2022
  7. Speech
    The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation
    Minghan Wang, Jiaxin Guo, Yinglu Li, and 9 more authors
    In Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022), May 2022

2021

2021

  1. HI-CMLM
    HI-CMLM: Improve CMLM with Hybrid Decoder Input
    Minghan Wang, Guo Jiaxin, Yuxia Wang, and 6 more authors
    In Proceedings of the 14th International Conference on Natural Language Generation, Aug 2021
  2. MT
    Joint-training on Symbiosis Networks for Deep Nueral Machine Translation models
    Zhengzhe Yu, Jiaxin Guo, Minghan Wang, and 8 more authors
    arXiv preprint arXiv:2112.11642, Aug 2021
  3. MT
    How Length Prediction Influence the Performance of Non-Autoregressive Translation?
    Minghan Wang, Guo Jiaxin, Yuxia Wang, and 6 more authors
    In Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, Nov 2021
  4. MT
    Self-distillation mixup training for non-autoregressive neural machine translation
    Jiaxin Guo, Minghan Wang, Daimeng Wei, and 8 more authors
    arXiv preprint arXiv:2112.11640, Nov 2021
  5. MT
    HW-TSC’s Participation at WMT 2021 Quality Estimation Shared Task
    Yimeng Chen, Chang Su, Yingtao Zhang, and 9 more authors
    In Proceedings of the Sixth Conference on Machine Translation, Nov 2021
  6. MT
    The HW-TSC’s Offline Speech Translation Systems for IWSLT 2021 Evaluation
    Minghan Wang, Yuxia Wang, Chang Su, and 8 more authors
    arXiv preprint arXiv:2108.03845, Nov 2021
  7. MT
    Incorporating complete syntactical knowledge for spoken language understanding
    Shimin Tao, Ying Qin, Yimeng Chen, and 8 more authors
    In Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction: 6th China Conference, CCKS 2021, Guangzhou, China, November 4-7, 2021, Proceedings 6, Nov 2021
  8. MT
    Incorporating Complete Syntactical Knowledge for Spoken Language Understanding
    Weibin Meng, Yanghua Xiao, Jiaxin Guo, and 5 more authors
    In Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction: 6th China Conference, CCKS 2021, Guangzhou, China, November 4-7, 2021, Proceedings, Nov 2021

2020

2020

  1. STS
    Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity
    Yuxia Wang, Fei Liu, Karin Verspoor, and 1 more author
    In Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, Jul 2020
  2. STS
    Learning from Unlabelled Data for Clinical Semantic Textual Similarity
    Yuxia Wang, Karin Verspoor, and Timothy Baldwin
    In Proceedings of the 3rd Clinical Natural Language Processing Workshop, Nov 2020
  3. Medical
    A multi-pass sieve for clinical concept normalization
    Yuxia Wang, Brian Hur, Karin Verspoor, and 1 more author
    Traitement Automatique Des Langues, Nov 2020