Publications
Publications sorted by year.
2025
- [CLEF2025] Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-Author Writing Style Analysis, and Generative Plagiarism Detection. In ECIR, 2025
- [Loki] Loki: An Open-Source Tool for Fact Verification. In Proceedings of the 31st International Conference on Computational Linguistics: System Demonstrations, Jan 2025
- Libra-leaderboard
- [Jailbreak survey] Against The Achilles’ Heel: A Survey on Red Teaming for Generative Models. Journal of Artificial Intelligence Research, Apr 2025
- [GenAI-MGT] GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human. In Proceedings of the 1st Workshop on GenAI Content Detection (GenAIDetect), Jan 2025
- [OpenFactCheck] OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs. In Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025
- [HumanEval MGT] Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI. arXiv preprint arXiv:2502.11614, Jan 2025
- [Kazakh Safety] Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts. ACL 2025 (Findings), Jan 2025
- [Kazakh SFT] Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh. ACL 2025, Jan 2025
- [KazMMLU] KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan. ACL 2025, Jan 2025
- [KazLLM] Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh. arXiv preprint arXiv:2503.01493, Jan 2025
- [Unlearning survey] A comprehensive survey of machine unlearning techniques for large language models. arXiv preprint arXiv:2503.01854, Jan 2025
- [SpeechDialogue] SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development. Interspeech 2025, Jan 2025
- [FAID] FAID: Fine-grained AI-generated Text Detection using Multi-task Auxiliary and Multi-level Contrastive Learning. arXiv preprint arXiv:2505.14271, Jan 2025
- [UrduFactCheck] UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking. arXiv preprint arXiv:2505.15063, Jan 2025
- [VSCBench] VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration. ACL 2025 (Findings), Jan 2025
- [FinChain] FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning. Jan 2025
- [HD-NDEs] HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs. ACL 2025, Jan 2025
2024
- [M4] M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2024
- [M4GT] M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
- [UrduNewsDetection] Detection of Human and Machine-Authored Fake News in Urdu. ACL 2025, Aug 2024
- [RethinkSTS] Rethinking STS and NLI in Large Language Models. In Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
- [OpenFactCheck] OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Nov 2024
- [Factcheck-Bench] Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers. In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
- [SimulMT] Conversational SimulMT: Efficient simultaneous translation with large language models. arXiv preprint arXiv:2402.10552, Nov 2024
- [DNA] Do-Not-Answer: Evaluating Safeguards in LLMs. In Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
2023
- [Collective STS] Collective Human Opinions in Semantic Textual Similarity. Transactions of the Association for Computational Linguistics, Nov 2023
- [PhD Thesis] Towards Accurate and Reliable Modelling for Semantic Textual Similarity. The University of Melbourne, Nov 2023
2022
- [NoisyRegulation] Noisy Label Regularisation for Textual Regression. In Proceedings of the 29th International Conference on Computational Linguistics, Oct 2022
- [Diformer] Diformer: Directional Transformer for Neural Machine Translation. In Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, Jun 2022
2021
- [MT] Joint-training on Symbiosis Networks for Deep Nueral Machine Translation models. arXiv preprint arXiv:2112.11642, Aug 2021
- [MT] Self-distillation mixup training for non-autoregressive neural machine translation. arXiv preprint arXiv:2112.11640, Nov 2021
- [MT] HW-TSC’s Participation at WMT 2021 Quality Estimation Shared Task. In Proceedings of the Sixth Conference on Machine Translation, Nov 2021
- [MT] The HW-TSC’s Offline Speech Translation Systems for IWSLT 2021 Evaluation. arXiv preprint arXiv:2108.03845, Nov 2021
- [MT] Incorporating complete syntactical knowledge for spoken language understanding. In Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction: 6th China Conference, CCKS 2021, Guangzhou, China, November 4-7, 2021, Proceedings 6, Nov 2021
- [MT] Incorporating Complete Syntactical Knowledge for Spoken Language Understanding. In Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction: 6th China Conference, CCKS 2021, Guangzhou, China, November 4-7, 2021, Proceedings, Nov 2021
2020
- [Medical] A multi-pass sieve for clinical concept normalization. Traitement Automatique Des Langues, Nov 2020