Publications
Research output from the Applied NLP Group
TurkBench: A Benchmark for Evaluating Turkish LLMs
arXiv preprint arXiv:2601.07020, 2026
FIBER: A Multilingual Evaluation Resource for Factual Inference Bias
arXiv preprint arXiv:2512.11110, 2025
Evaluating the quality of benchmark datasets for low-resource languages: A case study on Turkish
Proc. of the 4th Workshop on Generation, Evaluation and Metrics, 2025
OpenEthics: A Comprehensive Ethical Evaluation of Open-Source Generative LLMs
arXiv preprint arXiv:2505.16036, 2025

Adapting open-source generative LLMs for low-resource languages: A case study for Turkish
Proc. of the 4th Workshop on Multilingual Representation Learning, 2024

MiDe22: An annotated multi-event tweet dataset for misinformation detection
Proc. of the 2024 Joint Int. Conf. on Computational Linguistics, 2024

JL-Hate: An Annotated Dataset for Joint Learning of Hate Speech and Target Detection
Proc. of the 2024 Joint Int. Conf. on Computational Linguistics, 2024
PejorativITy: Disambiguating pejorative epithets to improve misogyny detection
Proc. of the 2024 Joint Int. Conf. on Computational Linguistics, 2024
Arc-nlp at climateactivism 2024: Stance and hate speech detection
Proc. of the 7th Workshop on Challenges of Automated Processing, 2024
Detecting Misinformation on Social Media Using Community Insights
ACM Transactions on Intelligent Systems and Technology, 2024
Constructing ensembles for hate speech detection
Natural Language Processing, 2024
SiMiD: Similarity-based misinformation detection via communities
2023 10th Int. Conf. on Social Networks Analysis, 2023
ARC-NLP at PAN 2023: Writing Style Detection
arXiv preprint arXiv:2307.14913, 2023
ARC-NLP at PAN 2023: Hierarchical long text classification
arXiv preprint arXiv:2307.14912, 2023
Arc-nlp at multimodal hate speech event detection 2023
arXiv preprint arXiv:2307.13829, 2023
Zero and few-shot hate speech detection in earthquake disaster
2023 31st Signal Processing and Communications Conf. (SIU), 2023
The effect of gender bias on hate speech detection
Signal, Image and Video Processing 17 (4), 2023

Impact of tokenization on language models: An analysis for Turkish
ACM Trans. on Asian and Low-Resource Language Information, 2023
Tweets under the rubble: Detection of messages calling for help (v1)
arXiv preprint arXiv:2302.13403, 2023
Tweets under the rubble: Detection of messages calling for help (v2)
arXiv preprint arXiv:2302.13403, 2023
ARC-NLP at CASE 2022: Ensemble learning for protest detection
Proc. of the 5th Workshop on Automated Processing, 2022

Understanding social engagements: Comparative analysis of Twitter
Social network analysis and mining 12 (1), 2022

Named entity recognition in Turkish: A comparative study
Information Processing & Management 59 (6), 2022

D2U: distance-to-uniform learning for out-of-scope detection
Proc. of NAACL 2022, 2022

Blacklivesmatter 2020: analysis of deleted and suspended users
Proc. of the 14th ACM Web Science Conference, 2022

Large-scale hate speech detection with cross-domain transfer
Proc. of the 13th Language Resources and Evaluation Conference, 2022
Slot filling for voice assistants
2022 30th Signal Processing and Communications Conf. (SIU), 2022

ARC-NLP at CheckThat!-2022: Contradiction for Harmful Tweets
CLEF (Working Notes), 2022
Event-related microblog retrieval in Turkish
Turkish Journal of Electrical Engineering & Computer Sciences, 2022
Conqx: Semantic expansion of spoken queries
arXiv preprint arXiv:2109.00729, 2021
Topic Detection based on Deep Learning in Turkish Microblogs
2021 29th Signal Processing and Communications Conf. (SIU), 2021
Intent classification based on deep learning in Turkish dialogs
2021 29th Signal Processing and Communications Conf. (SIU), 2021

Tweet Length Matters: Topic Detection in Microblogs
European Conference on Information Retrieval, 2021
Türkçe mikroblog metinlerinde derin öğrenme tabanlı konu tespiti
IEEE, 2021

KLOOS: KL divergence-based out-of-scope intent detection
Proc. of the 43rd international ACM SIGIR conference, 2020
Crosssimon: Probabilistic approach to OSN simulation
2019 IEEE Int. Conf. on Intelligence and Security Informatics, 2019
SimON-Feedback: Performance tuning in social simulation
2019 IEEE Int. Conf. on Intelligence and Security Informatics, 2019
Deep learning approach to modeling temporal networks on Reddit
2019 IEEE Int. Conf. on Intelligence and Security Informatics, 2019

Discovering story chains: zigzagged search and news actors
Journal of the Association for Information Science and Technology, 2017
Early prediction of public reactions using microblogs
7th BCS-IRSG Symposium on Future Directions, 2017
Past, present, and future on news streams (Thesis)
Bilkent University, 2017

A front-page news-selection algorithm based on topic modelling
Journal of Information Science 41 (5), 2015
Türkçe Haber Yazılarında Sosyal Ağların İncelenmesi
17. Akademik Bilişim Konferansı, 2015
News Selection with Topic Modeling
5th BCS-IRSG Symposium (FDIA 2013), 2013
Haber Yığınlarında Konu Başlıklarının Belirlenmesi
29. Ulusal Bilişim Kurultayı (Bilişim 2012), 2012
Squeezing the ensemble pruning: Faster news categorization
European Conference on Information Retrieval, 2012
Ensemble pruning for text categorization (Data Partitioning)
Asia Information Retrieval Symposium, 2011

Developing a text categorization template for Turkish news
2011 Int. Symposium on Innovations in Intelligent Systems, 2011
Text categorization and ensemble pruning in Turkish news (Thesis)
Bilkent University, 2011
METU NLP Lab