Middle East Technical University
Department of Computer Engineering
METU LLM Benchmark is a comprehensive evaluation framework designed to assess the performance of Large Language Models on Turkish language tasks. This project aims to facilitate the development of more accurate and reliable Turkish language models.