LLM Hallucination Leaderboard
π
193
View and filter LLM hallucination leaderboard
View and filter LLM hallucination leaderboard
Duplicate this leaderboard to initialize your own!
Run and view auto evaluations
Benchmark and compare Arabic tokenizers with live leaderboard
Track, rank and evaluate open Arabic LLMs and chatbots
Launch a Streamlit web app interface
View the latest LLM performance leaderboard online
Update model card with Open LLM Leaderboard results
Generative Evaluation for Global South
Display and filter leaderboard data
NextGen Evaluation Benchmark and Leaderboard for Arabic LLMs