ThaiSafetyBench Leaderboard 🛡️

[arXiv Paper] [GitHub] [Hugging Face Dataset 🤗]

ThaiSafetyBench is a safety benchmark tailored to the Thai language and culture.

Leaderboard

Columns:
Model
🥇 Overall ASR ⬇️
👉 Discrimination, Exclusion, Toxicity, Hateful, Offensive ASR ⬇️
👉 Human-Chatbot Interaction Harm ASR ⬇️
👉 Information Hazards ASR ⬇️
👉 Malicious Uses ASR ⬇️
👉 Misinformation Harms ASR ⬇️
👉 Thai Socio-Cultural Harms ASR ⬇️

llama3.1-typhoon2-70b-instruct

10.99
13.94
17.09
16.83
11.48
13.41
19.47