Benchmark Run Details

System Prompt

You are helping with a language translation task.
When translating a word from EN to ZH:
- Provide the most direct and common translation
- Give only the base form of the word
- Do not include articles unless they are part of the standard translation
- Do not provide explanations or alternative translations

Run Summary

Model: lmstudio/granite-3.3-8b-instruct
Benchmark: 0050_translation_en_zh
Normed Score: 98
Run Timestamp: 2025-04-29 17:19:30

Question-Level Details

Question ID Score Evaluation Time (ms) Debug Info
0050_translation_en_zh:0 100 1576 { "response": "花", "expected": "花", "is_correct": true }
0050_translation_en_zh:1 100 1367 { "response": "开车", "expected": "开车", "is_correct": true }
0050_translation_en_zh:10 100 1338 { "response": "山", "expected": "山", "is_correct": true }
0050_translation_en_zh:11 100 1422 { "response": "唱歌", "expected": "唱歌", "is_correct": true }
0050_translation_en_zh:12 100 1311 { "response": "心", "expected": "心", "is_correct": true }
0050_translation_en_zh:13 100 1445 { "response": "明亮", "expected": "明亮", "is_correct": true }
0050_translation_en_zh:14 100 1394 { "response": "跳舞", "expected": "跳舞", "is_correct": true }
0050_translation_en_zh:15 100 1343 { "response": "安静", "expected": "安静", "is_correct": true }
0050_translation_en_zh:16 100 1282 { "response": "树", "expected": "树", "is_correct": true }
0050_translation_en_zh:17 100 1309 { "response": "写", "expected": "写", "is_correct": true }
0050_translation_en_zh:18 100 1351 { "response": "天空", "expected": "天空", "is_correct": true }
0050_translation_en_zh:19 100 1465 { "response": "新鲜", "expected": "新鲜", "is_correct": true }
0050_translation_en_zh:2 100 1367 { "response": "最强", "expected": "最强", "is_correct": true }
0050_translation_en_zh:20 100 1416 { "response": "朋友", "expected": "朋友", "is_correct": true }
0050_translation_en_zh:21 100 1388 { "response": "跑", "expected": "跑", "is_correct": true }
0050_translation_en_zh:22 100 1375 { "response": "圆", "expected": "圆", "is_correct": true }
0050_translation_en_zh:23 100 1319 { "response": "风", "expected": "风", "is_correct": true }
0050_translation_en_zh:24 100 1461 { "response": "温暖", "expected": "温暖", "is_correct": true }
0050_translation_en_zh:25 100 1378 { "response": "石头", "expected": "石头", "is_correct": true }
0050_translation_en_zh:26 100 1362 { "response": "深", "expected": "深", "is_correct": true }
0050_translation_en_zh:27 100 1406 { "response": "游泳", "expected": "游泳", "is_correct": true }
0050_translation_en_zh:28 100 1338 { "response": "鸟", "expected": "鸟", "is_correct": true }
0050_translation_en_zh:29 100 1321 { "response": "甜", "expected": "甜", "is_correct": true }
0050_translation_en_zh:3 100 1300 { "response": "书", "expected": "书", "is_correct": true }
0050_translation_en_zh:30 100 1296 { "response": "云", "expected": "云", "is_correct": true }
0050_translation_en_zh:31 100 1428 { "response": "微笑", "expected": "微笑", "is_correct": true }
0050_translation_en_zh:32 100 1365 { "response": "软", "expected": "软", "is_correct": true }
0050_translation_en_zh:33 100 1444 { "response": "雨", "expected": "雨", "is_correct": true }
0050_translation_en_zh:34 100 1381 { "response": "成长", "expected": "成长", "is_correct": true }
0050_translation_en_zh:35 100 1361 { "response": "快", "expected": "快", "is_correct": true }
0050_translation_en_zh:36 100 1355 { "response": "星星", "expected": "星星", "is_correct": true }
0050_translation_en_zh:37 100 1519 { "response": "跳舞", "expected": "跳舞", "is_correct": true }
0050_translation_en_zh:38 100 1285 { "response": "重", "expected": "重", "is_correct": true }
0050_translation_en_zh:39 100 1316 { "response": "火", "expected": "火", "is_correct": true }
0050_translation_en_zh:4 100 1329 { "response": "吃", "expected": "吃", "is_correct": true }
0050_translation_en_zh:40 0 3293 { "response": "", "expected": "呼吸", "is_correct": false }
0050_translation_en_zh:41 100 1432 { "response": "锋利", "expected": "锋利", "is_correct": true }
0050_translation_en_zh:42 100 1412 { "response": "盐", "expected": "盐", "is_correct": true }
0050_translation_en_zh:43 100 1412 { "response": "躲藏", "expected": "躲藏", "is_correct": true }
0050_translation_en_zh:44 100 1400 { "response": "光滑", "expected": "光滑", "is_correct": true }
0050_translation_en_zh:45 100 1370 { "response": "草", "expected": "草", "is_correct": true }
0050_translation_en_zh:46 100 1318 { "response": "飞", "expected": "飞", "is_correct": true }
0050_translation_en_zh:47 100 1369 { "response": "冷", "expected": "冷", "is_correct": true }
0050_translation_en_zh:48 100 1359 { "response": "沙", "expected": "沙", "is_correct": true }
0050_translation_en_zh:49 100 1364 { "response": "落下", "expected": "落下", "is_correct": true }
0050_translation_en_zh:5 100 1475 { "response": "美丽", "expected": "美丽", "is_correct": true }
0050_translation_en_zh:50 100 1376 { "response": "轻", "expected": "轻", "is_correct": true }
0050_translation_en_zh:6 100 1500 { "response": "水", "expected": "水", "is_correct": true }
0050_translation_en_zh:7 100 1394 { "response": "睡觉", "expected": "睡觉", "is_correct": true }
0050_translation_en_zh:8 100 1448 { "response": "月亮", "expected": "月亮", "is_correct": true }
0050_translation_en_zh:9 100 1346 { "response": "笑", "expected": "笑", "is_correct": true }
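
The rows above list 51 questions, of which 50 score 100 and one (question 40, an empty response) scores 0. If the Normed Score is the rounded mean of the per-question scores, which is an assumption and not something this report states, that gives 5000 / 51 ≈ 98.04, consistent with the reported 98. A minimal sketch of that check, assuming each row is "ID score time_ms JSON" separated by whitespace; the helper names `parse_row` and `summarize` are hypothetical and not part of any benchmark tooling:

```python
import json


def parse_row(line):
    """Split one question row into (question_id, score, time_ms, debug_info)."""
    # The first three fields are whitespace-separated; the rest is the JSON debug blob.
    question_id, score, time_ms, debug = line.split(maxsplit=3)
    return question_id, int(score), int(time_ms), json.loads(debug)


def summarize(lines):
    """Return (rounded mean score, IDs of questions marked incorrect)."""
    rows = [parse_row(line) for line in lines if line.strip()]
    mean_score = sum(row[1] for row in rows) / len(rows)
    failed = [row[0] for row in rows if not row[3]["is_correct"]]
    return round(mean_score), failed


# Two rows copied from the table above, used only as sample input.
sample = [
    '0050_translation_en_zh:39 100 1316 { "response": "火", "expected": "火", "is_correct": true }',
    '0050_translation_en_zh:40 0 3293 { "response": "", "expected": "呼吸", "is_correct": false }',
]

score, failed = summarize(sample)
print(score, failed)
```

Running `summarize` over all 51 rows would be the way to confirm the summary figure; the two-row sample only demonstrates the parsing.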