Benchmark Run Details

System Prompt

You are helping with a language translation task.
When translating a word from EN to FR:
- Provide the most direct and common translation
- Give only the base form of the word
- Do not include articles unless they are part of the standard translation
- Do not provide explanations or alternative translations

Run Summary

Model: gemini-2.5-flash-preview-04-17
Benchmark: 0050_translation_en_fr
Normed Score: 100
Run Timestamp: 2025-04-24 19:28:01

Question-Level Details

Question ID Score Evaluation Time (ms) Debug Info
0050_translation_en_fr:0 100 1197 { "response": "fleur", "expected": "fleur", "is_correct": true }
0050_translation_en_fr:1 100 1867 { "response": "conduire", "expected": "conduire", "is_correct": true }
0050_translation_en_fr:10 100 2879 { "response": "montagne", "expected": "montagne", "is_correct": true }
0050_translation_en_fr:11 100 1225 { "response": "chanter", "expected": "chanter", "is_correct": true }
0050_translation_en_fr:12 100 1266 { "response": "cœur", "expected": "cœur", "is_correct": true }
0050_translation_en_fr:13 100 1190 { "response": "lumineux", "expected": "lumineux", "is_correct": true }
0050_translation_en_fr:14 100 1567 { "response": "danser", "expected": "danser", "is_correct": true }
0050_translation_en_fr:15 100 1553 { "response": "silencieux", "expected": "silencieux", "is_correct": true }
0050_translation_en_fr:16 100 970 { "response": "arbre", "expected": "arbre", "is_correct": true }
0050_translation_en_fr:17 100 1127 { "response": "écrire", "expected": "écrire", "is_correct": true }
0050_translation_en_fr:18 100 1369 { "response": "ciel", "expected": "ciel", "is_correct": true }
0050_translation_en_fr:19 100 1031 { "response": "frais", "expected": "frais", "is_correct": true }
0050_translation_en_fr:2 100 1312 { "response": "le plus fort", "expected": "le plus fort", "is_correct": true }
0050_translation_en_fr:20 100 1176 { "response": "ami", "expected": "ami", "is_correct": true }
0050_translation_en_fr:21 100 1566 { "response": "courir", "expected": "courir", "is_correct": true }
0050_translation_en_fr:22 100 1147 { "response": "rond", "expected": "rond", "is_correct": true }
0050_translation_en_fr:23 100 955 { "response": "vent", "expected": "vent", "is_correct": true }
0050_translation_en_fr:24 100 1242 { "response": "chaud", "expected": "chaud", "is_correct": true }
0050_translation_en_fr:25 100 1055 { "response": "pierre", "expected": "pierre", "is_correct": true }
0050_translation_en_fr:26 100 1401 { "response": "profond", "expected": "profond", "is_correct": true }
0050_translation_en_fr:27 100 1130 { "response": "nager", "expected": "nager", "is_correct": true }
0050_translation_en_fr:28 100 1229 { "response": "oiseau", "expected": "oiseau", "is_correct": true }
0050_translation_en_fr:29 100 1013 { "response": "sucré", "expected": "sucré", "is_correct": true }
0050_translation_en_fr:3 100 1270 { "response": "livre", "expected": "livre", "is_correct": true }
0050_translation_en_fr:30 100 1228 { "response": "nuage", "expected": "nuage", "is_correct": true }
0050_translation_en_fr:31 100 1637 { "response": "sourire", "expected": "sourire", "is_correct": true }
0050_translation_en_fr:32 100 1151 { "response": "doux", "expected": "doux", "is_correct": true }
0050_translation_en_fr:33 100 1344 { "response": "pluie", "expected": "pluie", "is_correct": true }
0050_translation_en_fr:34 100 2652 { "response": "grandir", "expected": "grandir", "is_correct": true }
0050_translation_en_fr:35 100 1570 { "response": "rapide", "expected": "rapide", "is_correct": true }
0050_translation_en_fr:36 100 1166 { "response": "étoile", "expected": "étoile", "is_correct": true }
0050_translation_en_fr:37 100 1429 { "response": "danser", "expected": "danser", "is_correct": true }
0050_translation_en_fr:38 100 1376 { "response": "lourd", "expected": "lourd", "is_correct": true }
0050_translation_en_fr:39 100 1532 { "response": "feu", "expected": "feu", "is_correct": true }
0050_translation_en_fr:4 100 1179 { "response": "manger", "expected": "manger", "is_correct": true }
0050_translation_en_fr:40 100 1668 { "response": "respirer", "expected": "respirer", "is_correct": true }
0050_translation_en_fr:41 100 1538 { "response": "pointu", "expected": "pointu", "is_correct": true }
0050_translation_en_fr:42 100 1224 { "response": "sel", "expected": "sel", "is_correct": true }
0050_translation_en_fr:43 100 1213 { "response": "cacher", "expected": "cacher", "is_correct": true }
0050_translation_en_fr:44 100 1345 { "response": "lisse", "expected": "lisse", "is_correct": true }
0050_translation_en_fr:45 100 1050 { "response": "herbe", "expected": "herbe", "is_correct": true }
0050_translation_en_fr:46 100 1918 { "response": "voler", "expected": "voler", "is_correct": true }
0050_translation_en_fr:47 100 1463 { "response": "froid", "expected": "froid", "is_correct": true }
0050_translation_en_fr:48 100 1254 { "response": "sable", "expected": "sable", "is_correct": true }
0050_translation_en_fr:49 100 1481 { "response": "tomber", "expected": "tomber", "is_correct": true }
0050_translation_en_fr:5 100 1681 { "response": "beau/belle", "expected": "beau/belle", "is_correct": true }
0050_translation_en_fr:50 100 1323 { "response": "léger", "expected": "léger", "is_correct": true }
0050_translation_en_fr:6 100 1383 { "response": "eau", "expected": "eau", "is_correct": true }
0050_translation_en_fr:7 100 1753 { "response": "dormir", "expected": "dormir", "is_correct": true }
0050_translation_en_fr:8 100 1405 { "response": "lune", "expected": "lune", "is_correct": true }
0050_translation_en_fr:9 100 2242 { "response": "rire", "expected": "rire", "is_correct": true }
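
Sketch: reading the question-level rows

The rows above share one layout: question ID, score, evaluation time in milliseconds, then a JSON debug object. The following is a minimal Python sketch of how such rows could be parsed and aggregated. It rests on two assumptions that the report itself does not confirm: that the normed score is the plain mean of the per-question scores, and that `is_correct` reflects an exact string match between `response` and `expected`. The helper name `parse_row` is hypothetical, and only three of the rows are reproduced here.

```python
import json
from statistics import mean


def parse_row(line):
    """Split one row into (question_id, score, time_ms, debug_info).

    Hypothetical helper: assumes the first three fields contain no spaces
    and the remainder of the line is a JSON object.
    """
    question_id, score, time_ms, debug_json = line.split(" ", 3)
    return question_id, float(score), int(time_ms), json.loads(debug_json)


# Three rows copied from the table above.
rows = [
    '0050_translation_en_fr:0 100 1197 { "response": "fleur", "expected": "fleur", "is_correct": true }',
    '0050_translation_en_fr:12 100 1266 { "response": "cœur", "expected": "cœur", "is_correct": true }',
    '0050_translation_en_fr:5 100 1681 { "response": "beau/belle", "expected": "beau/belle", "is_correct": true }',
]

parsed = [parse_row(row) for row in rows]

# Assumption: the normed score is the mean of the per-question scores.
normed_score = mean(score for _, score, _, _ in parsed)

# Mean evaluation time across the sampled rows, in milliseconds.
mean_time_ms = mean(time_ms for _, _, time_ms, _ in parsed)

# Assumption: grading is an exact string match on response vs. expected.
exact_match_consistent = all(
    info["is_correct"] == (info["response"] == info["expected"])
    for _, _, _, info in parsed
)

print(normed_score, round(mean_time_ms, 1), exact_match_consistent)
```

Running the same parsing over all rows would show whether the reported Normed Score of 100 is consistent with a simple mean under these assumptions.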