Benchmark Run Details

Run Summary

Model: gemma2:9b:Q4_0
Benchmark: 0050_translation_en_fr
Normed Score: 98
Run Timestamp: 2025-03-26 19:39:23

Question-Level Details

Question ID                  Score  Evaluation Time (ms)  Debug Info
0050_translation_en_fr:0     100    6327                  {}
0050_translation_en_fr:1     100    1237                  {}
0050_translation_en_fr:10    100    1248                  {}
0050_translation_en_fr:11    100    1194                  {}
0050_translation_en_fr:12    100    1147                  {}
0050_translation_en_fr:13    100    1162                  {}
0050_translation_en_fr:14    100    1110                  {}
0050_translation_en_fr:15    100    1115                  {}
0050_translation_en_fr:16    100    1273                  {}
0050_translation_en_fr:17    100    1287                  {}
0050_translation_en_fr:18    100    1120                  {}
0050_translation_en_fr:19    100    1203                  {}
0050_translation_en_fr:2     100    1290                  {}
0050_translation_en_fr:20    100    1359                  {}
0050_translation_en_fr:21    100    1122                  {}
0050_translation_en_fr:22    100    1180                  {}
0050_translation_en_fr:23    100    1175                  {}
0050_translation_en_fr:24    100    1188                  {}
0050_translation_en_fr:25    100    1161                  {}
0050_translation_en_fr:26    100    1215                  {}
0050_translation_en_fr:27    100    1231                  {}
0050_translation_en_fr:28    100    1245                  {}
0050_translation_en_fr:29    100    1273                  {}
0050_translation_en_fr:3     100    1185                  {}
0050_translation_en_fr:30    100    1232                  {}
0050_translation_en_fr:31    100    1186                  {}
0050_translation_en_fr:32    100    1214                  {}
0050_translation_en_fr:33    100    1214                  {}
0050_translation_en_fr:34    100    1242                  {}
0050_translation_en_fr:35    100    1145                  {}
0050_translation_en_fr:36    100    1200                  {}
0050_translation_en_fr:37    100    950                   {}
0050_translation_en_fr:38    100    1203                  {}
0050_translation_en_fr:39    100    1043                  {}
0050_translation_en_fr:4     100    1259                  {}
0050_translation_en_fr:40    100    1211                  {}
0050_translation_en_fr:41    100    1310                  {}
0050_translation_en_fr:42    100    1149                  {}
0050_translation_en_fr:43    100    1299                  {}
0050_translation_en_fr:44    100    1320                  {}
0050_translation_en_fr:45    100    1105                  {}
0050_translation_en_fr:46    100    1174                  {}
0050_translation_en_fr:47    100    1192                  {}
0050_translation_en_fr:48    100    1193                  {}
0050_translation_en_fr:49    100    1320                  {}
0050_translation_en_fr:5     0      1184                  { "response": "beau", "expected": "beau/belle" }
0050_translation_en_fr:50    100    1203                  {}
0050_translation_en_fr:6     100    1158                  {}
0050_translation_en_fr:7     100    1245                  {}
0050_translation_en_fr:8     100    1181                  {}
0050_translation_en_fr:9     100    1054                  {}
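
The Normed Score of 98 in the run summary is consistent with an unweighted mean of the per-question scores, rounded to the nearest integer: 50 of the 51 questions scored 100 and one (0050_translation_en_fr:5) scored 0. The report does not state how the normed score is computed, so the averaging rule in this sketch is an assumption, and the variable names are illustrative only.

```python
# Per-question scores as listed in this report: question IDs 0 through 50,
# all scored 100 except 0050_translation_en_fr:5, which scored 0.
scores = {f"0050_translation_en_fr:{i}": 100 for i in range(51)}
scores["0050_translation_en_fr:5"] = 0

# Assumption: the normed score is the unweighted mean of the per-question
# scores, rounded to the nearest integer. The report does not confirm this.
mean_score = sum(scores.values()) / len(scores)
normed_score = round(mean_score)

print(f"questions={len(scores)} mean={mean_score:.2f} normed={normed_score}")
```

Under that assumption the mean is 5000 / 51, which rounds to the reported 98.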