Benchmark Run Details

Run Summary

Model: gemma3:1b:Q4_K_M
Benchmark: 0050_translation_en_fr
Normed Score: 84
Run Timestamp: 2025-03-26 19:45:45

Question-Level Details

Question ID                Score  Evaluation Time (ms)  Debug Info
0050_translation_en_fr:0   100    963                   {}
0050_translation_en_fr:1   100    260                   {}
0050_translation_en_fr:10  100    357                   {}
0050_translation_en_fr:11  100    289                   {}
0050_translation_en_fr:12  100    314                   {}
0050_translation_en_fr:13  100    305                   {}
0050_translation_en_fr:14  0      313                   { "response": "dancer", "expected": "danser" }
0050_translation_en_fr:15  100    330                   {}
0050_translation_en_fr:16  100    295                   {}
0050_translation_en_fr:17  100    316                   {}
0050_translation_en_fr:18  100    341                   {}
0050_translation_en_fr:19  100    346                   {}
0050_translation_en_fr:2   0      272                   { "response": "léger", "expected": "le plus fort" }
0050_translation_en_fr:20  100    416                   {}
0050_translation_en_fr:21  100    412                   {}
0050_translation_en_fr:22  100    392                   {}
0050_translation_en_fr:23  100    352                   {}
0050_translation_en_fr:24  100    313                   {}
0050_translation_en_fr:25  100    282                   {}
0050_translation_en_fr:26  100    272                   {}
0050_translation_en_fr:27  100    263                   {}
0050_translation_en_fr:28  100    276                   {}
0050_translation_en_fr:29  100    264                   {}
0050_translation_en_fr:3   100    308                   {}
0050_translation_en_fr:30  100    284                   {}
0050_translation_en_fr:31  100    285                   {}
0050_translation_en_fr:32  100    283                   {}
0050_translation_en_fr:33  100    288                   {}
0050_translation_en_fr:34  100    306                   {}
0050_translation_en_fr:35  100    306                   {}
0050_translation_en_fr:36  100    324                   {}
0050_translation_en_fr:37  0      319                   { "response": "danse", "expected": "danser" }
0050_translation_en_fr:38  100    346                   {}
0050_translation_en_fr:39  100    324                   {}
0050_translation_en_fr:4   100    286                   {}
0050_translation_en_fr:40  100    347                   {}
0050_translation_en_fr:41  0      363                   { "response": "sharp", "expected": "pointu" }
0050_translation_en_fr:42  100    285                   {}
0050_translation_en_fr:43  0      275                   { "response": "casser", "expected": "cacher" }
0050_translation_en_fr:44  100    263                   {}
0050_translation_en_fr:45  100    249                   {}
0050_translation_en_fr:46  100    250                   {}
0050_translation_en_fr:47  100    244                   {}
0050_translation_en_fr:48  100    254                   {}
0050_translation_en_fr:49  0      240                   { "response": "danser", "expected": "tomber" }
0050_translation_en_fr:5   0      256                   { "response": "beau", "expected": "beau/belle" }
0050_translation_en_fr:50  100    265                   {}
0050_translation_en_fr:6   100    291                   {}
0050_translation_en_fr:7   0      572                   { "response": "duler", "expected": "dormir" }
0050_translation_en_fr:8   100    966                   {}
0050_translation_en_fr:9   100    525                   {}
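
The Normed Score of 84 in the Run Summary appears consistent with a plain average of the per-question scores listed above: 51 questions, of which 8 scored 0 and 43 scored 100. The report does not state how the normed score is derived, so the averaging rule below is an assumption, and the names `failed_ids` and `normed_score` are hypothetical, introduced only for this sketch.

```python
# Question indices that scored 0 in the table above; all others scored 100.
failed_ids = {2, 5, 7, 14, 37, 41, 43, 49}

# Question IDs run from 0 to 50 inclusive, i.e. 51 questions.
scores = {qid: (0 if qid in failed_ids else 100) for qid in range(51)}


def normed_score(scores):
    """Assumed aggregation: arithmetic mean of the per-question scores."""
    return sum(scores.values()) / len(scores)


mean = normed_score(scores)
print(f"{mean:.2f}")  # 84.31
```

Under that assumption the mean is 4300 / 51, which matches the reported 84 whether the harness rounds or truncates.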