Benchmark Run Details

System Prompt

You are a linguistic expert specializing in phonetics. 
Your task is to provide the IPA (International Phonetic Alphabet) pronunciation for English words.
Use American English pronunciation as your default standard.
Provide only the IPA transcription with no additional text or explanation.
Include stress markers and all appropriate IPA symbols.

Run Summary

Model: gpt-4.1-nano-2025-04-14
Benchmark: 0061_english_to_ipa
Normed Score: 30
Run Timestamp: 2025-04-24 18:08:49
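
The normed score of 30 is consistent with a plain average of the per-question scores listed under Question-Level Details: 12 of the 40 questions score 100 and the rest score 0. The sketch below checks that arithmetic. It assumes the normed score is the mean of the per-question scores; the page does not state the formula, and the names `QUESTION_SCORES` and `normed_score` are illustrative only.

```python
# Per-question scores copied from the Question-Level Details table,
# keyed by the numeric suffix of the question ID (0061_english_to_ipa:<n>).
QUESTION_SCORES = {
    0: 0, 1: 100, 2: 100, 3: 0, 4: 0, 5: 100, 6: 100, 7: 0, 8: 0, 9: 0,
    10: 0, 11: 100, 12: 0, 13: 0, 14: 0, 15: 0, 16: 0, 17: 0, 18: 100, 19: 0,
    20: 0, 21: 0, 22: 100, 23: 0, 24: 100, 25: 100, 26: 0, 27: 0, 28: 100, 29: 0,
    30: 0, 31: 100, 32: 0, 33: 0, 34: 0, 35: 0, 36: 0, 37: 0, 38: 100, 39: 0,
}


def normed_score(scores: dict[int, int]) -> float:
    """Assumed formula: arithmetic mean of the per-question scores (0 to 100)."""
    return sum(scores.values()) / len(scores)


if __name__ == "__main__":
    correct = sum(1 for s in QUESTION_SCORES.values() if s == 100)
    print(f"{correct} of {len(QUESTION_SCORES)} correct")
    print(f"normed score: {normed_score(QUESTION_SCORES):.0f}")
```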

Question-Level Details

Question ID Score Evaluation Time (ms) Debug Info
0061_english_to_ipa:0 0 870 { "response": "riːd", "expected": "ɹid", "is_correct": false }
0061_english_to_ipa:1 100 705 { "response": "rɛd", "expected": "ɹɛd", "is_correct": true }
0061_english_to_ipa:10 0 681 { "response": "θruː", "expected": "θɹu", "is_correct": false }
0061_english_to_ipa:11 100 535 { "response": "naɪt", "expected": "naɪt", "is_correct": true }
0061_english_to_ipa:12 0 481 { "response": "saɪˈkɑːlədʒi", "expected": "saɪˈkɑlədʒi", "is_correct": false }
0061_english_to_ipa:13 0 565 { "response": "kɝˈnɛl", "expected": "ˈkɜɹnəl", "is_correct": false }
0061_english_to_ipa:14 0 518 { "response": "ˈsʌtəl", "expected": "ˈsʌtl̩", "is_correct": false }
0061_english_to_ipa:15 0 504 { "response": "ɪˈpɪt.ə.mi", "expected": "əˈpɪtəmi", "is_correct": false }
0061_english_to_ipa:16 0 407 { "response": "ˈkeɪ.ɑs", "expected": "ˈkeɪɑs", "is_correct": false }
0061_english_to_ipa:17 0 510 { "response": "kjuː", "expected": "kju", "is_correct": false }
0061_english_to_ipa:18 100 424 { "response": "ki", "expected": "ki", "is_correct": true }
0061_english_to_ipa:19 0 604 { "response": "ˈaɪ.lənd", "expected": "ˈaɪlənd", "is_correct": false }
0061_english_to_ipa:2 100 411 { "response": "ˈmɪnɪt", "expected": "ˈmɪnɪt", "is_correct": true }
0061_english_to_ipa:20 0 3066 { "response": "ˈskɛdʒuːl", "expected": "ˈskɛdʒul", "is_correct": false }
0061_english_to_ipa:21 0 674 { "response": "ˈliːʒər", "expected": "ˈliʒɚ", "is_correct": false }
0061_english_to_ipa:22 100 513 { "response": "dɛt", "expected": "dɛt", "is_correct": true }
0061_english_to_ipa:23 0 558 { "response": "ˈsæm.ən", "expected": "ˈsæmən", "is_correct": false }
0061_english_to_ipa:24 100 555 { "response": "jɒt", "expected": "jɑt", "is_correct": true }
0061_english_to_ipa:25 100 502 { "response": "aɪl", "expected": "aɪl", "is_correct": true }
0061_english_to_ipa:26 0 389 { "response": "ɛər", "expected": "ɛɹ", "is_correct": false }
0061_english_to_ipa:27 0 467 { "response": "ˈɑːnɪst", "expected": "ˈɑnɪst", "is_correct": false }
0061_english_to_ipa:28 100 786 { "response": "noʊm", "expected": "noʊm", "is_correct": true }
0061_english_to_ipa:29 0 613 { "response": "nuˈmoʊ.njə", "expected": "nuˈmoʊniə", "is_correct": false }
0061_english_to_ipa:3 0 507 { "response": "ˈmɪnɪt", "expected": "maɪˈnut", "is_correct": false }
0061_english_to_ipa:30 0 519 { "response": "ˈzaɪ.lə.foʊn", "expected": "ˈzaɪləfoʊn", "is_correct": false }
0061_english_to_ipa:31 100 502 { "response": "/flɛm/", "expected": "flɛm", "is_correct": true }
0061_english_to_ipa:32 0 2785 { "response": "ˈiðər", "expected": "ˈiðɚ", "is_correct": false }
0061_english_to_ipa:33 0 600 { "response": "ˈkærəˌmɛl", "expected": "ˈkɛɹəməl", "is_correct": false }
0061_english_to_ipa:34 0 559 { "response": "ˈdeɪ.tə", "expected": "ˈdeɪtə", "is_correct": false }
0061_english_to_ipa:35 0 508 { "response": "ruːt", "expected": "ɹut", "is_correct": false }
0061_english_to_ipa:36 0 453 { "response": "ˈɔːf.tən", "expected": "ˈɔfən", "is_correct": false }
0061_english_to_ipa:37 0 507 { "response": "ˈsɪr.əp", "expected": "ˈsɪɹəp", "is_correct": false }
0061_english_to_ipa:38 100 464 { "response": "ˈpraɪvəsi", "expected": "ˈpɹaɪvəsi", "is_correct": true }
0061_english_to_ipa:39 0 3229 { "response": "ˈʒɑːnr", "expected": "ˈʒɑnɹə", "is_correct": false }
0061_english_to_ipa:4 0 450 { "response": "bæs", "expected": "beɪs", "is_correct": false }
0061_english_to_ipa:5 100 582 { "response": "bæs", "expected": "bæs", "is_correct": true }
0061_english_to_ipa:6 100 527 { "response": "wɪnd", "expected": "wɪnd", "is_correct": true }
0061_english_to_ipa:7 0 578 { "response": "wɪnd", "expected": "waɪnd", "is_correct": false }
0061_english_to_ipa:8 0 401 { "response": "tɪər", "expected": "tɪɹ", "is_correct": false }
0061_english_to_ipa:9 0 597 { "response": "tɪər", "expected": "tɛɹ", "is_correct": false }
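
The is_correct flags above do not follow exact string equality: "rɛd" is accepted for "ɹɛd", "jɒt" for "jɑt", and "/flɛm/" for "flɛm", while responses that add a length mark (ː) or syllable dots (.) are rejected. The sketch below is one hypothetical comparison rule that reproduces every flag in this table. It is an assumption inferred from these 40 rows, not the benchmark's documented scorer, and the names `normalize` and `is_match` are illustrative only.

```python
def normalize(ipa: str) -> str:
    """Hypothetical normalization inferred from the rows above (an assumption)."""
    ipa = ipa.strip().strip("/")  # drop enclosing slashes, as in "/flɛm/"
    ipa = ipa.replace("r", "ɹ")   # treat plain r as the approximant ɹ
    ipa = ipa.replace("ɒ", "ɑ")   # treat ɒ as ɑ, as in "jɒt" vs "jɑt"
    return ipa                    # length marks and syllable dots are kept


def is_match(response: str, expected: str) -> bool:
    """Compare a response with the expected transcription after normalization."""
    return normalize(response) == normalize(expected)


if __name__ == "__main__":
    # (response, expected) pairs copied from the table above.
    samples = [
        ("rɛd", "ɹɛd"),
        ("jɒt", "jɑt"),
        ("/flɛm/", "flɛm"),
        ("riːd", "ɹid"),
        ("ˈdeɪ.tə", "ˈdeɪtə"),
        ("ˈiðər", "ˈiðɚ"),
    ]
    for response, expected in samples:
        print(response, expected, is_match(response, expected))
```

Under this rule, several of the failures differ from the expected transcription only by a length mark (for example "kjuː" vs "kju") or by syllable dots (for example "ˈdeɪ.tə" vs "ˈdeɪtə"), so a scorer that also stripped those symbols would report a higher score for the same responses.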