Speller Test Results for: «sp-wordtype-mw-sma.txt»
Overview
Technical data
Language tested: sma
Document tested: sp-wordtype-mw-sma.txt
Speller tool: AppleScript driving MS Word
Speller tool version: MS Word 14.1.3 (= 2011), AppleScript version: $Revision: 46521 $ $Date: 2011-09-22 19:49:46 +0200 (Thu, 22 Sep 2011) $
Speller lexicon version: Åarjelsaemien, version 1.0, 2010-11-30
Test Date: 20110922-2338
Test Type: wordtype
Processing time (user+system time): No data available
Speed: No data available
Max memory usage of the speller (max value of /bin/ps -o rss= PID sampled every 10 second): No data available
Result summary
Nº of input words: 98
Nº of spelling errors: 49 (50,0% of all words)
| Speller view | Wrong tokenisation | |||
|---|---|---|---|---|
| Speller Positive (Nº of flagged words): 46 | Speller Negative (Nº of accepted words): 30 | All tokenisation errors: 0 |
||
| Reality | Nº of real errors: 49 | Nº of true positives (detected real errors): 38 | Nº of false negatives (unflagged spelling errors): 0 | Nº of errouneously tokenized spelling errors: 0 |
| Nº of real correct words: 49 | Nº of false positives (incorrectly flagged words): 8 | Nº of true negatives (unflagged correct words): 30 | Nº of errouneously tokenized correct words: 0 |
|
11 input word(s) was/were discarded because the speller (or MS Word) could not deal with them properly. The calculation of Precision, Recall and Accuracy below does NOT include this/these word(s).
Precision (tp/(tp+fp)):
82,61%
Recall (tp/(tp+fn)):
100,0%
Accuracy ((tp+tn)/words):
89,47%
| Spelling errors: 49 | ||||
|---|---|---|---|---|
| Simple errors (edit dist. 1): | Errors with edit distance 2: | Errors with edit distance ≥3: |
||
| Suggestion statistics for true positives (= 38 = 100%): | 41 (83,67%) | 7 (14,29%) | 1 (2,04%) | |
| Nº of detected spelling errors with correct suggestion in top 3: | 22 (57.89 %) | 20 | 2 | 0 |
| Nº of detected spelling errors with correct suggestionbelow top 3: | 4 (10.53 %) | 4 | 0 | 0 |
| Nº of detected spelling errors with only wrong suggestions: | 12 (31.58 %) | 8 | 3 | 1 |
| Nº of detected spelling errors with no suggestions at all: | 0 (0 %) | 0 | 0 | 0 |
| Undetected spelling errors: | 0 | 0 | 0 | 0 |
True positives (38)
Spelling errors with correct suggestions (26)
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| moerensærnieh | moerensaernieh | 2 |
| Gen noun + noun compound |
| jælam | jealam | 2 |
| Verb |
| vaeredimmiesaakenjaevriem | vaeriedimmiesaakenjaevriem | 1 |
| Deverbal noun + noun + noun compound |
| durriemahkeldhgåetijammesistie | durriemahkeldhgåatijammesistie | 1 |
| Prefix + derivated adj |
| aeblehtsvoetegærjaamhtesh | aeblehtsvoetegærjaaamhtesh | 1 |
| Noun + noun + noun compound |
| moereprohtokollemistijæjnedh | moereprotokollemistijæjnedh | 1 |
| Noun + noun + derived noun compound |
| balvedhlaketje | balvedhlaaketje | 1 |
| Adj |
| bienjengåaksahvearaldahke | bïenjengåaksahvearaldahke | 1 |
| Compound noun |
| gervelibie | gervielibie | 1 |
| Deverbal verb |
| doelmestid | doelmestidh | 1 |
| Deverbal verb |
| gieltijesakesanne | gieltijesaakesanne | 1 |
| Deverbal noun + noun compound |
| Tjellientjånnide | Tjellientjåånnide | 1 |
| Name |
| dataooperatöörigujmie | dataoperatöörigujmie | 1 |
| Compound noun |
| mïelebarkemasseth | mïelebarkemassedh | 1 |
| Gen noun + noun compound |
| dulmedibe | dulmedibie | 1 |
| Deverbal verb |
| nïejtetjesse | nïejtetjasse | 1 |
| Denominal noun |
| vuasatalleles | vuasahtalleles | 1 |
| Deverbal adj |
| barketalledh | barkehtalledh | 1 |
| Deverbal verb |
| treempesovvimh | treemhpesovvimh | 1 |
| Deverbal verb |
| jaevrijes | jaavrijes | 1 |
| Denominal adj |
| laahpalassesne | laahpelassesne | 1 |
| Deverbal adj |
| byssegåetijægan | byssegåetiejægan | 1 |
| Deverbal verb |
| gierehtsadtjese | gieriehtsadtjese | 1 |
| Deverbal noun |
| aeblehtsvoetenath | aeblehtsvoetenadth | 1 |
| Noun |
| laahkoemesside | laahkoemasside | 1 |
| Derivated noun |
| noeriebinie | nueriebinie | 1 |
| Adjective comparative |
Spelling errors with only incorrect suggestions (12)
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| tsoetsesjidiege | tsoedtsesjidie | 3 |
| Deverbal verb + clitic |
| bïssemnniejtetjijstie | bïssemennïejtetjijstie | 2 |
| Deverbal noun + deverbal noun compound |
| ledtiejmoeresoejjegimmiengeannah | ledtiejmoeresoejjegimmiengænnah | 2 |
| Noun + noun + deverbal noun + clitic |
| sijjiesaerninniejtinegan | sijjiesaerniennïejtinegan | 2 |
| Noun + noun +noun + clitic |
| basteldihkiegarresvoetemem | bastaldihkiegarresvoetemem | 1 |
| Deverbal adj + derivated noun compound |
| dåeriedimmiejdåajoejen | dåeriedimmiejdåajojen | 1 |
| Derived adj + derived noun compound |
| gealtelesvoetegeh | gealtelesvoetegih | 1 |
| Deadjectival noun + clitic |
| laahpemebiernijste | laehpemebiernijste | 1 |
| Infinite verb + noun |
| ovmessietjoelijiprihtjegigujmie | ovmessietjoelijiprïhtjegigujmie | 1 |
| Adj + denominal noun + noun |
| ruevtienbiestememoervaarjoen | ruevtienbiestememoerevaarjoen | 1 |
| Noun + deverbal noun + noun + noun compound |
| tjoepehtalligangan | tjoehpehtalligangan | 1 |
| Deverbal verb + clitc |
| vïjhtestoerretjuetiegøøktetjuetievïjhteluhkiegööktinie | vïjhtestoerretjuetiegøøktetjuetievïjhteluhkiegööktine | 1 |
| Number compound |
False positives (8)
False positives with suggestions (8)
| Input word | Suggestions | |
|---|---|---|
| dåeriedimmiejdåajojen |
| |
| gealtelesvoetegih |
| |
| laehpemebiernijste |
| |
| ledtiejmoeresoejjegimmiengænnah |
| |
| ovmessietjoelijiprïhtjegigujmie |
| |
| sijjiesaerniennïejtinegan |
| |
| tjoehpehtalligangan |
| |
| vïjhtestoerretjuetiegøøktetjuetievïjhteluhkiegööktine |
|
True negatives (30)
The correctly accepted words are listed here for easy copy&paste into Word to quick check regressions in new versions of the speller: No correctly spelled words accepted by an earlier speller should be rejected by later spellers. To check, just copy and paste these words to Word, and see if you get any red underlines. Duplicate words are not repeated, unless one is followed by some punctuation. The words are reverse sorted according to Unicode character code.
vuasahtalleles vaeriedimmiesaakenjaevriem tsoedtsesjidie treemhpesovvimh Tjellientjåånnide ruevtienbiestememoerevaarjoen nueriebinie nïejtetjasse moereprotokollemistijæjnedh moerensaernieh mïelebarkemassedh laahpelassesne laahkoemasside jealam jaavrijes gieriehtsadtjese gieltijesaakesanne gervielibie durriemahkeldhgåatijammesistie dulmedibie doelmestidh dataoperatöörigujmie byssegåetiejægan bïssemennïejtetjijstie bïenjengåaksahvearaldahke bastaldihkiegarresvoetemem barkehtalledh balvedhlaaketje aeblehtsvoetenadth aeblehtsvoetegærjaaamhtesh
Disregarded by speller (11)
When using AppleScript and Microsoft Word to run the speller tests, there are some words that are impossible to spell check. This is due to a "feature" of MS Word's implementation of AppleScript - hyphens, colons and full stops are not recognised as part of the string to be checked, and the words are instead split in two (or more) at each non-letter, and each part is spell-checked alone. This of course leads to meaningless speller output. They are therefore not considered, but are listed here for reference. This only happens when using AppleScript. In normal, interactive use, such words are correctly dealt with by the speller/Word.
| Input word | Expected correction | Editing distance |
||
|---|---|---|---|---|
| Fort-de-France-geirhkie | Fort-de-France-gierhkie | 2 | Name + noun compound | |
| st.diedråanjoemijstie | St.died-råanjoemijstie | 2 | Abbr + derivated noun compound | |
| 10-galvåajja | 10-gaalvåajja | 1 | Digit + noun compound | |
| AOF-åabpeje | AOF-åabpije | 1 | Acro + Derivated noun compound | |
| AP-beetnegeh | AP-beetnegh | 1 | Acro + noun compound | |
| Bas-stredet | Bass-stredet | 1 | Name | |
| Fisklös-gieregijsie | Fisklös-gieregijstie | 1 | Name | |
| Koskimies-soejkesjimmereagkaj | Koskimies-soejkesjimmiereagkaj | 1 | Name + noun + noun compound | |
| mistije-NRK:esne | mistije-NRK:sne | 1 | Derivated noun + acro compound | |
| St.died-samkahierkie | St.died-samkashierkie | 1 | Abbr + adj + noun compound | |
| Tjellientjåånne-Tjiddiemaajjinie | Tjellientjåånne-Tjiddiemaajjine | 1 | Name gen + name compound |
All test data
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 10-gaalvåajja | ||||
| aeblehtsvoetegærjaaamhtesh | ||||
| aeblehtsvoetenadth | ||||
| AOF-åabpije | ||||
| AP-beetnegh | ||||
| balvedhlaaketje | ||||
| barkehtalledh | ||||
| Bass-stredet | ||||
| bastaldihkiegarresvoetemem | ||||
| bïenjengåaksahvearaldahke | ||||
| bïssemennïejtetjijstie | ||||
| byssegåetiejægan | ||||
| dåeriedimmiejdåajojen |
| |||
| dataoperatöörigujmie | ||||
| doelmestidh | ||||
| dulmedibie | ||||
| durriemahkeldhgåatijammesistie | ||||
| Fisklös-gieregijstie | ||||
| Fort-de-France-gierhkie | ||||
| gealtelesvoetegih |
| |||
| gervielibie | ||||
| gieltijesaakesanne | ||||
| gieriehtsadtjese | ||||
| jaavrijes | ||||
| jealam | ||||
| Koskimies-soejkesjimmiereagkaj | ||||
| laahkoemasside | ||||
| laahpelassesne | ||||
| laehpemebiernijste |
| |||
| ledtiejmoeresoejjegimmiengænnah |
| |||
| mïelebarkemassedh | ||||
| mistije-NRK:sne | ||||
| moerensaernieh | ||||
| moereprotokollemistijæjnedh | ||||
| nïejtetjasse | ||||
| nueriebinie | ||||
| ovmessietjoelijiprïhtjegigujmie |
| |||
| ruevtienbiestememoerevaarjoen | ||||
| sijjiesaerniennïejtinegan |
| |||
| St.died-råanjoemijstie | ||||
| St.died-samkashierkie | ||||
| Tjellientjåånne-Tjiddiemaajjine | ||||
| Tjellientjåånnide | ||||
| tjoehpehtalligangan |
| |||
| treemhpesovvimh | ||||
| tsoedtsesjidie | ||||
| vaeriedimmiesaakenjaevriem | ||||
| vïjhtestoerretjuetiegøøktetjuetievïjhteluhkiegööktine |
| |||
| vuasahtalleles | ||||
| St.died-samkahierkie | St.died-samkashierkie | 1 | Abbr + adj + noun compound | |
| st.diedråanjoemijstie | St.died-råanjoemijstie | 2 | Abbr + derivated noun compound | |
| AOF-åabpeje | AOF-åabpije | 1 | Acro + Derivated noun compound | |
| AP-beetnegeh | AP-beetnegh | 1 | Acro + noun compound | |
| balvedhlaketje | balvedhlaaketje | 1 |
| Adj |
| ovmessietjoelijiprihtjegigujmie | ovmessietjoelijiprïhtjegigujmie | 1 |
| Adj + denominal noun + noun |
| noeriebinie | nueriebinie | 1 |
| Adjective comparative |
| bienjengåaksahvearaldahke | bïenjengåaksahvearaldahke | 1 |
| Compound noun |
| dataooperatöörigujmie | dataoperatöörigujmie | 1 |
| Compound noun |
| gealtelesvoetegeh | gealtelesvoetegih | 1 |
| Deadjectival noun + clitic |
| jaevrijes | jaavrijes | 1 |
| Denominal adj |
| nïejtetjesse | nïejtetjasse | 1 |
| Denominal noun |
| laahkoemesside | laahkoemasside | 1 |
| Derivated noun |
| mistije-NRK:esne | mistije-NRK:sne | 1 | Derivated noun + acro compound | |
| dåeriedimmiejdåajoejen | dåeriedimmiejdåajojen | 1 |
| Derived adj + derived noun compound |
| laahpalassesne | laahpelassesne | 1 |
| Deverbal adj |
| vuasatalleles | vuasahtalleles | 1 |
| Deverbal adj |
| basteldihkiegarresvoetemem | bastaldihkiegarresvoetemem | 1 |
| Deverbal adj + derivated noun compound |
| gierehtsadtjese | gieriehtsadtjese | 1 |
| Deverbal noun |
| bïssemnniejtetjijstie | bïssemennïejtetjijstie | 2 |
| Deverbal noun + deverbal noun compound |
| vaeredimmiesaakenjaevriem | vaeriedimmiesaakenjaevriem | 1 |
| Deverbal noun + noun + noun compound |
| gieltijesakesanne | gieltijesaakesanne | 1 |
| Deverbal noun + noun compound |
| barketalledh | barkehtalledh | 1 |
| Deverbal verb |
| byssegåetijægan | byssegåetiejægan | 1 |
| Deverbal verb |
| doelmestid | doelmestidh | 1 |
| Deverbal verb |
| dulmedibe | dulmedibie | 1 |
| Deverbal verb |
| gervelibie | gervielibie | 1 |
| Deverbal verb |
| treempesovvimh | treemhpesovvimh | 1 |
| Deverbal verb |
| tjoepehtalligangan | tjoehpehtalligangan | 1 |
| Deverbal verb + clitc |
| tsoetsesjidiege | tsoedtsesjidie | 3 |
| Deverbal verb + clitic |
| 10-galvåajja | 10-gaalvåajja | 1 | Digit + noun compound | |
| mïelebarkemasseth | mïelebarkemassedh | 1 |
| Gen noun + noun compound |
| moerensærnieh | moerensaernieh | 2 |
| Gen noun + noun compound |
| laahpemebiernijste | laehpemebiernijste | 1 |
| Infinite verb + noun |
| Bas-stredet | Bass-stredet | 1 | Name | |
| Fisklös-gieregijsie | Fisklös-gieregijstie | 1 | Name | |
| Tjellientjånnide | Tjellientjåånnide | 1 |
| Name |
| Koskimies-soejkesjimmereagkaj | Koskimies-soejkesjimmiereagkaj | 1 | Name + noun + noun compound | |
| Fort-de-France-geirhkie | Fort-de-France-gierhkie | 2 | Name + noun compound | |
| Tjellientjåånne-Tjiddiemaajjinie | Tjellientjåånne-Tjiddiemaajjine | 1 | Name gen + name compound | |
| aeblehtsvoetenath | aeblehtsvoetenadth | 1 |
| Noun |
| ruevtienbiestememoervaarjoen | ruevtienbiestememoerevaarjoen | 1 |
| Noun + deverbal noun + noun + noun compound |
| moereprohtokollemistijæjnedh | moereprotokollemistijæjnedh | 1 |
| Noun + noun + derived noun compound |
| ledtiejmoeresoejjegimmiengeannah | ledtiejmoeresoejjegimmiengænnah | 2 |
| Noun + noun + deverbal noun + clitic |
| sijjiesaerninniejtinegan | sijjiesaerniennïejtinegan | 2 |
| Noun + noun +noun + clitic |
| aeblehtsvoetegærjaamhtesh | aeblehtsvoetegærjaaamhtesh | 1 |
| Noun + noun + noun compound |
| vïjhtestoerretjuetiegøøktetjuetievïjhteluhkiegööktinie | vïjhtestoerretjuetiegøøktetjuetievïjhteluhkiegööktine | 1 |
| Number compound |
| durriemahkeldhgåetijammesistie | durriemahkeldhgåatijammesistie | 1 |
| Prefix + derivated adj |
| jælam | jealam | 2 |
| Verb |

