Speller Test Results for: «Markansluska.correct.doc.xml»
Overview
Technical data
Language tested: sme
Document tested: Markansluska.correct.doc.xml
Speller tool: AppleScript driving MS Word
Speller tool version: MS Word 11.4.2, AppleScript version: $Revision: 1320 $ $Date: 2011-09-12 12:12:03 +0200 (man, 12 sep 2011) $
Speller lexicon version: Davvisámi, version 1.0.1, 2008-05-13
Test Date: 20080527
Test Type: correct-corpus
Processing time (user+system time): No data available
Speed: No data available
Max memory usage of the speller (max value of /bin/ps -o rss= PID sampled every 10 second): No data available
Result summary
Nº of input words: 715
Nº of spelling errors: 105 (14,69% of all words)
| Speller view | Wrong tokenisation | |||
|---|---|---|---|---|
| Speller Positive (Nº of flagged words): 111 | Speller Negative (Nº of accepted words): 558 | All tokenisation errors: 0 |
||
| Reality | Nº of real errors: 105 | Nº of true positives (detected real errors): 99 | Nº of false negatives (unflagged spelling errors): 2 | Nº of errouneously tokenized spelling errors: 0 |
| Nº of real correct words: 610 | Nº of false positives (incorrectly flagged words): 12 | Nº of true negatives (unflagged correct words): 556 | Nº of errouneously tokenized correct words: 0 |
|
4 input word(s) was/were discarded because the speller (or MS Word) could not deal with them properly. The calculation of Precision, Recall and Accuracy below does NOT include this/these word(s).
Precision (tp/(tp+fp)):
89,19%
Recall (tp/(tp+fn)):
98,02%
Accuracy ((tp+tn)/words):
97,91%
| Spelling errors: 105 | ||||
|---|---|---|---|---|
| Simple errors (edit dist. 1): | Errors with edit distance 2: | Errors with edit distance ≥3: |
||
| Suggestion statistics for true positives (= 99 = 100%): | 79 (75,24%) | 22 (20,95%) | 4 (3,81%) | |
| Nº of detected spelling errors with correct suggestion in top 3: | 79 (79.8 %) | 64 | 13 | 2 |
| Nº of detected spelling errors with correct suggestionbelow top 3: | 11 (11.11 %) | 4 | 5 | 2 |
| Nº of detected spelling errors with only wrong suggestions: | 9 (9.09 %) | 7 | 2 | 0 |
| Nº of detected spelling errors with no suggestions at all: | 0 (0 %) | 0 | 0 | 0 |
| Undetected spelling errors: | 2 | 1 | 1 | 0 |
True positives (99)
Spelling errors with correct suggestions (90)
| Input word | Expected correction | Editing distance | Suggestions |
|---|---|---|---|
| oaccu | oaččo | 3 |
|
| oaccu | oaččo | 3 |
|
| Mannjá | Maŋŋá | 3 |
|
| mannjá | maŋŋá | 3 |
|
| Janos | Jánoš | 2 |
|
| odda | ođđa | 2 |
|
| odda | ođđa | 2 |
|
| odda | ođđa | 2 |
|
| odda | ođđa | 2 |
|
| geahccalan | geahččalan | 2 |
|
| sadda | šaddá | 2 |
|
| dábálaccat | dábálaččat | 2 |
|
| láhkonisgoahtá | lahkonišgoahtá | 2 |
|
| oddasiin | ođđasiin | 2 |
|
| filbmafestiválabilleahttaid | filbmafestiválabileahtaid | 2 |
|
| parlamentaralas | parlamentáralaš | 2 |
|
| parlamentaralas | parlamentáralaš | 2 |
|
| daza | dáža | 2 |
|
| sápmelaccaid | sápmelaččaid | 2 |
|
| sápmelaccat | sápmelaččat | 2 |
|
| sápmelaccaid | sápmelaččaid | 2 |
|
| speazzut | speažžut | 2 |
|
| Janos | Janoš | 1 |
|
| Janos | Janoš | 1 |
|
| Janos | Janoš | 1 |
|
| dagazii | dagašii | 1 |
|
| SáMI | SÁMI | 1 |
|
| jamma | jámma | 1 |
|
| ploggestit | bloggestit | 1 |
|
| gittá | gitta | 1 |
|
| cohkkán | čohkkán | 1 |
|
| Ankke | Aŋkke | 1 |
|
| puslespeallui | puslespellui | 1 |
|
| Àigu | Áigu | 1 |
|
| jienasteaddjiid | jienasteddjiid | 1 |
|
| smiehtagoahtit | smiehttagoahtit | 1 |
|
| boade | boađe | 1 |
|
| buolasta | buolašta | 1 |
|
| cállán | čállán | 1 |
|
| volvo | Volvo | 1 |
|
| Nordkjosbotn'ii | Nordkjosbotnii | 1 |
|
| nordkjosbotn | Nordkjosbotn | 1 |
|
| boaldámusa | boaldámuša | 1 |
|
| Gárasávvonii | Gárasavvonii | 1 |
|
| alo | álo | 1 |
|
| iezan | iežan | 1 |
|
| seanga | seaŋga | 1 |
|
| juovlaskeankkaid | juovlaskeaŋkkaid | 1 |
|
| geahcadit | geahčadit | 1 |
|
| juovlaskeankagávppašeapmi | juovlaskeaŋkagávppašeapmi | 1 |
|
| skeankkaid | skeaŋkkaid | 1 |
|
| skeankkaid | skeaŋkkaid | 1 |
|
| juovlaskeankkaid | juovlaskeaŋkkaid | 1 |
|
| juovlaskeankan | juovlaskeaŋkan | 1 |
|
| juovlaskeankkaid | juovlaskeaŋkkaid | 1 |
|
| tinggaid | tiŋggaid | 1 |
|
| jodiheaddji | jođiheaddji | 1 |
|
| jodiheaddji | jođiheaddji | 1 |
|
| diedihan | dieđihan | 1 |
|
| iezas | iežas | 1 |
|
| NSRas | NSR:s | 1 |
|
| nissonolmmos | nissonolmmoš | 1 |
|
| Samedikki | Sámedikki | 1 |
|
| presdideantan | presideantan | 1 |
|
| NSRas | NSR:s | 1 |
|
| dáza | dáža | 1 |
|
| opposisuvdnii | opposišuvdnii | 1 |
|
| diede | dieđe | 1 |
|
| posisuvnnas | posišuvnnas | 1 |
|
| ásahuvvoi | ásahuvvui | 1 |
|
| opposisuvnnas | opposišuvnnas | 1 |
|
| jáhkkihit | jáhkihit | 1 |
|
| posisuvnnas | posišuvnnas | 1 |
|
| saddat | šaddat | 1 |
|
| sat | šat | 1 |
|
| duodai | duođai | 1 |
|
| dáza | dáža | 1 |
|
| Nágodit | Nagodit | 1 |
|
| cielga | čielga | 1 |
|
| sámeorganisasuvdna | sámeorganisašuvdna | 1 |
|
| NSRa | NSR | 1 |
|
| ieza | ieža | 1 |
|
| sat | šat | 1 |
|
| visuvnnat | višuvnnat | 1 |
|
| giedaid | gieđaid | 1 |
|
| sattai | šattai | 1 |
|
| dupmejuvvon | dubmejuvvon | 1 |
|
| soaita | soaitá | 1 |
|
| ieza | ieža | 1 |
|
| duodai | duođai | 1 |
|
Spelling errors with only incorrect suggestions (9)
| Input word | Expected correction | Editing distance | Suggestions |
|---|---|---|---|
| saldegeazis | šaldegeažis | 2 |
|
| saldi | šalddi | 2 |
|
| MáRKANSLUSKA | MÁRKANSLUSKA | 1 |
|
| Norggabealde | Norgga bealde | 1 |
|
| NSRii | NSR:ii | 1 |
|
| NSRii | NSR:ii | 1 |
|
| rievttimielde | rievtti mielde | 1 |
|
| rievttimielde | rievtti mielde | 1 |
|
| statoilkohpa | Statoilkohpa | 1 |
|
False positives (12)
False positives with suggestions (12)
| Input word | Suggestions |
|---|---|
| billeahtaid |
|
| dábálas |
|
| dongeribuksagávppi |
|
| dongeribuvssat |
|
| eanetlohkku |
|
| eanetlohkku |
|
| ekologálaš |
|
| Sámedikkeválggaid |
|
| sat |
|
| ullubáiddit |
|
| ullubuvssaid |
|
| ullubuvssat |
|
False negatives (2)
| Input word | Expected correction | Editing distance |
|---|---|---|
| 2005’as | 2005:s | 2 |
| Norggabeale | Norgga beale | 1 |
True negatives (556)
The correctly accepted words are listed here for easy copy&paste into Word to quick check regressions in new versions of the speller: No correctly spelled words accepted by an earlier speller should be rejected by later spellers. To check, just copy and paste these words to Word, and see if you get any red underlines. Duplicate words are not repeated, unless one is followed by some punctuation. The words are reverse sorted according to Unicode character code.
Vuosttas vuoján vuoi vuodján viimmat vigget vel vealahit veahket veahá váttis várrepresideanta válljet vállje váikkuha Vaikke unnimus unna Trosten stuoridit stohkosat stobuid stivret stivrejupmái stivrejumis speanta Son son somá Soaitá snuvssaid smiehttat skihpáriidda skáhpat sis siiddus sii seamma sávvan Sara Sápmi Sámit Sámiid sámepolitihkka sámegillii Sámedikkis Sámediggi sáhttit sáhttet sáhttán sáhtašin saddet šaddat saddá ruoktut RUOKTU Riikasearvvis rievdadus rájes rájáid rájá rahpat ráhkadit politihkka politihka ožžot ožžon ovttasbargat ovtta ovddit ovddasta ovdamearka ovdal ostet orrut olmmoš ollu olles olbmuid Olbmot olbmot oktii okta Odne oasttán oastit oastimin oanehis oalle oaivvildit oaivvilda oaidnit nuvttá nuppi Nu nu NSR Norgga namma mus munnje Mun mun muitala Muhto muhto Muhtin muhtin mu moaitigoahtit Mo mo Mis mis min Mikkel Mii mii mat mas Márjá mantra manne mánáid Makkár maiddái maid luonddu lohkamin logai lihkolažžan Lene Lei lei ledje Leat leat Lean lean leamas Lea lea láven láibi Keskitalo káfiid káfe Jus jus juovllat juovlaruohtta juovlanigá juhkat Johan joatkit Jo jo jna jáhkkit jagi Ja ja In in ii Ihtin iežan Ieš ieš homofiilla hirbmat hilgot heajos hárjánan Hansen han hálit haga Guovdageidnui guokte gullo gulan gova gos golbma goas Go go gitta Gii gii gievkkanis ges geat geas Geainna ge galgga galgat galgá gal gáidan gáhtte fuolkkit fitnet fitnen fitne fitnat finadan fertet ferten fáhkka espressomášiidna ehpet eddon Eatni earát earán eanet eaktodáhtolašbarggu eai dusse dovdu doppe doarjut DIHKKI dievva dien diehtit deavdimin deavddán de dávviriiguin dávviriid dávvirat dát dat Dasalassin dás das dárbbaša danne Dán dán Dan dan Dál dál dakkár dajan dajai daja dahket dádjon buvssat buvssaid bures buot buorre Buorit buorit buohkat brihkká bottu borramušgálvvuid borramuš boahtá bivval bistet bisánan birra billašuvvet biktasat bijada biilii bellodat bellodagas bellodagaid bellodaga behtohallat beassat beassá beare bealde BB Bargiidbellodaga bargat bálkká báibman Ávžžuhan ástta Ánne amas álo almmái alla alit álggus álbmoga Aili ahte áhkku áddjá
Disregarded by speller (4)
When using AppleScript and Microsoft Word to run the speller tests, there are some words that are impossible to spell check. This is due to a "feature" of MS Word's implementation of AppleScript - hyphens, colons and full stops are not recognised as part of the string to be checked, and the words are instead split in two (or more) at each non-letter, and each part is spell-checked alone. This of course leads to meaningless speller output. They are therefore not considered, but are listed here for reference. This only happens when using AppleScript. In normal, interactive use, such words are correctly dealt with by the speller/Word.
| Input word | Expected correction | Editing distance |
|---|---|---|
| dnb-feaskáris | DnB-feaskáris | 2 |
| DNB-feaskáris | DnB-feaskáris | 1 |
| duo?aid | duođaid | 1 |
| lulli-Norggas | Lulli-Norggas | 1 |

