Speller Test Results for: «sp-wordtype-hu-sme.txt»
Overview
Technical data
Language tested: sme
Document tested: sp-wordtype-hu-sme.txt
Speller tool: Hunspell, command line version
Speller tool version: @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2)
Speller lexicon version: Davvisámi 1.0beta10-2012-05-10
Test Date: 20120511-1332
Test Type: wordtype
Processing time (user+system time): 49 minutes 45,98 seconds
Speed: 0,05 words/second
Max memory usage of the speller (max value of /bin/ps -o rss= PID sampled every 10 seconds): 87 788 KB
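The speed and memory figures above can be reproduced roughly as follows. This is a minimal sketch, not the test bench's actual script: the function names are hypothetical, and `read_rss_kb` assumes a `ps` that accepts `-o rss= -p PID`.

```python
import subprocess
import time


def words_per_second(n_words, minutes, seconds):
    """Speed as reported above: input words divided by user+system time."""
    return n_words / (minutes * 60 + seconds)


def read_rss_kb(pid):
    """Resident set size of `pid` in kilobytes via ps, or None once the process is gone."""
    result = subprocess.run(
        ["ps", "-o", "rss=", "-p", str(pid)],
        capture_output=True, text=True,
    )
    text = result.stdout.strip()
    return int(text) if text else None


def peak_rss_kb(pid, interval=10.0, read_rss=read_rss_kb, sleep=time.sleep):
    """Sample the RSS every `interval` seconds until the process exits; return the maximum seen."""
    peak = 0
    while True:
        rss = read_rss(pid)
        if rss is None:
            return peak
        peak = max(peak, rss)
        sleep(interval)
```

With the figures from this run, `round(words_per_second(162, 49, 45.98), 2)` gives the reported 0.05 words/second. Because the memory value is sampled, short peaks between two samples can be missed.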
Result summary
Nº of input words: 162
Nº of spelling errors: 81 (50,0% of all words)
| Reality | Speller Positive (Nº of flagged words): 108 | Speller Negative (Nº of accepted words): 51 | Wrong tokenisation (all tokenisation errors): 3 |
|---|---|---|---|
| Nº of real errors: 81 | Nº of true positives (detected real errors): 77 | Nº of false negatives (unflagged spelling errors): 2 | Nº of erroneously tokenized spelling errors: 2 |
| Nº of real correct words: 81 | Nº of false positives (incorrectly flagged words): 31 | Nº of true negatives (unflagged correct words): 49 | Nº of erroneously tokenized correct words: 1 |
Precision (tp/(tp+fp)):
71,3%
Recall (tp/(tp+fn)):
97,47%
Accuracy ((tp+tn)/words):
79,25%
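The three figures above follow directly from the counts in the result summary. The sketch below recomputes them; the function name is hypothetical. Note one assumption: the reported accuracy of 79,25% is reproduced only when "words" means the 159 correctly tokenised words (tp + fp + fn + tn), not all 162 input words, which would give 77,78%.

```python
def speller_metrics(tp, fp, fn, tn):
    """Precision, recall and accuracy as percentages, from the four outcome counts."""
    precision = 100 * tp / (tp + fp)
    recall = 100 * tp / (tp + fn)
    # Assumption: accuracy is taken over the correctly tokenised words only
    # (tp + fp + fn + tn), which is what reproduces the figure in this report.
    accuracy = 100 * (tp + tn) / (tp + fp + fn + tn)
    return precision, recall, accuracy


precision, recall, accuracy = speller_metrics(tp=77, fp=31, fn=2, tn=49)
print(f"{precision:.2f} {recall:.2f} {accuracy:.2f}")  # prints "71.30 97.47 79.25"
```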
| Spelling errors: 81 | All | Simple errors (edit dist. 1) | Errors with edit distance 2 | Errors with edit distance ≥3 |
|---|---|---|---|---|
| Nº of spelling errors: | 81 | 66 (81,48%) | 15 (18,52%) | 0 (0,0%) |
| Suggestion statistics for true positives (= 77 = 100%): | | | | |
| Nº of detected spelling errors with correct suggestion in top 3: | 40 (51,95%) | 35 | 5 | 0 |
| Nº of detected spelling errors with correct suggestion below top 3: | 1 (1,3%) | 1 | 0 | 0 |
| Nº of detected spelling errors with only wrong suggestions: | 36 (46,75%) | 26 | 10 | 0 |
| Nº of detected spelling errors with no suggestions at all: | 0 (0%) | 0 | 0 | 0 |
| Undetected spelling errors: | 2 | 2 | 0 | 0 |
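The four suggestion rows in the table above amount to a simple bucketing of each detected error by where the expected correction appears in the speller's suggestion list. A minimal sketch, assuming suggestions arrive as an ordered list; the function name and the suggestion lists in the example are hypothetical placeholders, not speller output from this run.

```python
def suggestion_category(expected, suggestions, top_n=3):
    """Bucket one detected error by where the expected correction ranks among the suggestions."""
    if not suggestions:
        return "none"
    if expected in suggestions[:top_n]:
        return "top"
    if expected in suggestions:
        return "below_top"
    return "only_wrong"


# Hypothetical suggestion lists, for illustration only.
print(suggestion_category("ammahan", ["ammahan", "placeholder1"]))            # prints "top"
print(suggestion_category("ammahan", ["p1", "p2", "p3", "ammahan"]))          # prints "below_top"
print(suggestion_category("ammahan", ["placeholder1", "placeholder2"]))       # prints "only_wrong"
print(suggestion_category("ammahan", []))                                     # prints "none"
```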
True positives (77)
Spelling errors with correct suggestions (41)
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| guoktenuppelotčoarvvagiin | guovttenuppelotčoarvvagiin | 2 | | Number + adj compound |
| reavggáhattot | reavggáhaddot | 2 | | Deverbal verb |
| reahččut | reahccut | 2 | | Adverb |
| divttemeahttumat | diktemeahttumat | 2 | | Deverbal adj |
| guvttii-guvttii | guvttiid-guvttiid | 2 | | adverb |
| amahan | ammahan | 1 | | particle |
| AP-rávagat | AP-rávvagat | 1 | | Acro + noun compound |
| eahpet | ehpet | 1 | | V+IV+Neg+Ind+Pl2 |
| leahkoset | lehkoset | 1 | | V+IV+Imprt+Pl3 |
| duolmmastestit | duolmmasteastit | 1 | | Deverbal verb |
| doppe-garadastimis | doppe-garradastimis | 1 | | Adverb + derivated noun compound |
| eahpi | eahppi | 1 | | V+IV+Neg+Ind+Du2 |
| lehppet | lehpet | 1 | | V+IV+Ind+Prs+Pl2 |
| bugu | buhu | 1 | | interjection |
| masti-NRK:as | masti-NRK:s | 1 | | Derivated noun + acro compound |
| máŋgasoarttagiiguin | máŋggasoarttagiiguin | 1 | | Number + adj compound |
| bargahaddat | barggahaddat | 1 | | Deverbal verb |
| jávrralaččaid | jávrrálaččaid | 1 | | Denominal adj |
| nieidažaččas | nieiddažaččas | 1 | | Denominal noun |
| čájetmeahtun | čájetmeahttun | 1 | | Deverbal adj |
| čalmástuvvamuššii | čalmmástuvvamuššii | 1 | | Deverbal noun |
| birgegoadán | birgegoađán | 1 | | Deverbal verb |
| Gaskavággerratráššan | Gaskavággeerratráššan | 1 | | Name |
| Davimusaláš-geađggát | Davimušaláš-geađggát | 1 | | Name + noun compound |
| čielastuvažit | cielastuvažit | 1 | | Deverbal verb |
| suhtastuvvabeahti | suhtastuvvabeahtti | 1 | | Deverbal verb |
| gáttebuidda | gáttibuidda | 1 | | Noun comparative |
| bárggakeahttá | barggakeahttá | 1 | | Nominal verbform |
| dustlassan | dustolassan | 1 | | Derived adj |
| eaŧ | eat | 1 | | V+IV+Neg+Ind+Pl1 |
| beaive | beaivi | 1 | | not compounding form |
| váive | váivi | 1 | | not compounding form |
| alitgearde | alitgeardi | 1 | | not compounding form |
| lietne | letne | 1 | | V+IV+Ind+Prs+Du1 |
| leabá | leaba | 1 | | V+IV+Ind+Prs+Du3 |
| alloská | alloska | 1 | | V+IV+Neg+Imprt+Du3 |
| állos | allos | 1 | | V+IV+Neg+Imprt+Sg3 |
| gaikaide | gaikkaide | 1 | | Pron+Indef+Pl+Ill |
| makkaris | makkáris | 1 | | Pron+Interr+Sg+Loc |
| dahttai | dahtai | 1 | | Pron+Indef+Sg+Nom |
| nuorttabelde | nuorttabealde | 1 | | adverb/po/pp |
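The "Editing distance" column is consistent with a plain Levenshtein distance between the input word and the expected correction. The sketch below assumes that metric (insertions, deletions and substitutions, each costing 1, counted per NFC code point); the report does not state which variant the test bench uses, so treat this as an illustration rather than the bench's own implementation.

```python
import unicodedata


def edit_distance(a, b):
    """Plain Levenshtein distance (insert, delete, substitute), counted in NFC code points."""
    a = unicodedata.normalize("NFC", a)
    b = unicodedata.normalize("NFC", b)
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


print(edit_distance("amahan", "ammahan"))     # prints 1
print(edit_distance("reahččut", "reahccut"))  # prints 2
```

Normalising to NFC first keeps letters such as `č` or `á` from counting as two characters when the input happens to use combining diacritics.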
Spelling errors with only incorrect suggestions (36)
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 10-jahkasacča | 10-jahkásačča | 2 | | Digit + noun compound |
| barggahaddalinban | barggahattalinban | 2 | | Deverbal verb + clitic |
| garvelesttáledje | garvileasttáledje | 2 | | Deverbal verb |
| goadatcielastuvvanbárteuvsa | goađátcielastuvvanbárteuvsa | 2 | | Noun + deverbal deverbal noun + noun + noun compound |
| guovttenuplotnamagiid | guovttenuppelotnamagiid | 2 | | Number + adj compound |
| heajosvuohtaláhkaroaddan | heajosvuođaláhkaroaddan | 2 | | Deadjectival noun + noun + noun compound |
| heajosvuohtaláhkaroattaidegis | heajosvuođaláhkaroattaidegis | 2 | | Deadjectival noun + noun + noun compound + clitic |
| Orjješ-láhppoluobbalis | Orjješ-Láhpoluobbalis | 2 | | Prefix + Name |
| seammalagana | seammalágána | 2 | | Pron + adj compound |
| Svartneskilenas | Svartnæskilenis | 2 | | Name |
| allonbiernaiguin | allonbiertnaiguin | 1 | | Infinite verb + noun compound |
| almmehemiidduostumiin | almmihemiidduostumiin | 1 | | Derived adj + derived noun compound |
| anti-dábalaččaide | anti-dábálaččaide | 1 | | Prefix + derivated adj |
| árjjátlovihiidda | árjjátluovihiidda | 1 | | Derivated noun - from name |
| bearjdatáigge | bearjadatáigge | 1 | | Noun + noun with shortened genitive compound |
| birgegoađaledjenai | birgegoađáledjenai | 1 | | Deverbal verb + clitic |
| čalmmástuvamušago | čalmmástuvvamušago | 1 | | Deverbal noun + clitic |
| čielastuvvanbárteuksa | cielastuvvanbárteuksa | 1 | | Deverbal deverbal noun + noun + noun compound |
| cielástuvvanbárteuvssaiguinge | cielastuvvanbárteuvssaiguinge | 1 | | Deverbal deverbal noun + noun + noun compound + clitic |
| cuohtiguokteloginjeallje | čuohtiguokteloginjeallje | 1 | | čuohti |
| dildddahkamiinnis | dilddedahkamiinnis | 1 | | Gen noun + derivated noun compound |
| divtasvuonagadoalvu | divttasvuonagadoalvu | 1 | | Derivated noun - from name + derivated noun compound |
| dolkesáhkasat | dolkisáhkasat | 1 | | Deverbal noun + noun compound |
| gálvoguodahaddanidjagárvvuin | gálvoguođahaddanidjagárvvuin | 1 | | Noun + deverbal deverbal noun + noun + noun compound |
| geagugeahčai | geagugeahčái | 1 | | Gen noun + noun compound |
| gielddosolbmuidbarggováhčamiidda | gielddosolbmuidbargováhčamiidda | 1 | | Adj + noun + noun + noun compound |
| Guovdageainu-Romssa | Guovdageainnu-Romssa | 1 | | Name gen + name compound |
| Koskkivuori-plánenreaiddut | Koskivuori-plánenreaiddut | 1 | | Name + noun + noun compound |
| láhppemeahtungarasvuođaid | láhppemeahttungarasvuođaid | 1 | | Deverbal adj + derivated noun compound |
| máistenhuksemušaset | máistinhuksemušaset | 1 | | Noun + deverbal noun compound |
| moaddeloht | moaddelot | 1 | | moadde |
| muorrageasgahpirii | muorrageašgahpirii | 1 | | Noun + noun + noun compound |
| njealječuođiduhátgávccičuođiviđainnuppelogiin | njeallječuođiduhátgávccičuođiviđainnuppelogiin | 1 | | Number compound |
| njealječuođiduhátgávccičuođiviđanuppelotlágániidda | njeallječuođiduhátgávccičuođiviđanuppelotlágániidda | 1 | | Number compound + adj compound |
| olbmuidbarggovissir | olbmuidbargovissir | 1 | | Noun + noun + noun compound |
| Svartnæskilenashal | Svartnæskilenishal | 1 | | Name + clitic |
False positives (31)
False positives with suggestions (31)
| Input word | Suggestions |
|---|---|
| allonbiertnaiguin | |
| almmihemiidduostumiin | |
| anti-dábálaččaide | |
| árjjátluovihiidda | |
| barggahattalinban | |
| bearjadatáigge | |
| birgegoađáledjenai | |
| čalmmástuvvamušago | |
| cielastuvvanbárteuksa | |
| cielastuvvanbárteuvssaiguinge | |
| čuohtiguokteloginjeallje | |
| dilddedahkamiinnis | |
| divttasvuonagadoalvu | |
| dolkisáhkasat | |
| elge | |
| gálvoguođahaddanidjagárvvuin | |
| geagugeahčái | |
| gielddosolbmuidbargováhčamiidda | |
| goađátcielastuvvanbárteuvsa | |
| heajosvuođaláhkaroaddan | |
| heajosvuođaláhkaroattaidegis | |
| Koskivuori-plánenreaiddut | |
| láhppemeahttungarasvuođaid | |
| máistinhuksemušaset | |
| moaddelot | |
| muorrageašgahpirii | |
| njeallječuođiduhátgávccičuođiviđainnuppelogiin | |
| njeallječuođiduhátgávccičuođiviđanuppelotlágániidda | |
| olbmuidbargovissir | |
| Orjješ-Láhpoluobbalis | |
| Svartnæskilenishal | |
False negatives (2)
| Input word | Expected correction | Editing distance | Comment |
|---|---|---|---|
| ANC-reahcut | ANC-reahccut | 1 | Acro + noun compound |
| elege | elge | 1 | conjunction |
True negatives (49)
The correctly accepted words are listed here so they can be copied and pasted into Word to quickly check for regressions in new versions of the speller: no correctly spelled word accepted by an earlier speller should be rejected by a later one. To check, paste these words into Word and look for red underlines. Duplicate words are not repeated, unless one is followed by punctuation. The words are sorted in reverse order by Unicode character code.
váivi Svartnæskilenis suhtastuvvabeahtti st.dieđ-ravgalastin seammalágána reavggáhaddot reahccut nuorttabealde nieiddažaččas máŋggasoarttagiiguin masti-NRK:s makkáris letne lehpet lehkoset leaba jávrrálaččaid guvttiid-guvttiid guovttenuppelotnamagiid guovttenuppelotčoarvvagiin Guovdageainnu-Romssa gáttibuidda Gaskavággeerratráššan garvileasttáledje gaikkaide ehpet eat eahppi dustolassan duolmmasteastit doppe-garradastimis diktemeahttumat Davimušaláš-geađggát dahtai cielastuvažit čalmmástuvvamuššii čájetmeahttun buhu birgegoađán beaivi barggakeahttá barggahaddat AP-rávvagat ANC-reahccut ammahan alloska allos alitgeardi 10-jahkásačča
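The same regression check can be scripted instead of pasted into Word. A minimal sketch, assuming the list of words flagged by the new speller is already available (for example from a run of `hunspell -l`, which prints the words it rejects); the function name and the flagged list in the example are hypothetical.

```python
def find_regressions(previously_accepted, flagged_now):
    """Words an earlier speller accepted that the new speller flags, in original order."""
    flagged = set(flagged_now)
    return [word for word in previously_accepted if word in flagged]


# A few of the true negatives listed above.
accepted_before = "váivi Svartnæskilenis suhtastuvvabeahtti reahccut".split()
# Hypothetical output of a newer speller, for illustration only.
flagged_by_new_speller = ["reahccut", "someotherword"]

print(find_regressions(accepted_before, flagged_by_new_speller))  # prints ['reahccut']
```

An empty result means that no previously accepted word has been lost.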
Tokenization Errors (3)
Tokenization Errors of Misspellings (2)
| Input word | Expected correction | Editing distance | Speller Tokens | Comment |
|---|---|---|---|---|
| st.dieđ-láikesbiro | st.dieđ-láikkesbiro | 1 | | Abbr + adj + noun compound |
| st.dieđravgalastin | st.dieđ-ravgalastin | 1 | | Abbr + derivated noun compound |
Tokenization Errors of Correct Words (1)
| Input word | Speller Tokens |
|---|---|
| st.dieđ-láikkesbiro | |
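One plausible reading of a tokenisation error is that the speller did not treat the input word as exactly one token, for instance by splitting it at the full stop. The sketch below encodes that assumption; the function name is hypothetical, and the token list in the example is an invented illustration, since the actual speller tokens are not preserved in this report.

```python
def is_tokenisation_error(input_word, speller_tokens):
    """True when the speller did not treat the input word as exactly one token."""
    return list(speller_tokens) != [input_word]


# Hypothetical split at the full stop, for illustration only.
print(is_tokenisation_error("st.dieđ-láikkesbiro", ["st", "dieđ-láikkesbiro"]))  # prints True
print(is_tokenisation_error("beaivi", ["beaivi"]))                               # prints False
```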
All test data
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 10-jahkásačča | | | | |
| alitgeardi | | | | |
| allonbiertnaiguin | | | | |
| allos | | | | |
| alloska | | | | |
| almmihemiidduostumiin | | | | |
| ammahan | | | | |
| ANC-reahccut | | | | |
| anti-dábálaččaide | | | | |
| AP-rávvagat | | | | |
| árjjátluovihiidda | | | | |
| barggahaddat | | | | |
| barggahattalinban | | | | |
| barggakeahttá | | | | |
| beaivi | | | | |
| bearjadatáigge | | | | |
| birgegoađáledjenai | | | | |
| birgegoađán | | | | |
| buhu | | | | |
| čájetmeahttun | | | | |
| čalmmástuvvamušago | | | | |
| čalmmástuvvamuššii | | | | |
| cielastuvažit | | | | |
| cielastuvvanbárteuksa | | | | |
| cielastuvvanbárteuvssaiguinge | | | | |
| čuohtiguokteloginjeallje | | | | |
| dahtai | | | | |
| Davimušaláš-geađggát | | | | |
| diktemeahttumat | | | | |
| dilddedahkamiinnis | | | | |
| divttasvuonagadoalvu | | | | |
| dolkisáhkasat | | | | |
| doppe-garradastimis | | | | |
| duolmmasteastit | | | | |
| dustolassan | | | | |
| eahppi | | | | |
| eat | | | | |
| ehpet | | | | |
| elge | | | | |
| gaikkaide | | | | |
| gálvoguođahaddanidjagárvvuin | | | | |
| garvileasttáledje | | | | |
| Gaskavággeerratráššan | | | | |
| gáttibuidda | | | | |
| geagugeahčái | | | | |
| gielddosolbmuidbargováhčamiidda | | | | |
| goađátcielastuvvanbárteuvsa | | | | |
| Guovdageainnu-Romssa | | | | |
| guovttenuppelotčoarvvagiin | | | | |
| guovttenuppelotnamagiid | | | | |
| guvttiid-guvttiid | | | | |
| heajosvuođaláhkaroaddan | | | | |
| heajosvuođaláhkaroattaidegis | | | | |
| jávrrálaččaid | | | | |
| Koskivuori-plánenreaiddut | | | | |
| láhppemeahttungarasvuođaid | | | | |
| leaba | | | | |
| lehkoset | | | | |
| lehpet | | | | |
| letne | | | | |
| máistinhuksemušaset | | | | |
| makkáris | | | | |
| masti-NRK:s | | | | |
| máŋggasoarttagiiguin | | | | |
| moaddelot | | | | |
| muorrageašgahpirii | | | | |
| nieiddažaččas | | | | |
| njeallječuođiduhátgávccičuođiviđainnuppelogiin | | | | |
| njeallječuođiduhátgávccičuođiviđanuppelotlágániidda | | | | |
| nuorttabealde | | | | |
| olbmuidbargovissir | | | | |
| Orjješ-Láhpoluobbalis | | | | |
| reahccut | | | | |
| reavggáhaddot | | | | |
| seammalágána | | | | |
| st.dieđ-láikkesbiro | | | | |
| st.dieđ-ravgalastin | | | | |
| suhtastuvvabeahtti | | | | |
| Svartnæskilenis | | | | |
| Svartnæskilenishal | | | | |
| váivi | | | | |
| st.dieđ-láikesbiro | st.dieđ-láikkesbiro | 1 | | Abbr + adj + noun compound |
| st.dieđravgalastin | st.dieđ-ravgalastin | 1 | | Abbr + derivated noun compound |
| ANC-reahcut | ANC-reahccut | 1 | | Acro + noun compound |
| AP-rávagat | AP-rávvagat | 1 | | Acro + noun compound |
| gielddosolbmuidbarggováhčamiidda | gielddosolbmuidbargováhčamiidda | 1 | | Adj + noun + noun + noun compound |
| guvttii-guvttii | guvttiid-guvttiid | 2 | | adverb |
| reahččut | reahccut | 2 | | Adverb |
| nuorttabelde | nuorttabealde | 1 | | adverb/po/pp |
| doppe-garadastimis | doppe-garradastimis | 1 | | Adverb + derivated noun compound |
| elege | elge | 1 | | conjunction |
| cuohtiguokteloginjeallje | čuohtiguokteloginjeallje | 1 | | čuohti |
| heajosvuohtaláhkaroaddan | heajosvuođaláhkaroaddan | 2 | | Deadjectival noun + noun + noun compound |
| heajosvuohtaláhkaroattaidegis | heajosvuođaláhkaroattaidegis | 2 | | Deadjectival noun + noun + noun compound + clitic |
| jávrralaččaid | jávrrálaččaid | 1 | | Denominal adj |
| nieidažaččas | nieiddažaččas | 1 | | Denominal noun |
| masti-NRK:as | masti-NRK:s | 1 | | Derivated noun + acro compound |
| árjjátlovihiidda | árjjátluovihiidda | 1 | | Derivated noun - from name |
| divtasvuonagadoalvu | divttasvuonagadoalvu | 1 | | Derivated noun - from name + derivated noun compound |
| dustlassan | dustolassan | 1 | | Derived adj |
| almmehemiidduostumiin | almmihemiidduostumiin | 1 | | Derived adj + derived noun compound |
| dolkesáhkasat | dolkisáhkasat | 1 | | Deverbal noun + noun compound |
| čájetmeahtun | čájetmeahttun | 1 | | Deverbal adj |
| divttemeahttumat | diktemeahttumat | 2 | | Deverbal adj |
| láhppemeahtungarasvuođaid | láhppemeahttungarasvuođaid | 1 | | Deverbal adj + derivated noun compound |
| čielastuvvanbárteuksa | cielastuvvanbárteuksa | 1 | | Deverbal deverbal noun + noun + noun compound |
| cielástuvvanbárteuvssaiguinge | cielastuvvanbárteuvssaiguinge | 1 | | Deverbal deverbal noun + noun + noun compound + clitic |
| čalmástuvvamuššii | čalmmástuvvamuššii | 1 | | Deverbal noun |
| čalmmástuvamušago | čalmmástuvvamušago | 1 | | Deverbal noun + clitic |
| bargahaddat | barggahaddat | 1 | | Deverbal verb |
| birgegoadán | birgegoađán | 1 | | Deverbal verb |
| čielastuvažit | cielastuvažit | 1 | | Deverbal verb |
| duolmmastestit | duolmmasteastit | 1 | | Deverbal verb |
| garvelesttáledje | garvileasttáledje | 2 | | Deverbal verb |
| reavggáhattot | reavggáhaddot | 2 | | Deverbal verb |
| suhtastuvvabeahti | suhtastuvvabeahtti | 1 | | Deverbal verb |
| barggahaddalinban | barggahattalinban | 2 | | Deverbal verb + clitic |
| birgegoađaledjenai | birgegoađáledjenai | 1 | | Deverbal verb + clitic |
| 10-jahkasacča | 10-jahkásačča | 2 | | Digit + noun compound |
| dildddahkamiinnis | dilddedahkamiinnis | 1 | | Gen noun + derivated noun compound |
| geagugeahčai | geagugeahčái | 1 | | Gen noun + noun compound |
| allonbiernaiguin | allonbiertnaiguin | 1 | | Infinite verb + noun compound |
| bugu | buhu | 1 | | interjection |
| moaddeloht | moaddelot | 1 | | moadde |
| Gaskavággerratráššan | Gaskavággeerratráššan | 1 | | Name |
| Svartneskilenas | Svartnæskilenis | 2 | | Name |
| Svartnæskilenashal | Svartnæskilenishal | 1 | | Name + clitic |
| Koskkivuori-plánenreaiddut | Koskivuori-plánenreaiddut | 1 | | Name + noun + noun compound |
| Davimusaláš-geađggát | Davimušaláš-geađggát | 1 | | Name + noun compound |
| Guovdageainu-Romssa | Guovdageainnu-Romssa | 1 | | Name gen + name compound |
| bárggakeahttá | barggakeahttá | 1 | | Nominal verbform |
| alitgearde | alitgeardi | 1 | | not compounding form |
| beaive | beaivi | 1 | | not compounding form |
| váive | váivi | 1 | | not compounding form |
| gálvoguodahaddanidjagárvvuin | gálvoguođahaddanidjagárvvuin | 1 | | Noun + deverbal deverbal noun + noun + noun compound |
| goadatcielastuvvanbárteuvsa | goađátcielastuvvanbárteuvsa | 2 | | Noun + deverbal deverbal noun + noun + noun compound |
| máistenhuksemušaset | máistinhuksemušaset | 1 | | Noun + deverbal noun compound |
| muorrageasgahpirii | muorrageašgahpirii | 1 | | Noun + noun + noun compound |
| olbmuidbarggovissir | olbmuidbargovissir | 1 | | Noun + noun + noun compound |
| bearjdatáigge | bearjadatáigge | 1 | | Noun + noun with shortened genitive compound |
| gáttebuidda | gáttibuidda | 1 | | Noun comparative |
| guoktenuppelotčoarvvagiin | guovttenuppelotčoarvvagiin | 2 | | Number + adj compound |
| guovttenuplotnamagiid | guovttenuppelotnamagiid | 2 | | Number + adj compound |
| máŋgasoarttagiiguin | máŋggasoarttagiiguin | 1 | | Number + adj compound |
| njealječuođiduhátgávccičuođiviđainnuppelogiin | njeallječuođiduhátgávccičuođiviđainnuppelogiin | 1 | | Number compound |
| njealječuođiduhátgávccičuođiviđanuppelotlágániidda | njeallječuođiduhátgávccičuođiviđanuppelotlágániidda | 1 | | Number compound + adj compound |
| amahan | ammahan | 1 | | particle |
| anti-dábalaččaide | anti-dábálaččaide | 1 | | Prefix + derivated adj |
| Orjješ-láhppoluobbalis | Orjješ-Láhpoluobbalis | 2 | | Prefix + Name |
| seammalagana | seammalágána | 2 | | Pron + adj compound |
| gaikaide | gaikkaide | 1 | | Pron+Indef+Pl+Ill |
| dahttai | dahtai | 1 | | Pron+Indef+Sg+Nom |
| makkaris | makkáris | 1 | | Pron+Interr+Sg+Loc |
| leahkoset | lehkoset | 1 | | V+IV+Imprt+Pl3 |
| lietne | letne | 1 | | V+IV+Ind+Prs+Du1 |
| leabá | leaba | 1 | | V+IV+Ind+Prs+Du3 |
| lehppet | lehpet | 1 | | V+IV+Ind+Prs+Pl2 |
| alloská | alloska | 1 | | V+IV+Neg+Imprt+Du3 |
| állos | allos | 1 | | V+IV+Neg+Imprt+Sg3 |
| eahpi | eahppi | 1 | | V+IV+Neg+Ind+Du2 |
| eaŧ | eat | 1 | | V+IV+Neg+Ind+Pl1 |
| eahpet | ehpet | 1 | | V+IV+Neg+Ind+Pl2 |

