Speller Test Results for: «sp-regression-pl-smj.txt»
Overview
Technical data
Language tested: smj
Document tested: sp-regression-pl-smj.txt
Speller tool: Polderland command line tool
Speller tool version: spellSamiLule version 103.3.0.4, Mar 23 2009, 13:53:28
Speller lexicon version: Julevsáme, version 2.2, 20111221-51681
Test Date: 20111222-1240
Test Type: regression
Processing time (user+system time): No data available
Speed: No data available
Max memory usage of the speller (max value of /bin/ps -o rss= PID sampled every 10 second): No data available
Result summary
Nº of input words: 496
Nº of spelling errors: 158 (31,85% of all words)
| Speller view | Wrong tokenisation | |||
|---|---|---|---|---|
| Speller Positive (Nº of flagged words): 312 | Speller Negative (Nº of accepted words): 184 | All tokenisation errors: 0 |
||
| Reality | Nº of real errors: 158 | Nº of true positives (detected real errors): 156 | Nº of false negatives (unflagged spelling errors): 2 | Nº of errouneously tokenized spelling errors: 0 |
| Nº of real correct words: 338 | Nº of false positives (incorrectly flagged words): 156 | Nº of true negatives (unflagged correct words): 182 | Nº of errouneously tokenized correct words: 0 |
|
Precision (tp/(tp+fp)):
50,0%
Recall (tp/(tp+fn)):
98,73%
Accuracy ((tp+tn)/words):
68,15%
| Spelling errors: 158 | ||||
|---|---|---|---|---|
| Simple errors (edit dist. 1): | Errors with edit distance 2: | Errors with edit distance ≥3: |
||
| Suggestion statistics for true positives (= 156 = 100%): | 108 (68,35%) | 42 (26,58%) | 8 (5,06%) | |
| Nº of detected spelling errors with correct suggestion in top 3: | 48 (30.77 %) | 33 | 13 | 2 |
| Nº of detected spelling errors with correct suggestionbelow top 3: | 4 (2.56 %) | 4 | 0 | 0 |
| Nº of detected spelling errors with only wrong suggestions: | 68 (43.59 %) | 48 | 18 | 2 |
| Nº of detected spelling errors with no suggestions at all: | 36 (23.08 %) | 22 | 10 | 4 |
| Undetected spelling errors: | 2 | 1 | 1 | 0 |
True positives (156)
Spelling errors with correct suggestions (52)
| Input word | Expected correction | Editing distance | Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| Nárvijkan | Narvijkan | 1 |
| 407 | missing smj names; entry was SUBed |
| maññel | maŋŋel | 2 |
| 427 | ń not part of word |
| mańńel | maŋŋel | 2 |
| 427 | ń not part of word |
| VG:an | VG:n | 1 |
| 503 | acro inflection |
| 1983:as | 1983:s | 1 |
| 509 | accepts SUB-ed entries |
| 1983:aj | 1983:j | 1 |
| 509 | accepts SUB-ed entries |
| VG:as | VG:s | 1 |
| 509 | accepts SUB-ed entries |
| VG:aj | VG:j | 1 |
| 509 | accepts SUB-ed entries |
| dåjmadibmesáhka | dåjmadibmesak | 3 |
| 536 | "impossible" compound forms |
| 1883:a | 1883 | 2 |
| 555 | speller does not suggest 1883 for 1883:a |
| NRK:a | NRK | 2 |
| 555 | speller does not suggest 1883 for 1883:a |
| åvdåsvastet | åvdåsvásstet | 2 |
| 556 | non-existent word accepted |
| 38jahkásasj | 38-jahkásasj | 1 |
| 580 | suggests 38jahkásasj for 38-jahkásasj |
| vasstet | vásstet | 1 |
| 589 | suggestions involving vowel-replacement |
| vihttalágásj | vijddalágásj | 3 |
| 619 | numerals and pronouns to NAMÁK and SASJ fails |
| gålmmålágásj | gålmålágásj | 1 |
| 619 | numerals and pronouns to NAMÁK and SASJ fails |
| Divtasvuona-Oarjevuona | Divtasvuona-Oarjjevuona | 1 |
| 634 | Prop gen + hyphen + Prop gen |
| Kitaadábálasj | Kitaa-dábálasj | 1 |
| 649 | name+adj compounds without hyphen |
| Sirgesdábálasj | Sirges-dábálasj | 1 |
| 649 | name+adj compounds without hyphen |
| badjeGájnaj | Badje-Gájnaj | 2 |
| 650 | noun prefix+name compound without hyphen |
| nuppáj | nubbáj | 2 |
| 651 | pp>bb phonetic rule does not apply |
| rappe | rabbe | 2 |
| 651 | pp>bb phonetic rule does not apply |
| SABMe | SÁBME | 2 |
| 652 | UPPERCASE-typos only get acronym-suggestions |
| SABME | SÁBME | 1 |
| 652 | UPPERCASE-typos only get acronym-suggestions |
| saame | sáme | 2 |
| 658 | suggestions saame (suggestions Saame (with capital S) is proper so in that meaning this bug won't be fixed unless we remove the proper-noun) |
| 10-biejvvásattja | 10-bæjvvásattja | 2 |
| 711 | numeral compounds and cases |
| akttak | aktak | 1 |
| 744 | numerals + clitic |
| madduj | mädduj | 1 |
| 776 | norm-fst does not recognize ä when u in sec. syll. |
| dabttjum | däbttjum | 1 |
| 776 | norm-fst does not recognize ä when u in sec. syll. |
| Seskaröcd | Seskarö-cd | 1 |
| 805 | Nouns/propers+acronyms |
| Berntv | Bern-tv | 1 |
| 805 | Nouns/propers+acronyms |
| itjas | ietjas | 1 |
| 815 | Case-forms of iesj not recognized |
| itjanis | ietjanis | 1 |
| 815 | Case-forms of iesj not recognized |
| alasis | allasis | 1 |
| 815 | Case-forms of iesj not recognized |
| itjajnis | ietjajnis | 1 |
| 815 | Case-forms of iesj not recognized |
| CV'j | CV:j | 1 |
| 913 | acronyms+cases |
| NRK:ajn | NRK:jn | 1 |
| 913 | acronyms+cases |
| muottejahkásasj | moattejahkásasj | 2 |
| 963 | lot of adjectives missing |
| muottestávval | moattestávval | 2 |
| 963 | lot of adjectives missing |
| ådo | ådå | 1 |
| 963 | lot of adjectives missing |
| suorra | stuorra | 1 |
| 963 | lot of adjectives missing |
| buovre | buorre | 1 |
| 963 | lot of adjectives missing |
| moattijienalasj | moattejienalasj | 1 |
| 963 | lot of adjectives missing |
| minimalla | minimálla | 1 |
| 963 | lot of adjectives missing |
| misska | miesska | 1 |
| 963 | lot of adjectives missing |
| miha | mihá | 1 |
| 963 | lot of adjectives missing |
| dábalasj | dábálasj | 1 |
| 963 | lot of adjectives missing |
| digitala | digitála | 1 |
| 963 | lot of adjectives missing |
| Almoduvváj | Almoduváj | 1 |
| 971 | uvváj and uvvát |
| aneduvváj | aneduváj | 1 |
| 971 | uvváj and uvvát |
| mujttaluvvát | mujttaluvvat | 1 |
| 971 | uvváj and uvvát |
| bielkeduvváj | bielkeduváj | 1 |
| 971 | uvváj and uvvát |
Spelling errors with only incorrect suggestions (68)
| Input word | Expected correction | Editing distance | Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| barrnev | bárnev | 2 |
| ||
| Galileajávrregáttev | Galilea-jávrregáttev | 1 |
| ||
| bievdedåhpe | bievddedåhpe | 1 |
| 511 | doesn't follow comp tags |
| stuora-Fuoskok | stuorafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| Stuora-Fuoskok | Stuorafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| stuorFuoskok | stuorafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| StuorFuoskok | Stuorafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| stuor-Fuoskok | stuorafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| Stuor-Fuoskok | Stuorafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| stuorFuosskok | Stuor-Fuosskok | 2 |
| 518 | nominal + Prop/Der compound |
| stuorra-Fuoskok | stuorrafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| Stuorra-Fuoskok | Stuorrafuoskok | 2 |
| 518 | nominal + Prop/Der compound |
| stuoraFuoskok | stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| StuoraFuoskok | Stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| stuora-fuoskok | stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| stuor-fuoskok | stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| Stuor-fuoskok | Stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| StuorFuosskok | Stuor-Fuosskok | 1 |
| 518 | nominal + Prop/Der compound |
| stuorraFuoskok | stuorrafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| StuorraFuoskok | Stuorrafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| stuorra-fuoskok | stuorrafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| Stuorra-fuoskok | Stuorrafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| Bq:a | Bq | 2 |
| 555 | speller does not suggest 1883 for 1883:a |
| iemeLinn | ieme-Linn | 1 |
| 595 | prefix+name as split comp without hyphen |
| tsåhkeLot | tsåhke-Lot | 1 |
| 595 | prefix+name as split comp without hyphen |
| Divtasvuonaahke | Divtasvuona-ahke | 1 |
| 617 | left-compound-tags affects proper-nouns too |
| Lofåhtajguollima | Lofåhta-guollima | 1 |
| 617 | left-compound-tags affects proper-nouns too |
| Råmsåjdille | Råmså-dille | 1 |
| 617 | left-compound-tags affects proper-nouns too |
| oajvij | åjvij | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| oajvijda | åjvijda | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| oajvijn | åjvijn | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| oajvijs | åjvijs | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| Aktaalmasj | Akta almasj | 1 |
| 641 | numeral+noun compounds |
| guovtealmatja | guovte almatja | 1 |
| 641 | numeral+noun compounds |
| 1700-Låhko | 1700-låhko | 1 |
| 647 | numerals+NOUN |
| gáktsalågeantjuotakta | gáktsalågenantjuotakta | 1 |
| 692 | numeral-variants |
| gietjatjuodevitta | gietjatjuodevihtta | 1 |
| 692 | numeral-variants |
| guk | guok | 1 |
| 692 | numeral-variants |
| guklåk | guoklåk | 1 |
| 692 | numeral-variants |
| guokduhat | guokduhát | 1 |
| 692 | numeral-variants |
| guoktjut | guoktjuot | 1 |
| 692 | numeral-variants |
| guoktuvsan | guoktuvsán | 1 |
| 692 | numeral-variants |
| vittatjuot | vihttatjuot | 1 |
| 692 | numeral-variants |
| 10ijjs | 10-ijá | 3 |
| 711 | numeral compounds and cases |
| 21bieivi | 21-biejve | 3 |
| 711 | numeral compounds and cases |
| 1996:itd | 1996:jt | 2 |
| 711 | numeral compounds and cases |
| javlla-CDan | javlla-CD:n | 1 |
| 717 | noun-acro compounds |
| gålmmågahper | gålmmå gahpera | 2 |
| 721 | nom- and gen-numerals make compounds with nouns |
| åvtåmuora | åvtå muora | 1 |
| 721 | nom- and gen-numerals make compounds with nouns |
| guovtesuollu | guovte suollu | 1 |
| 721 | nom- and gen-numerals make compounds with nouns |
| giehtjavge | gietjavge | 1 |
| 744 | numerals + clitic |
| guhtage | guhttage | 1 |
| 744 | numerals + clitic |
| guokteges | guoktege | 1 |
| 744 | numerals + clitic |
| guoktetk | guoktek | 1 |
| 744 | numerals + clitic |
| guovttege | guovtege | 1 |
| 744 | numerals + clitic |
| njielljege | nielljege | 1 |
| 744 | numerals + clitic |
| tjuohtik | tjuohtek | 1 |
| 744 | numerals + clitic |
| vidange | vidánge | 1 |
| 744 | numerals + clitic |
| amuorra | a-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| Cmuorra | C-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| dmuorra | d-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| Emuorra | E-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| muorraas | muorra-as | 1 |
| 805 | Nouns/propers+acronyms |
| muorraNRK | muorra-NRK | 1 |
| 805 | Nouns/propers+acronyms |
| ænujn | enujn | 1 |
| 816 | word not recognized |
| NRK-an | NRK:n | 2 |
| 913 | acronyms+cases |
| nummar291 | nummar-91 | 1 |
| 1144 | numerals compound to nouns without hyphen |
| 0-mujttalibmesáhka | 0-mujttalimesáhka | 1 |
| 1145 | impossible compounds |
Spelling errors without suggestions (36)
| Input word | Expected correction | Editing distance | Bug ID | Comment |
|---|---|---|---|---|
| Allaskåvllådåjmadagas | Allaskåvllådåjmadagás | 1 | compound with typo, no suggestion | |
| Barggomárnanin | Barggomárnánin | 1 | compound with typo, no suggestion | |
| vidjor9jit | vidjurijt | 3 | 398 | vidjor9jit vs vidjurijt |
| álggoálmmugis-álggoálmmugij | álggoálmmugis álggoálmmugij | 1 | 482 | double hyphen |
| sábmeguolbbe | sámeguolbbe | 1 | 489 | speller does not follow compound-tagging |
| suobmaguolbbe | suomaguolbbe | 1 | 489 | speller does not follow compound-tagging |
| varresvuohtastasjåvnnå | varresvuodastasjåvnnå | 2 | 506 | doesn't follow comp-tags |
| Nijbegoasskem | Nijbbegoasskem | 1 | 511 | doesn't follow comp tags |
| uvsagáldo | uksagáldo | 1 | 511 | doesn't follow comp tags |
| Stuora-fuoskok | Stuorafuoskok | 1 | 518 | nominal + Prop/Der compound |
| bednagadtjbijlla | bednagatjijlla | 2 | 526 | wrong compounding forms accepted |
| jávrádtjbijlla | jávrásjbijlla | 2 | 526 | wrong compounding forms accepted |
| væráldbijlla | væráltbijlla | 1 | 526 | wrong compounding forms accepted |
| ráddnávuohtamuorra | ráddnávuodamuorra | 2 | 536 | "impossible" compound forms |
| bievdeguorra | bievddeguorra | 1 | 539 | speller does not follow compound-tagging |
| gájtjabájkka | gájtsabajkka | 2 | 557 | Missing suggestion when multiple errors across compound boundary |
| girkkomusijkkaRa | girkkomusijkkára | 2 | 582 | noun+Prop without hyphen |
| NRKGE | NRK-EG | 2 | 607 | acro-compounds without hyphen |
| me-rijkaieŋŋilsgiellaj | Amerijka-ieŋŋilsgiellaj | 3 | 616 | Bispadime-me-ráden |
| Åhpadim-me-konsulænnta | Åhpadim-m-konsulænnta | 1 | 616 | Bispadime-me-ráden |
| Åhpadis-me-konsulænnta | Åhpadis-m-konsulænnta | 1 | 616 | Bispadime-me-ráden |
| Bispadime-me-ráden | Bispadime-m-ráden | 1 | 616 | Bispadime-me-ráden |
| gettsugadtjajbiejvve | gettsugattjajbiejvve | 1 | 624 | gen of ÅLLAGASJ-adj as first prt of cmp |
| sjimugadtjabiejvve | sjimugattjabiejvve | 1 | 624 | gen of ÅLLAGASJ-adj as first prt of cmp |
| kåntåvrråa-jådediddje | kåntåvrrå-Raa-jådediddje | 3 | 629 | a taking part of compound |
| giehtjalåkvitta | giehtjalåkvihtta | 1 | 692 | numeral-variants |
| lågeanguoktatjuodeguhttalåk | lågenanguoktatjuodeguhttalåk | 1 | 692 | numeral-variants |
| nelljelåk | nielljelåk | 1 | 692 | numeral-variants |
| NSR-Lindon | NRK-London | 3 | 805 | Nouns/propers+acronyms |
| NRK-Finnmarko | NRK-Finnmárko | 1 | 805 | Nouns/propers+acronyms |
| rijkajuohkostjiektjamav | rijkajuogostjiektjamav | 2 | 817 | should not be accepted |
| Nuortalijguovlojn | Nuorttalijguovlojn | 1 | 916 | prefixes not working |
| polárdutkamin | polardutkamin | 1 | 916 | prefixes not working |
| NRKNRKNRK | NRK-NRK-NRK | 2 | 933 | acros compound to each other without hyphens |
| Åhpaditge-konsulænnta | Åhpadit-g-konsulænnta | 2 | 1122 | inf+noun compounds |
| bullar1-dutkamin | bullar-1-dutkamin | 1 | 1144 | numerals compound to nouns without hyphen |
False positives (156)
False positives with suggestions (103)
| Input word | Suggestions | Bug ID | Comment |
|---|---|---|---|
| oahppamdiededimev |
| actio compound | |
| siellosujtov |
| missing compound | |
| Svierigadárogielan |
| name compound i Kintel och Korhon=vuonadár...med liten bok | |
| vidjurijt |
| 398 | vidjor9jit vs vidjurijt |
| allaskåvllågähttjalus |
| 411 | Swedish ä |
| Allaskåvllågähttjalus |
| 411 | Swedish ä |
| däbbaga |
| 411 | ä not recognised |
| dæbbaga |
| 411 | ä not recognised |
| gæhtjáj |
| 411 | ä not recognised |
| gähtjáj |
| 411 | ä not recognised |
| akta |
| 418 | missing numeral |
| ållo |
| 418 | missing numeral |
| guokta |
| 418 | missing numeral |
| stivrrimássjetjállagij |
| 419 | 3-part compound |
| alvvásláhkáj |
| 421 | adj+noun |
| buorredagojt |
| 421 | adj+noun |
| gievramuorra |
| 421 | adj-gen+noun |
| gievrasalmasj |
| 421 | adj-attr+noun |
| gievrasmuorra |
| 421 | adj-attr+noun |
| gievrramuorra |
| 421 | adj-nom+noun |
| mælggatåhpadusán |
| 421 | adj+noun |
| skihpahuodnahijn |
| 421 | adj+noun |
| skihpasujtár |
| 421 | adj+noun |
| M:jda |
| 435 | roman numerals missing |
| MXVII:j |
| 435 | roman numerals missing |
| XII |
| 435 | roman numerals missing |
| XV:s |
| 435 | roman numerals missing |
| lågenanguokta |
| 481 | missing numeral (12) |
| lågenanniellja |
| 481 | missing numeral (14) |
| millijåvnå |
| 481 | missing numerals |
| millijåvnnå |
| 481 | missing numerals |
| álggoálmmugij |
| 482 | double hyphen |
| álggoálmmugis |
| 482 | double hyphen |
| guovlojn |
| 482 | double hyphen |
| sámeguolbbe |
| 489 | speller does not follow compound-tagging |
| bijllage |
| 496 | unrecognised clitics |
| bijllak |
| 496 | unrecognised clitics |
| máhkak |
| 496 | unrecognised clitics |
| ájge |
| 507 | unrec words |
| ájgev |
| 507 | unrec words |
| ájgij |
| 507 | unrec words |
| báŋkas |
| 507 | unrec words |
| báŋŋka |
| 507 | unrec words |
| báŋŋkaj |
| 507 | unrec words |
| biehtse |
| 507 | unrec words |
| biehtsemánno |
| 507 | unrec words |
| biele |
| 507 | unrec words |
| dile |
| 507 | unrec words |
| fámon |
| 507 | unrec words |
| guossa |
| 507 | unrec words |
| mierredimájgijt |
| 507 | unrec words |
| muorra |
| 507 | unrec words |
| siebrijs |
| 507 | unrec words |
| siebrre |
| 507 | unrec words |
| skåvlås |
| 507 | unrec words |
| skåvlåsj |
| 507 | unrec words |
| suohkanin |
| 507 | unrec words |
| vuojnno |
| 507 | unrec words |
| bievddedåhpe |
| 511 | doesn't follow comp tags |
| stuorafuoskok |
| 518 | nominal + Prop/Der compound |
| stuorrafuoskok |
| 518 | nominal + Prop/Der compound |
| dåjmadimsáhka |
| 536 | "impossible" compound forms |
| muorraguorra |
| 539 | speller does not follow compound-tagging |
| lågenanguokta |
| 594 | lågenanguokte not recognized |
| lågenanguoktáj |
| 594 | lågenanguokte not recognized |
| lågenanguovtes |
| 594 | lågenanguokte not recognized |
| C-giella |
| 596 | C-giellan is not accepted |
| guoktalåk |
| 599 | numeral attr:s on lot |
| vihttalåk |
| 599 | numeral attr:s on lot |
| sáme-dáro |
| 600 | Gen+hyph compound |
| bargogáddjon |
| 615 | actios and actors as second part of compound |
| bargojduvddemis |
| 615 | actios and actors as second part of compound |
| Rájerasstimijn |
| 615 | actios and actors as second part of compound |
| julev- |
| 627 | prefix + hyhpen does not get accepted |
| gáktsalågenantjuotakta |
| 692 | numeral-variants |
| guok |
| 692 | numeral-variants |
| guokduhát |
| 692 | numeral-variants |
| guoklåk |
| 692 | numeral-variants |
| guoktjuot |
| 692 | numeral-variants |
| nielljelåk |
| 692 | numeral-variants |
| vihttatjuot |
| 692 | numeral-variants |
| 10-ijá |
| 711 | numeral compounds and cases |
| 21-biejve |
| 711 | numeral compounds and cases |
| javlla-CD |
| 717 | noun-acro compounds |
| gietjavge |
| 744 | numerals + clitic |
| guhttage |
| 744 | numerals + clitic |
| guoktege |
| 744 | numerals + clitic |
| guoktek |
| 744 | numerals + clitic |
| guovtege |
| 744 | numerals + clitic |
| nielljege |
| 744 | numerals + clitic |
| vidánge |
| 744 | numerals + clitic |
| a-muorra |
| 785 | does not recognize alphabet-abbr+noun |
| C-muorra |
| 785 | does not recognize alphabet-abbr+noun |
| d-muorra |
| 785 | does not recognize alphabet-abbr+noun |
| E-muorra |
| 785 | does not recognize alphabet-abbr+noun |
| muorra-as |
| 805 | Nouns/propers+acronyms |
| muorra-NRK |
| 805 | Nouns/propers+acronyms |
| enujn |
| 816 | word not recognized |
| boahtemtjuovggat |
| 837 | noun+adjective compounds |
| boahtemtjuovggis |
| 837 | noun+adjective compounds |
| muorraguovggat |
| 837 | noun+adjective compounds |
| muorraguovggis |
| 837 | noun+adjective compounds |
| nummar-91 |
| 1144 | numerals compound to nouns without hyphen |
False positives without suggestions (53)
Skåvllådåjmadak sebrudakalmasj riegádimgånågis Barggogaskostime åbmudaksiebrre oahppambiednikvuogádagá Iednegiellaoahppaábnnasa bielleájggeåhpadussaj Allaskåvllådievnastusdåjmadahka polardutkamin suomaguolbbe VG-kafean VG-biebbmo varresvuodastasjåvnnå joarkkaskåvlåv isetábnas álmmukallaskåvlån allaskåvllåj niehkkealmasjvuorro merragálvvogárvvo goasskemrájddohadde uksagáldo Nijbbegoasskem Stuorrafuoskok Stuorafuoskok væráltbijlla jávrásjbijlla bednagasjbijlla Sákkaldakguolle ráddnávuodamuorra Noajddudakmuorra patronmuorra lihttemaskinsielgge kaninmuorra doktorandgukse muorajguorra girkkomusijkkár jubmeldievnnoiellemij lågenanguovtev gålmmålåk rájijrasstimin kåntåvrråjådediddje 051-bágo miereunnedimskåvve 1700-LÅHKO 0-juojggamaárbbedábev lågenanguoktatjuodeguhttalåk guoktuvsán gietjatjuodevihtta giehtjalåkvihtta boatsoj-sujtto Nuorttalijguovlojn
False negatives (2)
| Input word | Expected correction | Editing distance | Bug ID | Comment |
|---|---|---|---|---|
| 0-nubbejådediddje | 0-ruobbejådediddje | 2 | 648 | unmotivated suggestions with numeral+noun |
| CV-s | CV:s | 1 | 913 | acronyms+cases |
True negatives (182)
The correctly accepted words are listed here for easy copy&paste into Word to quick check regressions in new versions of the speller: No correctly spelled words accepted by an earlier speller should be rejected by later spellers. To check, just copy and paste these words to Word, and see if you get any red underlines. Duplicate words are not repeated, unless one is followed by some punctuation. The words are reverse sorted according to Unicode character code.
X:j X Vuostasj vuostasj vuostamuttjan vuolgadime vuojnávge Voaga vissa Visa- Vindela VG:s VG:n VG:j VG vällahit vahkkusasj vællahit USA urmak tjådålasj svieriga- Svieriga Suortta stuorra Stuorgiedde snutjek snuhpalasj Skilkká Seskarö-cd Sálatædno Sálat Ruogŋo riegádibme råsjmak råntsak Råmsså PDF: PDF Ofuohttá oanegasj nuppev Nuorttalij NSR: NSR- NSR NRK-NRK-NRK NRK-London NRK-Finnmárko NRK-EG NRK:n NRK:jn Nordlánnda Narvijkan nälggot nælggot Myra-Land muoralasj munjik moattestávval moattejienalasj moattejahkásasj minimálla mihá mige miesska mierredimijt miedek maŋŋel mavga mánnusasj månge Malmöj måjge M ludtjusasj lásjmak lånudim- L:jn L juohkkahasj juoga joavddelasj jållåribme Jáhkkulasj jærjálasj jáddamahtes IV ietjas ietjanis ietjajnis ietjá ienebuv huohppelasj gus guovtekultuvralasj guhtis guhkaájggásattjat grámmalasj goalmát gievrrak gievrrage gievrak gievrage gávtsebiejvvásasj Gasska-Nordlánnda gaskostibme garrasabmusit gålmålágásj gågulasj Fuolldá Dundarevuobme doarek Divtasvuona-Oarjjevuona divna Diksná digitála deblak dáv Dát dát dássta dássásasj dasi dárbbolasj danna dan dålusjájggásasj Dálásjájggásasj dakkirtjuodak dakkirijt dakkirge dájt dájs Dajna Daj dábálasj D:n D CV:s CV:j Børde-Rene buorre Bern-tv BB-Rød-Rene Bálákvuovdde bajedimij åvtånálán avtanálán åvtålágásj avtak åvddånimev åvdåsvásstet åvdåstibmáj asstelasj ásjmak Arentz-Hansen Åmo allasis ållagasj älla aktak ájnegasj Ájluokta Áillun Áillonieida Áillohaš Áili Áile-Kaas Áile Áila ådå Accra 38-jahkásasj 3:sijn 3:dijn 1996:jt 1992-93:s 1983:s 1983:j 10-bæjvvásattja
Grouped by bug #
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| vidjor9jit | vidjurijt | 3 |
| vidjor9jit vs vidjurijt |
| vidjurijt |
| vidjor9jit vs vidjurijt |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Nárvijkan | Narvijkan | 1 |
| missing smj names; entry was SUBed |
| Narvijkan | missing smj names; entry was SUBed | |||
| Svieriga | missing names | |||
| Malmöj | missing names |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| nælggot | ä not recognised | |||
| nälggot | ä not recognised | |||
| dæbbaga |
| ä not recognised | ||
| däbbaga |
| ä not recognised | ||
| vællahit | ä not recognised | |||
| vällahit | ä not recognised | |||
| gæhtjáj |
| ä not recognised | ||
| gähtjáj |
| ä not recognised | ||
| Allaskåvllågähttjalus |
| Swedish ä | ||
| allaskåvllågähttjalus |
| Swedish ä |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| ållo |
| missing numeral | ||
| akta |
| missing numeral | ||
| guokta |
| missing numeral |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Allaskåvllådievnastusdåjmadahka |
| 3-part compound | ||
| bielleájggeåhpadussaj |
| 3-part compound | ||
| oahppambiednikvuogádagá |
| 3-part compound | ||
| stivrrimássjetjállagij |
| 3-part compound | ||
| Iednegiellaoahppaábnnasa |
| 3-part compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bajedimij | 3-syllable actio | |||
| gaskostibme | 3-syllable actio | |||
| jållåribme | 3-syllable actio | |||
| mierredimijt | 3-syllable actio | |||
| åvddånimev | missing 3-syllabic actio | |||
| åvdåstibmáj | 3-syllable actio N+Sg+Ill |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| alvvásláhkáj |
| adj+noun | ||
| buorredagojt |
| adj+noun | ||
| Dálásjájggásasj | adj+noun | |||
| dålusjájggásasj | adj+noun | |||
| gievrasalmasj |
| adj-attr+noun | ||
| gievrasmuorra |
| adj-attr+noun | ||
| gievramuorra |
| adj-gen+noun | ||
| gievrramuorra |
| adj-nom+noun | ||
| mælggatåhpadusán |
| adj+noun | ||
| skihpahuodnahijn |
| adj+noun | ||
| skihpasujtár |
| adj+noun |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| garrasabmusit | adj->adv |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| maññel | maŋŋel | 2 |
| ń not part of word |
| mańńel | maŋŋel | 2 |
| ń not part of word |
| maŋŋel | ń not part of word |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| X | roman numerals missing | |||
| L | roman numerals missing | |||
| D | roman numerals missing | |||
| M | roman numerals missing | |||
| X:j | roman numerals missing | |||
| D:n | roman numerals missing | |||
| L:jn | roman numerals missing | |||
| M:jda |
| roman numerals missing | ||
| IV | roman numerals missing | |||
| XII |
| roman numerals missing | ||
| XV:s |
| roman numerals missing | ||
| MXVII:j |
| roman numerals missing |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| lågenanniellja |
| missing numeral (14) | ||
| lågenanguokta |
| missing numeral (12) | ||
| millijåvnå |
| missing numerals | ||
| millijåvnnå |
| missing numerals |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Nuorttalij | double hyphen | |||
| guovlojn |
| double hyphen | ||
| álggoálmmugis |
| double hyphen | ||
| álggoálmmugij |
| double hyphen | ||
| álggoálmmugis-álggoálmmugij | álggoálmmugis álggoálmmugij | 1 |
| double hyphen |
| polardutkamin |
| double hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| USA | missing bare abbrs | |||
| NSR | missing bare abbrs | |||
| missing bare abbrs | ||||
| PDF: | missing bare abbrs | |||
| NSR- | missing bare abbrs | |||
| NSR: | missing bare abbrs |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Áila | missing names | |||
| Áile | missing names | |||
| Áili | missing names | |||
| Áillun | missing names |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| dan | missing pronoun | |||
| dáv | missing pronoun | |||
| dát | missing pronoun | |||
| Daj | missing pronoun | |||
| Dát | missing pronoun | |||
| Dajna | missing pronoun | |||
| dájs | missing pronoun | |||
| dasi | missing pronoun | |||
| dájt | missing pronoun | |||
| dássta | missing pronoun | |||
| danna | missing pronoun | |||
| divna | missing pronoun | |||
| ietjá | missing pronoun | |||
| guhtis | missing pronoun | |||
| avtak | missing pronoun | |||
| juoga | missing pronoun | |||
| nuppev | missing pronoun | |||
| mige | missing pronoun | |||
| mavga | missing pronoun | |||
| ienebuv | missing pronoun | |||
| dakkirijt | missing pronoun | |||
| juohkkahasj | missing pronoun | |||
| vuostamuttjan | missing pronouns |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| sábmeguolbbe | sámeguolbbe | 1 |
| speller does not follow compound-tagging |
| suobmaguolbbe | suomaguolbbe | 1 |
| speller does not follow compound-tagging |
| sámeguolbbe |
| speller does not follow compound-tagging | ||
| suomaguolbbe |
| speller does not follow compound-tagging |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| máhkak |
| unrecognised clitics | ||
| månge | unrecognised clitics | |||
| bijllage |
| unrecognised clitics | ||
| bijllak |
| unrecognised clitics | ||
| vuojnávge | unrecognised clitics | |||
| gievrrak | unrecognised clitics gievrra+Nom | |||
| gievrrage | unrecognised clitics gievrra+Nom | |||
| gievrak | unrecognised clitics gievrra+Gen | |||
| gievrage | unrecognised clitics gievrra+Gen | |||
| måjge | unrecognised clitics | |||
| munjik | unrecognised clitics | |||
| dakkirge | unrecognised clitics |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| VG:an | VG:n | 1 |
| acro inflection |
| VG:n | acro inflection | |||
| VG | nom and comp of acro | |||
| VG-kafean |
| nom and comp of acro | ||
| VG-biebbmo |
| nom and comp of acro |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| varresvuohtastasjåvnnå | varresvuodastasjåvnnå | 2 |
| doesn't follow comp-tags |
| varresvuodastasjåvnnå |
| doesn't follow comp-tags |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| fámon |
| unrec words | ||
| ájge |
| unrec words | ||
| ájgev |
| unrec words | ||
| ájgij |
| unrec words | ||
| vissa | unrec words | |||
| siebrijs |
| unrec words | ||
| siebrre |
| unrec words | ||
| Visa- | unrec words | |||
| báŋŋka |
| unrec words | ||
| báŋkas |
| unrec words | ||
| báŋŋkaj |
| unrec words | ||
| biehtsemánno |
| unrec words | ||
| joarkkaskåvlåv |
| unrec words | ||
| skåvlås |
| unrec words | ||
| skåvlåsj |
| unrec words | ||
| álmmukallaskåvlån |
| unrec words | ||
| allaskåvllåj |
| unrec words | ||
| isetábnas |
| unrec words | ||
| biele |
| unrec words | ||
| mierredimájgijt |
| unrec words | ||
| muorra |
| unrec words | ||
| suohkanin |
| unrec words | ||
| guossa |
| unrec words | ||
| biehtse |
| unrec words | ||
| vuojnno |
| unrec words | ||
| dile |
| unrec words | ||
| gågulasj | unrec words | |||
| Jáhkkulasj | unrec words | |||
| tjådålasj | unrec words | |||
| dárbbolasj | unrec words | |||
| oanegasj | unrec words | |||
| ájnegasj | unrec words | |||
| joavddelasj | unrec words | |||
| huohppelasj | unrec words | |||
| asstelasj | unrec words | |||
| ållagasj | unrec words | |||
| dássásasj | unrec words | |||
| ludtjusasj | unrec words | |||
| vahkkusasj | unrec words | |||
| mánnusasj | unrec words | |||
| deblak | unrec words | |||
| ásjmak | unrec words | |||
| råsjmak | unrec words | |||
| urmak | unrec words | |||
| råntsak | unrec words | |||
| lásjmak | unrec words | |||
| miedek | unrec words | |||
| snutjek | unrec words | |||
| doarek | unrec words |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| VG:as | VG:s | 1 |
| accepts SUB-ed entries |
| VG:aj | VG:j | 1 |
| accepts SUB-ed entries |
| 1983:as | 1983:s | 1 |
| accepts SUB-ed entries |
| 1983:aj | 1983:j | 1 |
| accepts SUB-ed entries |
| VG:s | accepts SUB-ed entries | |||
| VG:j | accepts SUB-ed entries | |||
| 1983:s | accepts SUB-ed entries | |||
| 1983:j | accepts SUB-ed entries |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| niehkkealmasjvuorro |
| 3-part compound | ||
| goasskemrájddohadde |
| 3-part compound | ||
| merragálvvogárvvo |
| 3-part compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Nijbegoasskem | Nijbbegoasskem | 1 |
| doesn't follow comp tags |
| bievdedåhpe | bievddedåhpe | 1 |
| doesn't follow comp tags |
| uvsagáldo | uksagáldo | 1 |
| doesn't follow comp tags |
| Nijbbegoasskem |
| doesn't follow comp tags | ||
| bievddedåhpe |
| doesn't follow comp tags | ||
| uksagáldo |
| doesn't follow comp tags |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| muoralasj | deriv -lasj not recognised | |||
| jærjálasj | deriv -lasj not recognised | |||
| grámmalasj | deriv -lasj not recognised | |||
| snuhpalasj | deriv -lasj not recognised |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| stuorra-fuoskok | stuorrafuoskok | 1 |
| nominal + Prop/Der compound |
| stuorraFuoskok | stuorrafuoskok | 1 |
| nominal + Prop/Der compound |
| stuorra-Fuoskok | stuorrafuoskok | 2 |
| nominal + Prop/Der compound |
| Stuorra-fuoskok | Stuorrafuoskok | 1 |
| nominal + Prop/Der compound |
| StuorraFuoskok | Stuorrafuoskok | 1 |
| nominal + Prop/Der compound |
| Stuorra-Fuoskok | Stuorrafuoskok | 2 |
| nominal + Prop/Der compound |
| stuora-fuoskok | stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| stuoraFuoskok | stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| stuora-Fuoskok | stuorafuoskok | 2 |
| nominal + Prop/Der compound |
| Stuora-fuoskok | Stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| StuoraFuoskok | Stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| Stuora-Fuoskok | Stuorafuoskok | 2 |
| nominal + Prop/Der compound |
| stuor-fuoskok | stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| stuorFuoskok | stuorafuoskok | 2 |
| nominal + Prop/Der compound |
| stuor-Fuoskok | stuorafuoskok | 2 |
| nominal + Prop/Der compound |
| Stuor-fuoskok | Stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| StuorFuoskok | Stuorafuoskok | 2 |
| nominal + Prop/Der compound |
| Stuor-Fuoskok | Stuorafuoskok | 2 |
| nominal + Prop/Der compound |
| stuorFuosskok | Stuor-Fuosskok | 2 |
| nominal + Prop/Der compound |
| StuorFuosskok | Stuor-Fuosskok | 1 |
| nominal + Prop/Der compound |
| Stuorafuoskok |
| nominal + Prop/Der compound | ||
| Stuorrafuoskok |
| nominal + Prop/Der compound | ||
| stuorafuoskok |
| nominal + Prop/Der compound | ||
| stuorrafuoskok |
| nominal + Prop/Der compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| væráldbijlla | væráltbijlla | 1 |
| wrong compounding forms accepted |
| bednagadtjbijlla | bednagatjijlla | 2 |
| wrong compounding forms accepted |
| jávrádtjbijlla | jávrásjbijlla | 2 |
| wrong compounding forms accepted |
| væráltbijlla |
| wrong compounding forms accepted | ||
| bednagasjbijlla |
| wrong compounding forms accepted | ||
| jávrásjbijlla |
| wrong compounding forms accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| ráddnávuohtamuorra | ráddnávuodamuorra | 2 |
| "impossible" compound forms |
| dåjmadibmesáhka | dåjmadibmesak | 3 |
| "impossible" compound forms |
| Noajddudakmuorra |
| "impossible" compound forms | ||
| Sákkaldakguolle |
| "impossible" compound forms | ||
| ráddnávuodamuorra |
| "impossible" compound forms | ||
| dåjmadimsáhka |
| "impossible" compound forms |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| kaninmuorra |
| some short compound-forms not working | ||
| patronmuorra |
| some short compound-forms not working | ||
| lihttemaskinsielgge |
| some short compound-forms not working | ||
| doktorandgukse |
| some short compound-forms not working |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bievdeguorra | bievddeguorra | 1 |
| speller does not follow compound-tagging |
| muorraguorra |
| speller does not follow compound-tagging | ||
| muorajguorra |
| speller does not follow compound-tagging |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Áillohaš | names wrongly converted from sme to smj | |||
| Áillonieida | names wrongly converted from sme to smj | |||
| Arentz-Hansen | names wrongly converted from sme to smj |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 1883:a | 1883 | 2 |
| speller does not suggest 1883 for 1883:a |
| Bq:a | Bq | 2 |
| speller does not suggest 1883 for 1883:a |
| NRK:a | NRK | 2 |
| speller does not suggest 1883 for 1883:a |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| åvdåsvásstet | non-existent word accepted | |||
| åvdåsvastet | åvdåsvásstet | 2 |
| non-existent word accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gájtjabájkka | gájtsabajkka | 2 |
| Missing suggestion when multiple errors across compound boundary |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Børde-Rene | name+name suggestions with double hyphens | |||
| Áile-Kaas | name+name suggestions with double hyphens | |||
| Myra-Land | name+name suggestions with double hyphens | |||
| BB-Rød-Rene | name+name suggestions with double hyphens |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 38jahkásasj | 38-jahkásasj | 1 |
| suggests 38jahkásasj for 38-jahkásasj |
| 38-jahkásasj | suggests 38jahkásasj for 38-jahkásasj |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| girkkomusijkkár |
| noun+Prop without hyphen | ||
| girkkomusijkkaRa | girkkomusijkkára | 2 |
| noun+Prop without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| jubmeldievnnoiellemij |
| word not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| vasstet | vásstet | 1 |
| suggestions involving vowel-replacement |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| lågenanguokta |
| lågenanguokte not recognized | ||
| lågenanguovtev |
| lågenanguokte not recognized | ||
| lågenanguovtes |
| lågenanguokte not recognized | ||
| lågenanguoktáj |
| lågenanguokte not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| tsåhkeLot | tsåhke-Lot | 1 |
| prefix+name as split comp without hyphen |
| iemeLinn | ieme-Linn | 1 |
| prefix+name as split comp without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| C-giella |
| C-giellan is not accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gålmmålåk |
| numeral attr:s on lot | ||
| guoktalåk |
| numeral attr:s on lot | ||
| vihttalåk |
| numeral attr:s on lot |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| sáme-dáro |
| Gen+hyph compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| NRKGE | NRK-EG | 2 |
| acro-compounds without hyphen |
| NRK-EG | acro-compounds without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Rájerasstimijn |
| actios and actors as second part of compound | ||
| rájijrasstimin |
| actios and actors as second part of compound | ||
| bargojduvddemis |
| actios and actors as second part of compound | ||
| bargogáddjon |
| actios and actors as second part of compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Bispadime-me-ráden | Bispadime-m-ráden | 1 |
| Bispadime-me-ráden |
| Åhpadim-me-konsulænnta | Åhpadim-m-konsulænnta | 1 |
| Bispadime-me-ráden |
| Åhpadis-me-konsulænnta | Åhpadis-m-konsulænnta | 1 |
| Bispadime-me-ráden |
| me-rijkaieŋŋilsgiellaj | Amerijka-ieŋŋilsgiellaj | 3 |
| Bispadime-me-ráden |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Lofåhtajguollima | Lofåhta-guollima | 1 |
| left-compound-tags affects proper-nouns too |
| Råmsåjdille | Råmså-dille | 1 |
| left-compound-tags affects proper-nouns too |
| Divtasvuonaahke | Divtasvuona-ahke | 1 |
| left-compound-tags affects proper-nouns too |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| oajvijda | åjvijda | 2 |
| dipht simpl oajvve>åjvijn |
| oajvijs | åjvijs | 2 |
| dipht simpl oajvve>åjvijn |
| oajvij | åjvij | 2 |
| dipht simpl oajvve>åjvijn |
| oajvijn | åjvijn | 2 |
| dipht simpl oajvve>åjvijn |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| åvtånálán | numerals and pronouns to NAMÁK and SASJ fails | |||
| gålmålágásj | numerals and pronouns to NAMÁK and SASJ fails | |||
| avtanálán | numerals and pronouns to NAMÁK and SASJ fails | |||
| åvtålágásj | numerals and pronouns to NAMÁK and SASJ fails | |||
| gávtsebiejvvásasj | numerals and pronouns to NAMÁK and SASJ fails | |||
| dakkirtjuodak | numerals and pronouns to NAMÁK and SASJ fails | |||
| gålmmålágásj | gålmålágásj | 1 |
| numerals and pronouns to NAMÁK and SASJ fails |
| vihttalágásj | vijddalágásj | 3 |
| numerals and pronouns to NAMÁK and SASJ fails |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gettsugadtjajbiejvve | gettsugattjajbiejvve | 1 |
| gen of ÅLLAGASJ-adj as first prt of cmp |
| sjimugadtjabiejvve | sjimugattjabiejvve | 1 |
| gen of ÅLLAGASJ-adj as first prt of cmp |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Ájluokta | lule sami names missing | |||
| Åmo | lule sami names missing | |||
| Skilkká | lule sami names missing | |||
| Ofuohttá | lule sami names missing | |||
| Gasska-Nordlánnda | lule sami names missing | |||
| Nordlánnda | lule sami names missing | |||
| Råmsså | lule sami names missing | |||
| Fuolldá | lule sami names missing | |||
| Voaga | lule sami names missing | |||
| Sálat | lule sami names missing | |||
| Suortta | lule sami names missing | |||
| Stuorgiedde | lule sami names missing | |||
| Sálatædno | lule sami names missing | |||
| Ruogŋo | lule sami names missing | |||
| Dundarevuobme | lule sami names missing | |||
| Diksná | lule sami names missing | |||
| Bálákvuovdde | lule sami names missing | |||
| Vindela | lule sami names missing | |||
| Accra | lule sami names missing |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| julev- |
| prefix + hyhpen does not get accepted | ||
| svieriga- | prefix + hyhpen does not get accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| kåntåvrråa-jådediddje | kåntåvrrå-Raa-jådediddje | 3 |
| a taking part of compound |
| kåntåvrråjådediddje |
| a taking part of compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 051-bágo |
| number compound, numbers starting with 0 |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Divtasvuona-Oarjjevuona | Prop gen + hyphen + Prop gen | |||
| Divtasvuona-Oarjevuona | Divtasvuona-Oarjjevuona | 1 |
| Prop gen + hyphen + Prop gen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| miereunnedimskåvve |
| +Left-marked words as middle part of compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Aktaalmasj | Akta almasj | 1 |
| numeral+noun compounds |
| guovtealmatja | guovte almatja | 1 |
| numeral+noun compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 1992-93:s | cased numeral+numeral compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 1700-LÅHKO |
| numerals+NOUN | ||
| 1700-Låhko | 1700-låhko | 1 |
| numerals+NOUN |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 0-nubbejådediddje | 0-ruobbejådediddje | 2 | unmotivated suggestions with numeral+noun | |
| 0-juojggamaárbbedábev |
| unmotivated suggestions with numeral+noun |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Kitaadábálasj | Kitaa-dábálasj | 1 |
| name+adj compounds without hyphen |
| Sirgesdábálasj | Sirges-dábálasj | 1 |
| name+adj compounds without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| badjeGájnaj | Badje-Gájnaj | 2 |
| noun prefix+name compound without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| nuppáj | nubbáj | 2 |
| pp>bb phonetic rule does not apply |
| rappe | rabbe | 2 |
| pp>bb phonetic rule does not apply |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| SABME | SÁBME | 1 |
| UPPERCASE-typos only get acronym-suggestions |
| SABMe | SÁBME | 2 |
| UPPERCASE-typos only get acronym-suggestions |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| saame | sáme | 2 |
| suggestions saame (suggestions Saame (with capital S) is proper so in that meaning this bug won't be fixed unless we remove the proper-noun) |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 3:sijn | caseforms, ordinals and collectives | |||
| 3:dijn | caseforms, ordinals and collectives |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| giehtjalåkvihtta |
| numeral-variants | ||
| gietjatjuodevihtta |
| numeral-variants | ||
| gáktsalågenantjuotakta |
| numeral-variants | ||
| vihttatjuot |
| numeral-variants | ||
| guok |
| numeral-variants | ||
| guoklåk |
| numeral-variants | ||
| guoktjuot |
| numeral-variants | ||
| guokduhát |
| numeral-variants | ||
| guoktuvsán |
| numeral-variants | ||
| lågenanguoktatjuodeguhttalåk |
| numeral-variants | ||
| nielljelåk |
| numeral-variants | ||
| giehtjalåkvitta | giehtjalåkvihtta | 1 |
| numeral-variants |
| gietjatjuodevitta | gietjatjuodevihtta | 1 |
| numeral-variants |
| gáktsalågeantjuotakta | gáktsalågenantjuotakta | 1 |
| numeral-variants |
| vittatjuot | vihttatjuot | 1 |
| numeral-variants |
| guk | guok | 1 |
| numeral-variants |
| guklåk | guoklåk | 1 |
| numeral-variants |
| guoktjut | guoktjuot | 1 |
| numeral-variants |
| guokduhat | guokduhát | 1 |
| numeral-variants |
| guoktuvsan | guoktuvsán | 1 |
| numeral-variants |
| lågeanguoktatjuodeguhttalåk | lågenanguoktatjuodeguhttalåk | 1 |
| numeral-variants |
| nelljelåk | nielljelåk | 1 |
| numeral-variants |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 21bieivi | 21-biejve | 3 |
| numeral compounds and cases |
| 10ijjs | 10-ijá | 3 |
| numeral compounds and cases |
| 1996:itd | 1996:jt | 2 |
| numeral compounds and cases |
| 10-biejvvásattja | 10-bæjvvásattja | 2 |
| numeral compounds and cases |
| 21-biejve |
| numeral compounds and cases | ||
| 10-ijá |
| numeral compounds and cases | ||
| 1996:jt | numeral compounds and cases | |||
| 10-bæjvvásattja | numeral compounds and cases |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| javlla-CD |
| noun-acro compounds | ||
| javlla-CDan | javlla-CD:n | 1 |
| noun-acro compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| åvtåmuora | åvtå muora | 1 |
| nom- and gen-numerals make compounds with nouns |
| guovtesuollu | guovte suollu | 1 |
| nom- and gen-numerals make compounds with nouns |
| gålmmågahper | gålmmå gahpera | 2 |
| nom- and gen-numerals make compounds with nouns |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| aktak | numerals + clitic | |||
| guoktek |
| numerals + clitic | ||
| guoktege |
| numerals + clitic | ||
| guovtege |
| numerals + clitic | ||
| nielljege |
| numerals + clitic | ||
| vidánge |
| numerals + clitic | ||
| guhttage |
| numerals + clitic | ||
| gietjavge |
| numerals + clitic | ||
| akttak | aktak | 1 |
| numerals + clitic |
| guoktetk | guoktek | 1 |
| numerals + clitic |
| guokteges | guoktege | 1 |
| numerals + clitic |
| guovttege | guovtege | 1 |
| numerals + clitic |
| njielljege | nielljege | 1 |
| numerals + clitic |
| vidange | vidánge | 1 |
| numerals + clitic |
| guhtage | guhttage | 1 |
| numerals + clitic |
| giehtjavge | gietjavge | 1 |
| numerals + clitic |
| tjuohtik | tjuohtek | 1 |
| numerals + clitic |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| madduj | mädduj | 1 |
| norm-fst does not recognize ä when u in sec. syll. |
| dabttjum | däbttjum | 1 |
| norm-fst does not recognize ä when u in sec. syll. |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| amuorra | a-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| Cmuorra | C-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| dmuorra | d-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| Emuorra | E-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| a-muorra |
| does not recognize alphabet-abbr+noun | ||
| C-muorra |
| does not recognize alphabet-abbr+noun | ||
| d-muorra |
| does not recognize alphabet-abbr+noun | ||
| E-muorra |
| does not recognize alphabet-abbr+noun |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Seskaröcd | Seskarö-cd | 1 |
| Nouns/propers+acronyms |
| Berntv | Bern-tv | 1 |
| Nouns/propers+acronyms |
| muorraas | muorra-as | 1 |
| Nouns/propers+acronyms |
| muorraNRK | muorra-NRK | 1 |
| Nouns/propers+acronyms |
| NRK-Finnmarko | NRK-Finnmárko | 1 |
| Nouns/propers+acronyms |
| NSR-Lindon | NRK-London | 3 |
| Nouns/propers+acronyms |
| Seskarö-cd | Nouns/propers+acronyms | |||
| Bern-tv | Nouns/propers+acronyms | |||
| muorra-as |
| Nouns/propers+acronyms | ||
| muorra-NRK |
| Nouns/propers+acronyms | ||
| NRK-Finnmárko | Nouns/propers+acronyms | |||
| NRK-London | Nouns/propers+acronyms |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| ietjas | Case-forms of iesj not recognized | |||
| ietjanis | Case-forms of iesj not recognized | |||
| allasis | Case-forms of iesj not recognized | |||
| ietjajnis | Case-forms of iesj not recognized | |||
| itjas | ietjas | 1 |
| Case-forms of iesj not recognized |
| itjanis | ietjanis | 1 |
| Case-forms of iesj not recognized |
| alasis | allasis | 1 |
| Case-forms of iesj not recognized |
| itjajnis | ietjajnis | 1 |
| Case-forms of iesj not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| enujn |
| word not recognized | ||
| ænujn | enujn | 1 |
| word not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| rijkajuohkostjiektjamav | rijkajuogostjiektjamav | 2 |
| should not be accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| muorraguovggat |
| noun+adjective compounds | ||
| muorraguovggis |
| noun+adjective compounds | ||
| boahtemtjuovggat |
| noun+adjective compounds | ||
| boahtemtjuovggis |
| noun+adjective compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| boatsoj-sujtto |
| hyphens or not in compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| CV-s | CV:s | 1 | acronyms+cases | |
| CV'j | CV:j | 1 |
| acronyms+cases |
| NRK:ajn | NRK:jn | 1 |
| acronyms+cases |
| NRK-an | NRK:n | 2 |
| acronyms+cases |
| CV:s | acronyms+cases | |||
| CV:j | acronyms+cases | |||
| NRK:jn | acronyms+cases | |||
| NRK:n | acronyms+cases |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| polárdutkamin | polardutkamin | 1 |
| prefixes not working |
| Nuortalijguovlojn | Nuorttalijguovlojn | 1 |
| prefixes not working |
| polardutkamin |
| prefixes not working | ||
| Nuorttalijguovlojn |
| prefixes not working |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| NRK-NRK-NRK | acros compound to each other without hyphens | |||
| NRKNRKNRK | NRK-NRK-NRK | 2 |
| acros compound to each other without hyphens |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| moattejahkásasj | lot of adjectives missing | |||
| moattestávval | lot of adjectives missing | |||
| moattejienalasj | lot of adjectives missing | |||
| minimálla | lot of adjectives missing | |||
| miesska | lot of adjectives missing | |||
| mihá | lot of adjectives missing | |||
| stuorra | lot of adjectives missing | |||
| ådå | lot of adjectives missing | |||
| buorre | lot of adjectives missing | |||
| dábálasj | lot of adjectives missing | |||
| digitála | lot of adjectives missing | |||
| muottejahkásasj | moattejahkásasj | 2 |
| lot of adjectives missing |
| muottestávval | moattestávval | 2 |
| lot of adjectives missing |
| moattijienalasj | moattejienalasj | 1 |
| lot of adjectives missing |
| minimalla | minimálla | 1 |
| lot of adjectives missing |
| misska | miesska | 1 |
| lot of adjectives missing |
| miha | mihá | 1 |
| lot of adjectives missing |
| suorra | stuorra | 1 |
| lot of adjectives missing |
| ådo | ådå | 1 |
| lot of adjectives missing |
| buovre | buorre | 1 |
| lot of adjectives missing |
| dábalasj | dábálasj | 1 |
| lot of adjectives missing |
| digitala | digitála | 1 |
| lot of adjectives missing |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Almoduvváj | Almoduváj | 1 |
| uvváj and uvvát |
| aneduvváj | aneduváj | 1 |
| uvváj and uvvát |
| mujttaluvvát | mujttaluvvat | 1 |
| uvváj and uvvát |
| bielkeduvváj | bielkeduváj | 1 |
| uvváj and uvvát |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Åhpaditge-konsulænnta | Åhpadit-g-konsulænnta | 2 |
| inf+noun compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bullar1-dutkamin | bullar-1-dutkamin | 1 |
| numerals compound to nouns without hyphen |
| nummar291 | nummar-91 | 1 |
| numerals compound to nouns without hyphen |
| nummar-91 |
| numerals compound to nouns without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 0-mujttalibmesáhka | 0-mujttalimesáhka | 1 |
| impossible compounds |
Testpairs not in bugs
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| barrnev | bárnev | 2 |
| |
| Galileajávrregáttev | Galilea-jávrregáttev | 1 |
| |
| oahppamdiededimev |
| actio compound | ||
| riegádimgånågis | actio compound | |||
| lånudim- | actio compound form | |||
| Skåvllådåjmadak | compound | |||
| Allaskåvllådåjmadagas | Allaskåvllådåjmadagás | 1 | compound with typo, no suggestion | |
| Barggomárnanin | Barggomárnánin | 1 | compound with typo, no suggestion | |
| guhkaájggásattjat | lexicalized compound | |||
| jáddamahtes | missing actio forms | |||
| riegádibme | missing actio forms | |||
| vuolgadime | missing actio forms | |||
| åbmudaksiebrre | missing compound | |||
| Barggogaskostime | missing compound | |||
| sebrudakalmasj | missing compound | |||
| siellosujtov |
| missing compound | ||
| goalmát | missing ordinal | |||
| nuppev | missing ordinal | |||
| vuostasj | missing ordinal | |||
| Vuostasj | missing ordinal | |||
| gus | missing particles | |||
| Svierigadárogielan |
| name compound i Kintel och Korhon=vuonadár...med liten bok | ||
| guovtekultuvralasj | no hash at compound border | |||
| älla | strange suggestions |

