Speller Test Results for: «sp-regression-pl-smj.txt»
Overview
Technical data
Language tested: smj
Document tested: sp-regression-pl-smj.txt
Speller tool: Polderland command line tool
Speller tool version: spellSamiLule version 103.3.0.4, Mar 23 2009, 13:53:28
Speller lexicon version: Julevsáme, version 2.2, 20120209-53852
Test Date: 20120213-1037
Test Type: regression
Processing time (user+system time): No data available
Speed: No data available
Max memory usage of the speller (max value of /bin/ps -o rss= PID sampled every 10 second): No data available
Result summary
Nº of input words: 494
Nº of spelling errors: 157 (31,78% of all words)
| Speller view | Wrong tokenisation | |||
|---|---|---|---|---|
| Speller Positive (Nº of flagged words): 144 | Speller Negative (Nº of accepted words): 350 | All tokenisation errors: 0 |
||
| Reality | Nº of real errors: 157 | Nº of true positives (detected real errors): 133 | Nº of false negatives (unflagged spelling errors): 24 | Nº of errouneously tokenized spelling errors: 0 |
| Nº of real correct words: 337 | Nº of false positives (incorrectly flagged words): 11 | Nº of true negatives (unflagged correct words): 326 | Nº of errouneously tokenized correct words: 0 |
|
Precision (tp/(tp+fp)):
92,36%
Recall (tp/(tp+fn)):
84,71%
Accuracy ((tp+tn)/words):
92,91%
| Spelling errors: 157 | ||||
|---|---|---|---|---|
| Simple errors (edit dist. 1): | Errors with edit distance 2: | Errors with edit distance ≥3: |
||
| Suggestion statistics for true positives (= 133 = 100%): | 107 (68,15%) | 42 (26,75%) | 8 (5,1%) | |
| Nº of detected spelling errors with correct suggestion in top 3: | 85 (63.91 %) | 60 | 22 | 3 |
| Nº of detected spelling errors with correct suggestionbelow top 3: | 7 (5.26 %) | 6 | 0 | 1 |
| Nº of detected spelling errors with only wrong suggestions: | 32 (24.06 %) | 24 | 6 | 2 |
| Nº of detected spelling errors with no suggestions at all: | 9 (6.77 %) | 5 | 3 | 1 |
| Undetected spelling errors: | 24 | 12 | 11 | 1 |
True positives (133)
Spelling errors with correct suggestions (92)
| Input word | Expected correction | Editing distance | Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| barrnev | bárnev | 2 |
| ||
| Galileajávrregáttev | Galilea-jávrregáttev | 1 |
| ||
| vidjor9jit | vidjurijt | 3 |
| 398 | vidjor9jit vs vidjurijt |
| Nárvijkan | Narvijkan | 1 |
| 407 | missing smj names; entry was SUBed |
| maññel | maŋŋel | 2 |
| 427 | ń not part of word |
| mańńel | maŋŋel | 2 |
| 427 | ń not part of word |
| sábmeguolbbe | sámeguolbbe | 1 |
| 489 | speller does not follow compound-tagging |
| suobmaguolbbe | suomaguolbbe | 1 |
| 489 | speller does not follow compound-tagging |
| VG:an | VG:n | 1 |
| 503 | acro inflection |
| varresvuohtastasjåvnnå | varresvuodastasjåvnnå | 2 |
| 506 | doesn't follow comp-tags |
| VG:as | VG:s | 1 |
| 509 | accepts SUB-ed entries |
| VG:aj | VG:j | 1 |
| 509 | accepts SUB-ed entries |
| StuorFuosskok | Stuor-Fuosskok | 1 |
| 518 | nominal + Prop/Der compound |
| bednagadtjbijlla | bednagatjijlla | 2 |
| 526 | wrong compounding forms accepted |
| væráldbijlla | væráltbijlla | 1 |
| 526 | wrong compounding forms accepted |
| ráddnávuohtamuorra | ráddnávuodamuorra | 2 |
| 536 | "impossible" compound forms |
| 1883:a | 1883 | 2 |
| 555 | speller does not suggest 1883 for 1883:a |
| NRK:a | NRK | 2 |
| 555 | speller does not suggest 1883 for 1883:a |
| åvdåsvastet | åvdåsvásstet | 2 |
| 556 | non-existent word accepted |
| 38jahkásasj | 38-jahkásasj | 1 |
| 580 | suggests 38jahkásasj for 38-jahkásasj |
| girkkomusijkkaRa | girkkomusijkkára | 2 |
| 582 | noun+Prop without hyphen |
| vasstet | vásstet | 1 |
| 589 | suggestions involving vowel-replacement |
| me-rijkaieŋŋilsgiellaj | Amerijka-ieŋŋilsgiellaj | 3 |
| 616 | Bispadime-me-ráden |
| Divtasvuonaahke | Divtasvuona-ahke | 1 |
| 617 | left-compound-tags affects proper-nouns too |
| oajvijda | åjvijda | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| oajvijs | åjvijs | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| oajvij | åjvij | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| oajvijn | åjvijn | 2 |
| 618 | dipht simpl oajvve>åjvijn |
| vihttalágásj | vijddalágásj | 3 |
| 619 | numerals and pronouns to NAMÁK and SASJ fails |
| gålmmålágásj | gålmålágásj | 1 |
| 619 | numerals and pronouns to NAMÁK and SASJ fails |
| gettsugadtjajbiejvve | gettsugattjajbiejvve | 1 |
| 624 | gen of ÅLLAGASJ-adj as first prt of cmp |
| sjimugadtjabiejvve | sjimugattjabiejvve | 1 |
| 624 | gen of ÅLLAGASJ-adj as first prt of cmp |
| Divtasvuona-Oarjevuona | Divtasvuona-Oarjjevuona | 1 |
| 634 | Prop gen + hyphen + Prop gen |
| 1700-Låhko | 1700-låhko | 1 |
| 647 | numerals+NOUN |
| Kitaadábálasj | Kitaa-dábálasj | 1 |
| 649 | name+adj compounds without hyphen |
| Sirgesdábálasj | Sirges-dábálasj | 1 |
| 649 | name+adj compounds without hyphen |
| badjeGájnaj | Badje-Gájnaj | 2 |
| 650 | noun prefix+name compound without hyphen |
| nuppáj | nubbáj | 2 |
| 651 | pp>bb phonetic rule does not apply |
| rappe | rabbe | 2 |
| 651 | pp>bb phonetic rule does not apply |
| SABMe | SÁBME | 2 |
| 652 | UPPERCASE-typos only get acronym-suggestions |
| SABME | SÁBME | 1 |
| 652 | UPPERCASE-typos only get acronym-suggestions |
| saame | sáme | 2 |
| 658 | suggestions saame (suggestions Saame (with capital S) is proper so in that meaning this bug won't be fixed unless we remove the proper-noun) |
| guk | guok | 1 |
| 692 | numeral-variants |
| giehtjalåkvitta | giehtjalåkvihtta | 1 |
| 692 | numeral-variants |
| gietjatjuodevitta | gietjatjuodevihtta | 1 |
| 692 | numeral-variants |
| gáktsalågeantjuotakta | gáktsalågenantjuotakta | 1 |
| 692 | numeral-variants |
| vittatjuot | vihttatjuot | 1 |
| 692 | numeral-variants |
| guklåk | guoklåk | 1 |
| 692 | numeral-variants |
| guoktjut | guoktjuot | 1 |
| 692 | numeral-variants |
| guokduhat | guokduhát | 1 |
| 692 | numeral-variants |
| guoktuvsan | guoktuvsán | 1 |
| 692 | numeral-variants |
| lågeanguoktatjuodeguhttalåk | lågenanguoktatjuodeguhttalåk | 1 |
| 692 | numeral-variants |
| nelljelåk | nielljelåk | 1 |
| 692 | numeral-variants |
| 10ijjs | 10-ijá | 3 |
| 711 | numeral compounds and cases |
| 10-biejvvásattja | 10-bæjvvásattja | 2 |
| 711 | numeral compounds and cases |
| guhtage | guhttage | 1 |
| 744 | numerals + clitic |
| tjuohtik | tjuohtek | 1 |
| 744 | numerals + clitic |
| giehtjavge | gietjavge | 1 |
| 744 | numerals + clitic |
| akttak | aktak | 1 |
| 744 | numerals + clitic |
| guoktetk | guoktek | 1 |
| 744 | numerals + clitic |
| guokteges | guoktege | 1 |
| 744 | numerals + clitic |
| guovttege | guovtege | 1 |
| 744 | numerals + clitic |
| njielljege | nielljege | 1 |
| 744 | numerals + clitic |
| vidange | vidánge | 1 |
| 744 | numerals + clitic |
| madduj | mädduj | 1 |
| 776 | norm-fst does not recognize ä when u in sec. syll. |
| dabttjum | däbttjum | 1 |
| 776 | norm-fst does not recognize ä when u in sec. syll. |
| Seskaröcd | Seskarö-cd | 1 |
| 805 | Nouns/propers+acronyms |
| Berntv | Bern-tv | 1 |
| 805 | Nouns/propers+acronyms |
| itjas | ietjas | 1 |
| 815 | Case-forms of iesj not recognized |
| itjanis | ietjanis | 1 |
| 815 | Case-forms of iesj not recognized |
| alasis | allasis | 1 |
| 815 | Case-forms of iesj not recognized |
| itjajnis | ietjajnis | 1 |
| 815 | Case-forms of iesj not recognized |
| ænujn | enujn | 1 |
| 816 | word not recognized |
| CV'j | CV:j | 1 |
| 913 | acronyms+cases |
| NRK:ajn | NRK:jn | 1 |
| 913 | acronyms+cases |
| polárdutkamin | polardutkamin | 1 |
| 916 | prefixes not working |
| Nuortalijguovlojn | Nuorttalijguovlojn | 1 |
| 916 | prefixes not working |
| muottejahkásasj | moattejahkásasj | 2 |
| 963 | lot of adjectives missing |
| muottestávval | moattestávval | 2 |
| 963 | lot of adjectives missing |
| ådo | ådå | 1 |
| 963 | lot of adjectives missing |
| buovre | buorre | 1 |
| 963 | lot of adjectives missing |
| suorra | stuorra | 1 |
| 963 | lot of adjectives missing |
| moattijienalasj | moattejienalasj | 1 |
| 963 | lot of adjectives missing |
| minimalla | minimálla | 1 |
| 963 | lot of adjectives missing |
| misska | miesska | 1 |
| 963 | lot of adjectives missing |
| miha | mihá | 1 |
| 963 | lot of adjectives missing |
| dábalasj | dábálasj | 1 |
| 963 | lot of adjectives missing |
| digitala | digitála | 1 |
| 963 | lot of adjectives missing |
| Almoduvváj | Almoduváj | 1 |
| 971 | uvváj and uvvát |
| aneduvváj | aneduváj | 1 |
| 971 | uvváj and uvvát |
| mujttaluvvát | mujttaluvvat | 1 |
| 971 | uvváj and uvvát |
| bielkeduvváj | bielkeduváj | 1 |
| 971 | uvváj and uvvát |
Spelling errors with only incorrect suggestions (32)
| Input word | Expected correction | Editing distance | Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| Barggomárnanin | Barggomárnánin | 1 |
| compound with typo, no suggestion | |
| álggoálmmugis-álggoálmmugij | álggoálmmugis álggoálmmugij | 1 |
| 482 | double hyphen |
| 1983:aj | 1983:j | 1 |
| 509 | accepts SUB-ed entries |
| 1983:as | 1983:s | 1 |
| 509 | accepts SUB-ed entries |
| stuorFuosskok | Stuor-Fuosskok | 2 |
| 518 | nominal + Prop/Der compound |
| stuora-fuoskok | stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| stuor-fuoskok | stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| Stuor-fuoskok | Stuorafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| stuorra-fuoskok | stuorrafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| Stuorra-fuoskok | Stuorrafuoskok | 1 |
| 518 | nominal + Prop/Der compound |
| jávrádtjbijlla | jávrásjbijlla | 2 |
| 526 | wrong compounding forms accepted |
| Bq:a | Bq | 2 |
| 555 | speller does not suggest 1883 for 1883:a |
| iemeLinn | ieme-Linn | 1 |
| 595 | prefix+name as split comp without hyphen |
| tsåhkeLot | tsåhke-Lot | 1 |
| 595 | prefix+name as split comp without hyphen |
| Åhpadim-me-konsulænnta | Åhpadim-m-konsulænnta | 1 |
| 616 | Bispadime-me-ráden |
| Åhpadis-me-konsulænnta | Åhpadis-m-konsulænnta | 1 |
| 616 | Bispadime-me-ráden |
| Lofåhtajguollima | Lofåhta-guollima | 1 |
| 617 | left-compound-tags affects proper-nouns too |
| Råmsåjdille | Råmså-dille | 1 |
| 617 | left-compound-tags affects proper-nouns too |
| kåntåvrråa-jådediddje | kåntåvrrå-Raa-jådediddje | 3 |
| 629 | a taking part of compound |
| Aktaalmasj | Akta almasj | 1 |
| 641 | numeral+noun compounds |
| guovtealmatja | guovte almatja | 1 |
| 641 | numeral+noun compounds |
| 21bieivi | 21-biejve | 3 |
| 711 | numeral compounds and cases |
| 1996:itd | 1996:jt | 2 |
| 711 | numeral compounds and cases |
| javlla-CDan | javlla-CD:n | 1 |
| 717 | noun-acro compounds |
| gålmmågahper | gålmmå gahpera | 2 |
| 721 | nom- and gen-numerals make compounds with nouns |
| åvtåmuora | åvtå muora | 1 |
| 721 | nom- and gen-numerals make compounds with nouns |
| guovtesuollu | guovte suollu | 1 |
| 721 | nom- and gen-numerals make compounds with nouns |
| amuorra | a-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| Cmuorra | C-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| dmuorra | d-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| Emuorra | E-muorra | 1 |
| 785 | does not recognize alphabet-abbr+noun |
| NRK-an | NRK:n | 2 |
| 913 | acronyms+cases |
Spelling errors without suggestions (9)
| Input word | Expected correction | Editing distance | Bug ID | Comment |
|---|---|---|---|---|
| Allaskåvllådåjmadagas | Allaskåvllådåjmadagás | 1 | compound with typo, no suggestion | |
| Stuora-fuoskok | Stuorafuoskok | 1 | 518 | nominal + Prop/Der compound |
| gájtjabájkka | gájtsabajkka | 2 | 557 | Missing suggestion when multiple errors across compound boundary |
| NRKGE | NRK-EG | 2 | 607 | acro-compounds without hyphen |
| Bispadime-me-ráden | Bispadime-m-ráden | 1 | 616 | Bispadime-me-ráden |
| NSR-Lindon | NRK-London | 3 | 805 | Nouns/propers+acronyms |
| NRK-Finnmarko | NRK-Finnmárko | 1 | 805 | Nouns/propers+acronyms |
| NRKNRKNRK | NRK-NRK-NRK | 2 | 933 | acros compound to each other without hyphens |
| bullar1-dutkamin | bullar-1-dutkamin | 1 | 1144 | numerals compound to nouns without hyphen |
False positives (11)
False positives with suggestions (8)
| Input word | Suggestions | Bug ID | Comment |
|---|---|---|---|
| M:jda |
| 435 | roman numerals missing |
| MXVII:j |
| 435 | roman numerals missing |
| XII |
| 435 | roman numerals missing |
| XV:s |
| 435 | roman numerals missing |
| stuorafuoskok |
| 518 | nominal + Prop/Der compound |
| stuorrafuoskok |
| 518 | nominal + Prop/Der compound |
| 051-bágo |
| 631 | number compound, numbers starting with 0 |
| boatsoj-sujtto |
| 894 | hyphens or not in compounds |
False positives without suggestions (3)
Stuorrafuoskok Stuorafuoskok 1700-LÅHKO
False negatives (24)
| Input word | Expected correction | Editing distance | Bug ID | Comment |
|---|---|---|---|---|
| bievdedåhpe | bievddedåhpe | 1 | 511 | doesn't follow comp tags |
| Nijbegoasskem | Nijbbegoasskem | 1 | 511 | doesn't follow comp tags |
| uvsagáldo | uksagáldo | 1 | 511 | doesn't follow comp tags |
| stuora-Fuoskok | stuorafuoskok | 2 | 518 | nominal + Prop/Der compound |
| Stuora-Fuoskok | Stuorafuoskok | 2 | 518 | nominal + Prop/Der compound |
| stuorFuoskok | stuorafuoskok | 2 | 518 | nominal + Prop/Der compound |
| StuorFuoskok | Stuorafuoskok | 2 | 518 | nominal + Prop/Der compound |
| stuor-Fuoskok | stuorafuoskok | 2 | 518 | nominal + Prop/Der compound |
| Stuor-Fuoskok | Stuorafuoskok | 2 | 518 | nominal + Prop/Der compound |
| stuorra-Fuoskok | stuorrafuoskok | 2 | 518 | nominal + Prop/Der compound |
| Stuorra-Fuoskok | Stuorrafuoskok | 2 | 518 | nominal + Prop/Der compound |
| stuoraFuoskok | stuorafuoskok | 1 | 518 | nominal + Prop/Der compound |
| StuoraFuoskok | Stuorafuoskok | 1 | 518 | nominal + Prop/Der compound |
| stuorraFuoskok | stuorrafuoskok | 1 | 518 | nominal + Prop/Der compound |
| StuorraFuoskok | Stuorrafuoskok | 1 | 518 | nominal + Prop/Der compound |
| dåjmadibmesáhka | dåjmadibmesak | 3 | 536 | "impossible" compound forms |
| bievdeguorra | bievddeguorra | 1 | 539 | speller does not follow compound-tagging |
| 0-nubbejådediddje | 0-ruobbejådediddje | 2 | 648 | unmotivated suggestions with numeral+noun |
| muorraas | muorra-as | 1 | 805 | Nouns/propers+acronyms |
| muorraNRK | muorra-NRK | 1 | 805 | Nouns/propers+acronyms |
| rijkajuohkostjiektjamav | rijkajuogostjiektjamav | 2 | 817 | should not be accepted |
| CV-s | CV:s | 1 | 913 | acronyms+cases |
| Åhpaditge-konsulænnta | Åhpadit-g-konsulænnta | 2 | 1122 | inf+noun compounds |
| 0-mujttalibmesáhka | 0-mujttalimesáhka | 1 | 1145 | impossible compounds |
True negatives (326)
The correctly accepted words are listed here for easy copy&paste into Word to quick check regressions in new versions of the speller: No correctly spelled words accepted by an earlier speller should be rejected by later spellers. To check, just copy and paste these words to Word, and see if you get any red underlines. Duplicate words are not repeated, unless one is followed by some punctuation. The words are reverse sorted according to Unicode character code.
X:j X Vuostasj vuostasj vuostamuttjan vuolgadime vuojnno vuojnávge Voaga vissa Visa- Vindela vihttatjuot vihttalåk vidjurijt vidánge VG-kafean VG-biebbmo VG:s VG:n VG:j VG varresvuodastasjåvnnå vällahit vahkkusasj væráltbijlla vællahit USA urmak uksagáldo tjådålasj Svierigadárogielan svieriga- Svieriga Suortta suomaguolbbe suohkanin stuorra Stuorgiedde stivrrimássjetjállagij snutjek snuhpalasj Skilkká skihpasujtár skihpahuodnahijn Skåvllådåjmadak skåvlåsj skåvlås siellosujtov siebrre siebrijs Seskarö-cd sebrudakalmasj sámeguolbbe sáme-dáro Sálatædno Sálat Sákkaldakguolle Ruogŋo riegádimgånågis riegádibme råsjmak råntsak Råmsså rájijrasstimin Rájerasstimijn ráddnávuodamuorra polardutkamin PDF: PDF patronmuorra Ofuohttá oanegasj oahppamdiededimev oahppambiednikvuogádagá nuppev Nuorttalijguovlojn Nuorttalij NSR: NSR- NSR NRK-NRK-NRK NRK-London NRK-Finnmárko NRK-EG NRK:n NRK:jn Nordlánnda Noajddudakmuorra Nijbbegoasskem nielljelåk nielljege niehkkealmasjvuorro Narvijkan nälggot nælggot Myra-Land muorra-NRK muorraguovggis muorraguovggat muorraguorra muorra-as muorra muoralasj muorajguorra munjik moattestávval moattejienalasj moattejahkásasj minimálla millijåvnnå millijåvnå mihá mige miesska mierredimijt mierredimájgijt miereunnedimskåvve miedek merragálvvogárvvo maŋŋel mavga mánnusasj månge Malmöj måjge máhkak mælggatåhpadusán M ludtjusasj lihttemaskinsielgge lásjmak lånudim- lågenanniellja lågenanguovtev lågenanguovtes lågenanguoktatjuodeguhttalåk lågenanguoktáj lågenanguokta L:jn L kåntåvrråjådediddje kaninmuorra juohkkahasj juoga julev- jubmeldievnnoiellemij joavddelasj joarkkaskåvlåv jávrásjbijlla javlla-CD jållåribme Jáhkkulasj jærjálasj jáddamahtes IV isetábnas ietjas ietjanis ietjajnis ietjá ienebuv Iednegiellaoahppaábnnasa huohppelasj gus guovtekultuvralasj guovtege guovlojn guossa guoktuvsán guoktjuot guoktek guoktege guoktalåk guokta guoklåk guokduhát guok guhttage guhtis guhkaájggásattjat grámmalasj goasskemrájddohadde goalmát girkkomusijkkár gievrramuorra gievrrak gievrrage gievrasmuorra gievrasalmasj gievramuorra gievrak gievrage gietjavge gietjatjuodevihtta giehtjalåkvihtta gávtsebiejvvásasj Gasska-Nordlánnda gaskostibme garrasabmusit gålmmålåk gålmålágásj gáktsalågenantjuotakta gähtjáj gågulasj gæhtjáj Fuolldá fámon enujn E-muorra Dundarevuobme doktorandgukse doarek d-muorra Divtasvuona-Oarjjevuona divna dile Diksná digitála deblak dáv Dát dát dássta dássásasj dasi dárbbolasj danna dan dålusjájggásasj Dálásjájggásasj dakkirtjuodak dakkirijt dakkirge dájt dájs Dajna dåjmadimsáhka Daj dæbbaga däbbaga dábálasj D:n D CV:s CV:j C-muorra C-giella Børde-Rene buorredagojt buorre boahtemtjuovggis boahtemtjuovggat bijllak bijllage bievddedåhpe bielleájggeåhpadussaj biele biehtsemánno biehtse Bern-tv bednagasjbijlla BB-Rød-Rene báŋŋkaj báŋŋka báŋkas bargojduvddemis bargogáddjon Barggogaskostime Bálákvuovdde bajedimij åvtånálán avtanálán åvtålágásj avtak åvddånimev åvdåsvásstet åvdåstibmáj asstelasj ásjmak Arentz-Hansen a-muorra Åmo alvvásláhkáj álmmukallaskåvlån ållo allaskåvllåj Allaskåvllågähttjalus allaskåvllågähttjalus Allaskåvllådievnastusdåjmadahka allasis ållagasj älla álggoálmmugis álggoálmmugij aktak akta ájnegasj Ájluokta ájgij ájgev ájge Áillun Áillonieida Áillohaš Áili Áile-Kaas Áile Áila ådå Accra åbmudaksiebrre 38-jahkásasj 3:sijn 3:dijn 21-biejve 1996:jt 1992-93:s 1983:s 1983:j 10-ijá 10-bæjvvásattja 0-juojggamaárbbedábev
Grouped by bug #
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| vidjor9jit | vidjurijt | 3 |
| vidjor9jit vs vidjurijt |
| vidjurijt | vidjor9jit vs vidjurijt |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Nárvijkan | Narvijkan | 1 |
| missing smj names; entry was SUBed |
| Narvijkan | missing smj names; entry was SUBed | |||
| Svieriga | missing names | |||
| Malmöj | missing names |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| nælggot | ä not recognised | |||
| nälggot | ä not recognised | |||
| dæbbaga | ä not recognised | |||
| däbbaga | ä not recognised | |||
| vællahit | ä not recognised | |||
| vällahit | ä not recognised | |||
| gæhtjáj | ä not recognised | |||
| gähtjáj | ä not recognised | |||
| Allaskåvllågähttjalus | Swedish ä | |||
| allaskåvllågähttjalus | Swedish ä |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| ållo | missing numeral | |||
| akta | missing numeral | |||
| guokta | missing numeral |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Allaskåvllådievnastusdåjmadahka | 3-part compound | |||
| bielleájggeåhpadussaj | 3-part compound | |||
| oahppambiednikvuogádagá | 3-part compound | |||
| stivrrimássjetjállagij | 3-part compound | |||
| Iednegiellaoahppaábnnasa | 3-part compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bajedimij | 3-syllable actio | |||
| gaskostibme | 3-syllable actio | |||
| jållåribme | 3-syllable actio | |||
| mierredimijt | 3-syllable actio | |||
| åvddånimev | missing 3-syllabic actio | |||
| åvdåstibmáj | 3-syllable actio N+Sg+Ill |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| alvvásláhkáj | adj+noun | |||
| buorredagojt | adj+noun | |||
| Dálásjájggásasj | adj+noun | |||
| dålusjájggásasj | adj+noun | |||
| gievrasalmasj | adj-attr+noun | |||
| gievrasmuorra | adj-attr+noun | |||
| gievramuorra | adj-gen+noun | |||
| gievrramuorra | adj-nom+noun | |||
| mælggatåhpadusán | adj+noun | |||
| skihpahuodnahijn | adj+noun | |||
| skihpasujtár | adj+noun |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| garrasabmusit | adj->adv |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| maññel | maŋŋel | 2 |
| ń not part of word |
| mańńel | maŋŋel | 2 |
| ń not part of word |
| maŋŋel | ń not part of word |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| X | roman numerals missing | |||
| L | roman numerals missing | |||
| D | roman numerals missing | |||
| M | roman numerals missing | |||
| X:j | roman numerals missing | |||
| D:n | roman numerals missing | |||
| L:jn | roman numerals missing | |||
| M:jda |
| roman numerals missing | ||
| IV | roman numerals missing | |||
| XII |
| roman numerals missing | ||
| XV:s |
| roman numerals missing | ||
| MXVII:j |
| roman numerals missing |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| lågenanniellja | missing numeral (14) | |||
| lågenanguokta | missing numeral (12) | |||
| millijåvnå | missing numerals | |||
| millijåvnnå | missing numerals |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Nuorttalij | double hyphen | |||
| guovlojn | double hyphen | |||
| álggoálmmugis | double hyphen | |||
| álggoálmmugij | double hyphen | |||
| álggoálmmugis-álggoálmmugij | álggoálmmugis álggoálmmugij | 1 |
| double hyphen |
| polardutkamin | double hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| USA | missing bare abbrs | |||
| NSR | missing bare abbrs | |||
| missing bare abbrs | ||||
| PDF: | missing bare abbrs | |||
| NSR- | missing bare abbrs | |||
| NSR: | missing bare abbrs |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Áila | missing names | |||
| Áile | missing names | |||
| Áili | missing names | |||
| Áillun | missing names |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| dan | missing pronoun | |||
| dáv | missing pronoun | |||
| dát | missing pronoun | |||
| Daj | missing pronoun | |||
| Dát | missing pronoun | |||
| Dajna | missing pronoun | |||
| dájs | missing pronoun | |||
| dasi | missing pronoun | |||
| dájt | missing pronoun | |||
| dássta | missing pronoun | |||
| danna | missing pronoun | |||
| divna | missing pronoun | |||
| ietjá | missing pronoun | |||
| guhtis | missing pronoun | |||
| avtak | missing pronoun | |||
| juoga | missing pronoun | |||
| nuppev | missing pronoun | |||
| mige | missing pronoun | |||
| mavga | missing pronoun | |||
| ienebuv | missing pronoun | |||
| dakkirijt | missing pronoun | |||
| juohkkahasj | missing pronoun | |||
| vuostamuttjan | missing pronouns |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| sábmeguolbbe | sámeguolbbe | 1 |
| speller does not follow compound-tagging |
| suobmaguolbbe | suomaguolbbe | 1 |
| speller does not follow compound-tagging |
| sámeguolbbe | speller does not follow compound-tagging | |||
| suomaguolbbe | speller does not follow compound-tagging |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| máhkak | unrecognised clitics | |||
| månge | unrecognised clitics | |||
| bijllage | unrecognised clitics | |||
| bijllak | unrecognised clitics | |||
| vuojnávge | unrecognised clitics | |||
| gievrrak | unrecognised clitics gievrra+Nom | |||
| gievrrage | unrecognised clitics gievrra+Nom | |||
| gievrak | unrecognised clitics gievrra+Gen | |||
| gievrage | unrecognised clitics gievrra+Gen | |||
| måjge | unrecognised clitics | |||
| munjik | unrecognised clitics | |||
| dakkirge | unrecognised clitics |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| VG:an | VG:n | 1 |
| acro inflection |
| VG:n | acro inflection | |||
| VG | nom and comp of acro | |||
| VG-kafean | nom and comp of acro | |||
| VG-biebbmo | nom and comp of acro |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| varresvuohtastasjåvnnå | varresvuodastasjåvnnå | 2 |
| doesn't follow comp-tags |
| varresvuodastasjåvnnå | doesn't follow comp-tags |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| fámon | unrec words | |||
| ájge | unrec words | |||
| ájgev | unrec words | |||
| ájgij | unrec words | |||
| vissa | unrec words | |||
| siebrijs | unrec words | |||
| siebrre | unrec words | |||
| Visa- | unrec words | |||
| báŋŋka | unrec words | |||
| báŋkas | unrec words | |||
| báŋŋkaj | unrec words | |||
| biehtsemánno | unrec words | |||
| joarkkaskåvlåv | unrec words | |||
| skåvlås | unrec words | |||
| skåvlåsj | unrec words | |||
| álmmukallaskåvlån | unrec words | |||
| allaskåvllåj | unrec words | |||
| isetábnas | unrec words | |||
| biele | unrec words | |||
| mierredimájgijt | unrec words | |||
| muorra | unrec words | |||
| suohkanin | unrec words | |||
| guossa | unrec words | |||
| biehtse | unrec words | |||
| vuojnno | unrec words | |||
| dile | unrec words | |||
| gågulasj | unrec words | |||
| Jáhkkulasj | unrec words | |||
| tjådålasj | unrec words | |||
| dárbbolasj | unrec words | |||
| oanegasj | unrec words | |||
| ájnegasj | unrec words | |||
| joavddelasj | unrec words | |||
| huohppelasj | unrec words | |||
| asstelasj | unrec words | |||
| ållagasj | unrec words | |||
| dássásasj | unrec words | |||
| ludtjusasj | unrec words | |||
| vahkkusasj | unrec words | |||
| mánnusasj | unrec words | |||
| deblak | unrec words | |||
| ásjmak | unrec words | |||
| råsjmak | unrec words | |||
| urmak | unrec words | |||
| råntsak | unrec words | |||
| lásjmak | unrec words | |||
| miedek | unrec words | |||
| snutjek | unrec words | |||
| doarek | unrec words |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| VG:as | VG:s | 1 |
| accepts SUB-ed entries |
| VG:aj | VG:j | 1 |
| accepts SUB-ed entries |
| 1983:as | 1983:s | 1 |
| accepts SUB-ed entries |
| 1983:aj | 1983:j | 1 |
| accepts SUB-ed entries |
| VG:s | accepts SUB-ed entries | |||
| VG:j | accepts SUB-ed entries | |||
| 1983:s | accepts SUB-ed entries | |||
| 1983:j | accepts SUB-ed entries |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| niehkkealmasjvuorro | 3-part compound | |||
| goasskemrájddohadde | 3-part compound | |||
| merragálvvogárvvo | 3-part compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Nijbegoasskem | Nijbbegoasskem | 1 | doesn't follow comp tags | |
| bievdedåhpe | bievddedåhpe | 1 | doesn't follow comp tags | |
| uvsagáldo | uksagáldo | 1 | doesn't follow comp tags | |
| Nijbbegoasskem | doesn't follow comp tags | |||
| bievddedåhpe | doesn't follow comp tags | |||
| uksagáldo | doesn't follow comp tags |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| muoralasj | deriv -lasj not recognised | |||
| jærjálasj | deriv -lasj not recognised | |||
| grámmalasj | deriv -lasj not recognised | |||
| snuhpalasj | deriv -lasj not recognised |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| stuorra-fuoskok | stuorrafuoskok | 1 |
| nominal + Prop/Der compound |
| stuorraFuoskok | stuorrafuoskok | 1 | nominal + Prop/Der compound | |
| stuorra-Fuoskok | stuorrafuoskok | 2 | nominal + Prop/Der compound | |
| Stuorra-fuoskok | Stuorrafuoskok | 1 |
| nominal + Prop/Der compound |
| StuorraFuoskok | Stuorrafuoskok | 1 | nominal + Prop/Der compound | |
| Stuorra-Fuoskok | Stuorrafuoskok | 2 | nominal + Prop/Der compound | |
| stuora-fuoskok | stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| stuoraFuoskok | stuorafuoskok | 1 | nominal + Prop/Der compound | |
| stuora-Fuoskok | stuorafuoskok | 2 | nominal + Prop/Der compound | |
| Stuora-fuoskok | Stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| StuoraFuoskok | Stuorafuoskok | 1 | nominal + Prop/Der compound | |
| Stuora-Fuoskok | Stuorafuoskok | 2 | nominal + Prop/Der compound | |
| stuor-fuoskok | stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| stuorFuoskok | stuorafuoskok | 2 | nominal + Prop/Der compound | |
| stuor-Fuoskok | stuorafuoskok | 2 | nominal + Prop/Der compound | |
| Stuor-fuoskok | Stuorafuoskok | 1 |
| nominal + Prop/Der compound |
| StuorFuoskok | Stuorafuoskok | 2 | nominal + Prop/Der compound | |
| Stuor-Fuoskok | Stuorafuoskok | 2 | nominal + Prop/Der compound | |
| stuorFuosskok | Stuor-Fuosskok | 2 |
| nominal + Prop/Der compound |
| StuorFuosskok | Stuor-Fuosskok | 1 |
| nominal + Prop/Der compound |
| Stuorafuoskok |
| nominal + Prop/Der compound | ||
| Stuorrafuoskok |
| nominal + Prop/Der compound | ||
| stuorafuoskok |
| nominal + Prop/Der compound | ||
| stuorrafuoskok |
| nominal + Prop/Der compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| væráldbijlla | væráltbijlla | 1 |
| wrong compounding forms accepted |
| bednagadtjbijlla | bednagatjijlla | 2 |
| wrong compounding forms accepted |
| jávrádtjbijlla | jávrásjbijlla | 2 |
| wrong compounding forms accepted |
| væráltbijlla | wrong compounding forms accepted | |||
| bednagasjbijlla | wrong compounding forms accepted | |||
| jávrásjbijlla | wrong compounding forms accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| ráddnávuohtamuorra | ráddnávuodamuorra | 2 |
| "impossible" compound forms |
| dåjmadibmesáhka | dåjmadibmesak | 3 | "impossible" compound forms | |
| Noajddudakmuorra | "impossible" compound forms | |||
| Sákkaldakguolle | "impossible" compound forms | |||
| ráddnávuodamuorra | "impossible" compound forms | |||
| dåjmadimsáhka | "impossible" compound forms |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| kaninmuorra | some short compound-forms not working | |||
| patronmuorra | some short compound-forms not working | |||
| lihttemaskinsielgge | some short compound-forms not working | |||
| doktorandgukse | some short compound-forms not working |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bievdeguorra | bievddeguorra | 1 | speller does not follow compound-tagging | |
| muorraguorra | speller does not follow compound-tagging | |||
| muorajguorra | speller does not follow compound-tagging |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Áillohaš | names wrongly converted from sme to smj | |||
| Áillonieida | names wrongly converted from sme to smj | |||
| Arentz-Hansen | names wrongly converted from sme to smj |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 1883:a | 1883 | 2 |
| speller does not suggest 1883 for 1883:a |
| Bq:a | Bq | 2 |
| speller does not suggest 1883 for 1883:a |
| NRK:a | NRK | 2 |
| speller does not suggest 1883 for 1883:a |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| åvdåsvásstet | non-existent word accepted | |||
| åvdåsvastet | åvdåsvásstet | 2 |
| non-existent word accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gájtjabájkka | gájtsabajkka | 2 |
| Missing suggestion when multiple errors across compound boundary |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Børde-Rene | name+name suggestions with double hyphens | |||
| Áile-Kaas | name+name suggestions with double hyphens | |||
| Myra-Land | name+name suggestions with double hyphens | |||
| BB-Rød-Rene | name+name suggestions with double hyphens |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 38jahkásasj | 38-jahkásasj | 1 |
| suggests 38jahkásasj for 38-jahkásasj |
| 38-jahkásasj | suggests 38jahkásasj for 38-jahkásasj |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| girkkomusijkkár | noun+Prop without hyphen | |||
| girkkomusijkkaRa | girkkomusijkkára | 2 |
| noun+Prop without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| jubmeldievnnoiellemij | word not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| vasstet | vásstet | 1 |
| suggestions involving vowel-replacement |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| lågenanguokta | lågenanguokte not recognized | |||
| lågenanguovtev | lågenanguokte not recognized | |||
| lågenanguovtes | lågenanguokte not recognized | |||
| lågenanguoktáj | lågenanguokte not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| tsåhkeLot | tsåhke-Lot | 1 |
| prefix+name as split comp without hyphen |
| iemeLinn | ieme-Linn | 1 |
| prefix+name as split comp without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| C-giella | C-giellan is not accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gålmmålåk | numeral attr:s on lot | |||
| guoktalåk | numeral attr:s on lot | |||
| vihttalåk | numeral attr:s on lot |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| sáme-dáro | Gen+hyph compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| NRKGE | NRK-EG | 2 |
| acro-compounds without hyphen |
| NRK-EG | acro-compounds without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Rájerasstimijn | actios and actors as second part of compound | |||
| rájijrasstimin | actios and actors as second part of compound | |||
| bargojduvddemis | actios and actors as second part of compound | |||
| bargogáddjon | actios and actors as second part of compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Bispadime-me-ráden | Bispadime-m-ráden | 1 |
| Bispadime-me-ráden |
| Åhpadim-me-konsulænnta | Åhpadim-m-konsulænnta | 1 |
| Bispadime-me-ráden |
| Åhpadis-me-konsulænnta | Åhpadis-m-konsulænnta | 1 |
| Bispadime-me-ráden |
| me-rijkaieŋŋilsgiellaj | Amerijka-ieŋŋilsgiellaj | 3 |
| Bispadime-me-ráden |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Lofåhtajguollima | Lofåhta-guollima | 1 |
| left-compound-tags affects proper-nouns too |
| Råmsåjdille | Råmså-dille | 1 |
| left-compound-tags affects proper-nouns too |
| Divtasvuonaahke | Divtasvuona-ahke | 1 |
| left-compound-tags affects proper-nouns too |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| oajvijda | åjvijda | 2 |
| dipht simpl oajvve>åjvijn |
| oajvijs | åjvijs | 2 |
| dipht simpl oajvve>åjvijn |
| oajvij | åjvij | 2 |
| dipht simpl oajvve>åjvijn |
| oajvijn | åjvijn | 2 |
| dipht simpl oajvve>åjvijn |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| åvtånálán | numerals and pronouns to NAMÁK and SASJ fails | |||
| gålmålágásj | numerals and pronouns to NAMÁK and SASJ fails | |||
| avtanálán | numerals and pronouns to NAMÁK and SASJ fails | |||
| åvtålágásj | numerals and pronouns to NAMÁK and SASJ fails | |||
| gávtsebiejvvásasj | numerals and pronouns to NAMÁK and SASJ fails | |||
| dakkirtjuodak | numerals and pronouns to NAMÁK and SASJ fails | |||
| gålmmålágásj | gålmålágásj | 1 |
| numerals and pronouns to NAMÁK and SASJ fails |
| vihttalágásj | vijddalágásj | 3 |
| numerals and pronouns to NAMÁK and SASJ fails |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gettsugadtjajbiejvve | gettsugattjajbiejvve | 1 |
| gen of ÅLLAGASJ-adj as first prt of cmp |
| sjimugadtjabiejvve | sjimugattjabiejvve | 1 |
| gen of ÅLLAGASJ-adj as first prt of cmp |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Ájluokta | lule sami names missing | |||
| Åmo | lule sami names missing | |||
| Skilkká | lule sami names missing | |||
| Ofuohttá | lule sami names missing | |||
| Gasska-Nordlánnda | lule sami names missing | |||
| Nordlánnda | lule sami names missing | |||
| Råmsså | lule sami names missing | |||
| Fuolldá | lule sami names missing | |||
| Voaga | lule sami names missing | |||
| Sálat | lule sami names missing | |||
| Suortta | lule sami names missing | |||
| Stuorgiedde | lule sami names missing | |||
| Sálatædno | lule sami names missing | |||
| Ruogŋo | lule sami names missing | |||
| Dundarevuobme | lule sami names missing | |||
| Diksná | lule sami names missing | |||
| Bálákvuovdde | lule sami names missing | |||
| Vindela | lule sami names missing | |||
| Accra | lule sami names missing |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| julev- | prefix + hyhpen does not get accepted | |||
| svieriga- | prefix + hyhpen does not get accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| kåntåvrråa-jådediddje | kåntåvrrå-Raa-jådediddje | 3 |
| a taking part of compound |
| kåntåvrråjådediddje | a taking part of compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 051-bágo |
| number compound, numbers starting with 0 |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Divtasvuona-Oarjjevuona | Prop gen + hyphen + Prop gen | |||
| Divtasvuona-Oarjevuona | Divtasvuona-Oarjjevuona | 1 |
| Prop gen + hyphen + Prop gen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| miereunnedimskåvve | +Left-marked words as middle part of compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Aktaalmasj | Akta almasj | 1 |
| numeral+noun compounds |
| guovtealmatja | guovte almatja | 1 |
| numeral+noun compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 1992-93:s | cased numeral+numeral compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 1700-LÅHKO |
| numerals+NOUN | ||
| 1700-Låhko | 1700-låhko | 1 |
| numerals+NOUN |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 0-nubbejådediddje | 0-ruobbejådediddje | 2 | unmotivated suggestions with numeral+noun | |
| 0-juojggamaárbbedábev | unmotivated suggestions with numeral+noun |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Kitaadábálasj | Kitaa-dábálasj | 1 |
| name+adj compounds without hyphen |
| Sirgesdábálasj | Sirges-dábálasj | 1 |
| name+adj compounds without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| badjeGájnaj | Badje-Gájnaj | 2 |
| noun prefix+name compound without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| nuppáj | nubbáj | 2 |
| pp>bb phonetic rule does not apply |
| rappe | rabbe | 2 |
| pp>bb phonetic rule does not apply |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| SABME | SÁBME | 1 |
| UPPERCASE-typos only get acronym-suggestions |
| SABMe | SÁBME | 2 |
| UPPERCASE-typos only get acronym-suggestions |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| saame | sáme | 2 |
| suggestions saame (suggestions Saame (with capital S) is proper so in that meaning this bug won't be fixed unless we remove the proper-noun) |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 3:sijn | caseforms, ordinals and collectives | |||
| 3:dijn | caseforms, ordinals and collectives |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| giehtjalåkvihtta | numeral-variants | |||
| gietjatjuodevihtta | numeral-variants | |||
| gáktsalågenantjuotakta | numeral-variants | |||
| vihttatjuot | numeral-variants | |||
| guok | numeral-variants | |||
| guoklåk | numeral-variants | |||
| guoktjuot | numeral-variants | |||
| guokduhát | numeral-variants | |||
| guoktuvsán | numeral-variants | |||
| lågenanguoktatjuodeguhttalåk | numeral-variants | |||
| nielljelåk | numeral-variants | |||
| giehtjalåkvitta | giehtjalåkvihtta | 1 |
| numeral-variants |
| gietjatjuodevitta | gietjatjuodevihtta | 1 |
| numeral-variants |
| gáktsalågeantjuotakta | gáktsalågenantjuotakta | 1 |
| numeral-variants |
| vittatjuot | vihttatjuot | 1 |
| numeral-variants |
| guk | guok | 1 |
| numeral-variants |
| guklåk | guoklåk | 1 |
| numeral-variants |
| guoktjut | guoktjuot | 1 |
| numeral-variants |
| guokduhat | guokduhát | 1 |
| numeral-variants |
| guoktuvsan | guoktuvsán | 1 |
| numeral-variants |
| lågeanguoktatjuodeguhttalåk | lågenanguoktatjuodeguhttalåk | 1 |
| numeral-variants |
| nelljelåk | nielljelåk | 1 |
| numeral-variants |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 21bieivi | 21-biejve | 3 |
| numeral compounds and cases |
| 10ijjs | 10-ijá | 3 |
| numeral compounds and cases |
| 1996:itd | 1996:jt | 2 |
| numeral compounds and cases |
| 10-biejvvásattja | 10-bæjvvásattja | 2 |
| numeral compounds and cases |
| 21-biejve | numeral compounds and cases | |||
| 10-ijá | numeral compounds and cases | |||
| 1996:jt | numeral compounds and cases | |||
| 10-bæjvvásattja | numeral compounds and cases |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| javlla-CD | noun-acro compounds | |||
| javlla-CDan | javlla-CD:n | 1 |
| noun-acro compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| åvtåmuora | åvtå muora | 1 |
| nom- and gen-numerals make compounds with nouns |
| guovtesuollu | guovte suollu | 1 |
| nom- and gen-numerals make compounds with nouns |
| gålmmågahper | gålmmå gahpera | 2 |
| nom- and gen-numerals make compounds with nouns |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| aktak | numerals + clitic | |||
| guoktek | numerals + clitic | |||
| guoktege | numerals + clitic | |||
| guovtege | numerals + clitic | |||
| nielljege | numerals + clitic | |||
| vidánge | numerals + clitic | |||
| guhttage | numerals + clitic | |||
| gietjavge | numerals + clitic | |||
| akttak | aktak | 1 |
| numerals + clitic |
| guoktetk | guoktek | 1 |
| numerals + clitic |
| guokteges | guoktege | 1 |
| numerals + clitic |
| guovttege | guovtege | 1 |
| numerals + clitic |
| njielljege | nielljege | 1 |
| numerals + clitic |
| vidange | vidánge | 1 |
| numerals + clitic |
| guhtage | guhttage | 1 |
| numerals + clitic |
| giehtjavge | gietjavge | 1 |
| numerals + clitic |
| tjuohtik | tjuohtek | 1 |
| numerals + clitic |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| madduj | mädduj | 1 |
| norm-fst does not recognize ä when u in sec. syll. |
| dabttjum | däbttjum | 1 |
| norm-fst does not recognize ä when u in sec. syll. |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| amuorra | a-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| Cmuorra | C-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| dmuorra | d-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| Emuorra | E-muorra | 1 |
| does not recognize alphabet-abbr+noun |
| a-muorra | does not recognize alphabet-abbr+noun | |||
| C-muorra | does not recognize alphabet-abbr+noun | |||
| d-muorra | does not recognize alphabet-abbr+noun | |||
| E-muorra | does not recognize alphabet-abbr+noun |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Seskaröcd | Seskarö-cd | 1 |
| Nouns/propers+acronyms |
| Berntv | Bern-tv | 1 |
| Nouns/propers+acronyms |
| muorraas | muorra-as | 1 | Nouns/propers+acronyms | |
| muorraNRK | muorra-NRK | 1 | Nouns/propers+acronyms | |
| NRK-Finnmarko | NRK-Finnmárko | 1 |
| Nouns/propers+acronyms |
| NSR-Lindon | NRK-London | 3 |
| Nouns/propers+acronyms |
| Seskarö-cd | Nouns/propers+acronyms | |||
| Bern-tv | Nouns/propers+acronyms | |||
| muorra-as | Nouns/propers+acronyms | |||
| muorra-NRK | Nouns/propers+acronyms | |||
| NRK-Finnmárko | Nouns/propers+acronyms | |||
| NRK-London | Nouns/propers+acronyms |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| ietjas | Case-forms of iesj not recognized | |||
| ietjanis | Case-forms of iesj not recognized | |||
| allasis | Case-forms of iesj not recognized | |||
| ietjajnis | Case-forms of iesj not recognized | |||
| itjas | ietjas | 1 |
| Case-forms of iesj not recognized |
| itjanis | ietjanis | 1 |
| Case-forms of iesj not recognized |
| alasis | allasis | 1 |
| Case-forms of iesj not recognized |
| itjajnis | ietjajnis | 1 |
| Case-forms of iesj not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| enujn | word not recognized | |||
| ænujn | enujn | 1 |
| word not recognized |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| rijkajuohkostjiektjamav | rijkajuogostjiektjamav | 2 | should not be accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| muorraguovggat | noun+adjective compounds | |||
| muorraguovggis | noun+adjective compounds | |||
| boahtemtjuovggat | noun+adjective compounds | |||
| boahtemtjuovggis | noun+adjective compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| boatsoj-sujtto |
| hyphens or not in compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| CV-s | CV:s | 1 | acronyms+cases | |
| CV'j | CV:j | 1 |
| acronyms+cases |
| NRK:ajn | NRK:jn | 1 |
| acronyms+cases |
| NRK-an | NRK:n | 2 |
| acronyms+cases |
| CV:s | acronyms+cases | |||
| CV:j | acronyms+cases | |||
| NRK:jn | acronyms+cases | |||
| NRK:n | acronyms+cases |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| polárdutkamin | polardutkamin | 1 |
| prefixes not working |
| Nuortalijguovlojn | Nuorttalijguovlojn | 1 |
| prefixes not working |
| polardutkamin | prefixes not working | |||
| Nuorttalijguovlojn | prefixes not working |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| NRK-NRK-NRK | acros compound to each other without hyphens | |||
| NRKNRKNRK | NRK-NRK-NRK | 2 |
| acros compound to each other without hyphens |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| moattejahkásasj | lot of adjectives missing | |||
| moattestávval | lot of adjectives missing | |||
| moattejienalasj | lot of adjectives missing | |||
| minimálla | lot of adjectives missing | |||
| miesska | lot of adjectives missing | |||
| mihá | lot of adjectives missing | |||
| stuorra | lot of adjectives missing | |||
| ådå | lot of adjectives missing | |||
| buorre | lot of adjectives missing | |||
| dábálasj | lot of adjectives missing | |||
| digitála | lot of adjectives missing | |||
| muottejahkásasj | moattejahkásasj | 2 |
| lot of adjectives missing |
| muottestávval | moattestávval | 2 |
| lot of adjectives missing |
| moattijienalasj | moattejienalasj | 1 |
| lot of adjectives missing |
| minimalla | minimálla | 1 |
| lot of adjectives missing |
| misska | miesska | 1 |
| lot of adjectives missing |
| miha | mihá | 1 |
| lot of adjectives missing |
| suorra | stuorra | 1 |
| lot of adjectives missing |
| ådo | ådå | 1 |
| lot of adjectives missing |
| buovre | buorre | 1 |
| lot of adjectives missing |
| dábalasj | dábálasj | 1 |
| lot of adjectives missing |
| digitala | digitála | 1 |
| lot of adjectives missing |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Almoduvváj | Almoduváj | 1 |
| uvváj and uvvát |
| aneduvváj | aneduváj | 1 |
| uvváj and uvvát |
| mujttaluvvát | mujttaluvvat | 1 |
| uvváj and uvvát |
| bielkeduvváj | bielkeduváj | 1 |
| uvváj and uvvát |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Åhpaditge-konsulænnta | Åhpadit-g-konsulænnta | 2 | inf+noun compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bullar1-dutkamin | bullar-1-dutkamin | 1 |
| numerals compound to nouns without hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 0-mujttalibmesáhka | 0-mujttalimesáhka | 1 | impossible compounds |
Testpairs not in bugs
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| barrnev | bárnev | 2 |
| |
| Galileajávrregáttev | Galilea-jávrregáttev | 1 |
| |
| oahppamdiededimev | actio compound | |||
| riegádimgånågis | actio compound | |||
| lånudim- | actio compound form | |||
| Skåvllådåjmadak | compound | |||
| Allaskåvllådåjmadagas | Allaskåvllådåjmadagás | 1 | compound with typo, no suggestion | |
| Barggomárnanin | Barggomárnánin | 1 |
| compound with typo, no suggestion |
| guhkaájggásattjat | lexicalized compound | |||
| jáddamahtes | missing actio forms | |||
| riegádibme | missing actio forms | |||
| vuolgadime | missing actio forms | |||
| åbmudaksiebrre | missing compound | |||
| Barggogaskostime | missing compound | |||
| sebrudakalmasj | missing compound | |||
| siellosujtov | missing compound | |||
| goalmát | missing ordinal | |||
| nuppev | missing ordinal | |||
| vuostasj | missing ordinal | |||
| Vuostasj | missing ordinal | |||
| gus | missing particles | |||
| Svierigadárogielan | name compound i Kintel och Korhon=vuonadár...med liten bok | |||
| guovtekultuvralasj | no hash at compound border | |||
| älla | strange suggestions |

