Speller Test Results for: «tmp/regression-pl-sme.txt»
Overview
Technical data
Language tested: sme
Document tested: tmp/regression-pl-sme.txt
Speller tool: Polderland command line tool
Speller lexicon version: Davvisámi, public beta 2, 2007-10-03
Test Date: 20071004
Test Type: regression
Result summary
Number of input words: 295
| Speller view | |||
|---|---|---|---|
| Speller Positive (number of flagged words): 162 |
Speller Negative (number of accepted words): 133 |
||
| Reality | Number of real errors: 142 |
Number of true positives (detected real errors): 133 |
Number of false negatives (unflagged spelling errors): 9 |
| Number of real correct words: 153 |
Number of false positives (incorrectly flagged words): 29 |
Number of true negatives (unflagged correct words): 124 |
|
- Precision (tp/(tp+fp)):
- 82.1 %
- Recall (tp/(tp+fn)):
- 93.66 %
- Accuracy ((tp+tn)/words):
- 87.12 %
Suggestion statistics
- Number of spelling errors with correct suggestion: 64
- Number of spelling errors with correct suggestion in top 5: 62
- Number of spelling errors with only wrong suggestions: 67
- Number of spelling errors with no suggestions at all: 1
Spelling error statistics
- Number of spelling errors : 142
- Number of simple spelling errors : 109
- Number of spelling errors with editing distance 2: 30
- Number of spelling errors with editing distance 3 or more: 3
True positives (133)
Spelling errors with correct suggestions (64)
| Input word |
Expected correction |
Editing distance |
Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| birrasit | birrasiid | 2 |
|
still wrong | |
| sámedikkepresideanta | sámediggepresideanta | 2 |
|
still wrong | |
| 1883:as | 1883:s | 1 |
|
||
| 2003:as | 2003:s | 1 |
|
2003:as does not get marked | |
| 169:is | 169:s | 1 |
|
169:is does not get marked | |
| 1883s | 1883:s | 1 |
|
2007-06-26 speller test report | |
| 1883as | 1883:s | 1 |
|
||
| skuvlajagin | skuvlajagiin | 1 |
|
single error in compound - no sugg | |
| 2003as | 2003:s | 1 |
|
still wrong | |
| sihkarit | sihkkarit | 1 |
|
"sihkarit" is not marked | |
| vuoittohallamin | vuoittáhallamin | 1 |
|
still wrong | |
| jápmina | jápmima | 1 |
|
still wrong | |
| Lulli-Eurohpas | Lulli-Eurohpás | 1 |
|
no spelling suggestion | |
| milljovnna | miljovnna | 1 |
|
still wrong | |
| eaktudáhtolaččat | eaktodáhtolaččat | 1 |
|
does not get marked | |
| nazisttain | nasisttain | 1 |
|
does not get marked | |
| jurdagiiid | jurdagiid | 1 |
|
367 | |
| čiekciin | čikčiin | 2 |
|
376 | |
| Oslobiila | Oslo-biila | 1 |
|
397 | name compounding |
| Harstadbiila | Harstad-biila | 1 |
|
397 | name compounding |
| oahppuneavvuid | oahpponeavvuid | 1 |
|
397 | bad compounding |
| bargguitjuogadeapmi | bargguidjuogadeapmi | 1 |
|
414 | wrong compound form on -t |
| Helssetvuođđoduvvon | Helsset-vuođđuduvvon | 2 |
|
426 | comp words from Divvun.no |
| guoktedássásaš | guovttedássásaš | 2 |
|
426 | comp words from Divvun.no |
| Geavaheaddjedokumentašuvdna | Geavaheaddjidokumentašuvdna | 1 |
|
426 | comp words from Divvun.no |
| Romsaprošeavttat | Romsa-prošeavttat | 1 |
|
426 | comp words from Divvun.no |
| geavaheaddjiide | geavaheddjiide | 1 |
|
426 | comp words from Divvun.no |
| loahpahandáhtomiin | loahpahandáhtoniin | 1 |
|
426 | comp words from Divvun.no |
| oidni | oinnii | 2 |
|
429 | missing suggestion |
| SáMI | SÁMI | 1 |
|
448 | wrong sugg if mostly upper case |
| vástideaddjiin | vástideddjiin | 1 |
|
452 | several lexical bugs |
| buvttadeaddjiide | buvttadeddjiide | 1 |
|
452 | several lexical bugs |
| oažžuin | ožžuin | 1 |
|
452 | several lexical bugs |
| leažžago | leažžágo | 1 |
|
461 | missing suggestion |
| Odná | Otná | 1 |
|
461 | missing suggestion |
| láibbiit | láibbiid | 1 |
|
461 | missing suggestion |
| viesot | visot | 1 |
|
461 | missing suggestion |
| Distat | Disdat | 1 |
|
461 | missing suggestion |
| distat | disdat | 1 |
|
461 | missing suggestion |
| guovlui | guvlui | 1 |
|
461 | missing suggestion |
| moht | mot | 1 |
|
461 | missing suggestion |
| molssašumi | molsašumi | 1 |
|
461 | missing suggestion |
| molssašupmi | molsašupmi | 1 |
|
461 | missing suggestion |
| muhtim | muhtin | 1 |
|
461 | missing suggestion |
| nuohtiin | nuhtiin | 1 |
|
461 | missing suggestion |
| suorggiin | surggiin | 1 |
|
461 | missing suggestion |
| niibispiidni | niibespiidni | 1 |
|
489 | lack of stem vowel shortening |
| rátnubiellu | rátnobiellu | 1 |
|
489 | lack of stem vowel shortening |
| stohpuspiidni | stohpospiidni | 1 |
|
489 | lack of stem vowel shortening |
| co2:ajn | co2:in | 2 |
|
508 | accepts smj entries |
| VG:at | VG:t | 1 |
|
508 | accepts SUB entries |
| VG:aid | VG:id | 1 |
|
508 | accepts SUB entries |
| VG:ii | VG:i | 1 |
|
508 | accepts SUB entries |
| co2:j | co2:i | 1 |
|
508 | accepts smj entries |
| stuorra-Oslolaš | stuorraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| stuora-Oslolaš | stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| stuor-oslolaš | stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| Stuor-oslolaš | Stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuorra-oslolaš | stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuorraOslolaš | stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| Stuorra-oslolaš | Stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuora-oslolaš | stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuoraOslolaš | stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| Stuora-oslolaš | Stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
Spelling errors with only incorrect suggestions (68)
| Input word |
Expected correction |
Editing distance |
Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| Sámedikkeáirrasin | sámediggeáirrasin | 3 |
|
does not get marked | |
| eahpenášunálalaččat | eahpenášuvnnalaččat | 3 |
|
eahpenášunálalaččat does not get marked | |
| oarje-eurohpai | Oarje-Eurohpái | 3 |
|
no spelling suggestion | |
| EO:a | EO | 2 |
|
||
| nuppogeahčái | nuppe geahčái | 2 |
|
"nuppe" is marked | |
| nuppogeahčái | nuppi geahčái | 2 |
|
||
| studentaavissas | studeantaaviissas | 2 |
|
multiple errors in compound - no sugg | |
| INTERREG:ii | INTERREG:i | 1 |
|
||
| goldasii | goldásii | 1 |
|
still wrong | |
| guovttemiesát | guovttemiesat | 1 |
|
still wrong | |
| guššalemiin | giššalemiin | 1 |
|
still wrong | |
| ilgadeamis | ilgadeamos | 1 |
|
still wrong | |
| krimináláššiid | kriminálaáššiid | 1 |
|
missing suggestion | |
| skierranis | skieranis | 1 |
|
still wrong | |
| vakumas | vakuumas | 1 |
|
still wrong | |
| čorgedalla | čorgedallá | 1 |
|
still wrong | |
| divtásiiid | divtásiid | 1 |
|
367 | |
| guollibusse | guollebusse | 1 |
|
397 | wrong compounding |
| suopmasápmelaš | suomasápmelaš | 1 |
|
397 | bad compounding |
| X:ii | X:i | 1 |
|
425 | other words from Divvun.no |
| heajus-Oslo | heajos-Oslo | 1 |
|
431 | lowering in front of hyphen |
| álkis-Oslo | álkes-Oslo | 1 |
|
431 | lowering in front of hyphen |
| suopmasápmelaš | suomasápmelaš | 1 |
|
449 | suopmasápmelaš comps wrongly accepted |
| Eanadalit | Eandalit | 1 |
|
461 | missing suggestion |
| Julis | Julius | 1 |
|
461 | missing suggestion |
| bidju | biedju | 1 |
|
461 | missing suggestion |
| dohkkenit | dohkkehit | 1 |
|
461 | missing suggestion |
| dáguid | dáiguid | 1 |
|
461 | missing suggestion |
| dáhálaš | dehálaš | 1 |
|
461 | missing suggestion |
| gádnat | gávdnat | 1 |
|
461 | missing suggestion |
| la | lea | 1 |
|
461 | missing suggestion |
| leal | leat | 1 |
|
461 | missing suggestion |
| li | lei | 1 |
|
461 | missing suggestion |
| maddái | maiddái | 1 |
|
461 | missing suggestion |
| mutnos | munnos | 1 |
|
461 | missing suggestion |
| málmmi | máilmmi | 1 |
|
461 | missing suggestion |
| ovda | ovdal | 1 |
|
461 | missing suggestion |
| ođđ | ođđa | 1 |
|
461 | missing suggestion |
| romáhat | románat | 1 |
|
461 | missing suggestion |
| ráði | ráđi | 1 |
|
461 | missing suggestion |
| sáhkki | sáhkkái | 1 |
|
461 | missing suggestion |
| vuhtui | vuhtii | 1 |
|
461 | missing suggestion |
| Stuor-Guovdageainnut | Stuoraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| Stuor-Oslolaš | Stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| StuorGuovdageainnut | Stuoraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| StuorOslolaš | Stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| Stuora-Guovdageainnut | Stuoraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| Stuora-Oslolaš | Stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| Stuorra-Guovdageainnut | Stuorraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| Stuorra-Oslolaš | Stuorraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| stuor-Guovdageainnut | stuoraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| stuor-Oslolaš | stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| stuorGuovdageainnut | stuoraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| stuorOslolaš | stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| stuora-Guovdageainnut | stuoraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| stuorra-Guovdageainnut | stuorraguovdageainnut | 2 |
|
518 | nominal + Prop/Der compound |
| Stuor-guovdageainnut | Stuoraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| Stuora-guovdageainnut | Stuoraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| StuoraGuovdageainnut | Stuoraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| StuoraOslolaš | Stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| Stuorra-guovdageainnut | Stuorraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| StuorraGuovdageainnut | Stuorraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| StuorraOslolaš | Stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuor-guovdageainnut | stuoraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| stuora-guovdageainnut | stuoraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| stuoraGuovdageainnut | stuoraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| stuorra-guovdageainnut | stuorraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
| stuorraGuovdageainnut | stuorraguovdageainnut | 1 |
|
518 | nominal + Prop/Der compound |
Spelling errors without suggestions (1)
False positives (29)
False positives with suggestions (25)
| Input word |
Suggestions | Bug ID | Comment |
|---|---|---|---|
| BPA |
|
missing acros - they're in the lexicon | |
| ERUF |
|
2007-06-26 speller test report | |
| ERUF:s |
|
||
| Sámeálbmogiidsearvi |
|
Compounds marked PlGen | |
| láikiriehpu |
|
adj+noun compound | |
| ođđahistorismma |
|
||
| šaddat-sánis |
|
380 | nomin. + hyphen comp |
| bárrastávvalsániin |
|
397 | 3-syll cons comp, act. 3-part comp |
| duottarguolli |
|
397 | regular cons comp |
| julev- |
|
408 | NounRoot -> Rreal not in discont. comp |
| muorraguollenáhkki |
|
419 | 3-part compounds |
| gievramuorra |
|
421 | adj-nom+noun |
| gievrrasmuorra |
|
421 | adj-attr+noun |
| lojes-Oslo |
|
431 | adj attr+proper-noun, optional lowering |
| lojis-Oslo |
|
431 | adj attr+proper-noun, optional lowering |
| agimieljuohkáseapmi |
|
452 | several lexical bugs |
| boasttogáldu |
|
452 | several lexical bugs |
| interr |
|
520 | r9 and š9 not defined |
| goahtesilba |
|
522 | strange compounding behaviour - 2-p, ACCEPTED |
| goahtesilbageaidnu |
|
522 | strange compounding behaviour - 3-p, ACCEPTED |
| lastagieraolmmoš |
|
522 | strange compounding behaviour - 3-p, REJECTED |
| muorralastagiera |
|
522 | strange compounding behaviour - 3-p, ACCEPTED |
| veahkkegoahtesilba |
|
522 | strange compounding behaviour - 3-p, REJECTED |
| geađgeviessoláse |
|
524 | long compounds not accepted |
| šaddat-sánis |
|
525 | nomin. + hyphen comp should never be suggested |
False positives without suggestions (4)
veahkkegoahtesilbageaidnu muorralastagierraolmmoš muorralastagieraolmmoš hilbesrávdojávri
False negatives (9)
| Input word |
Expected correction |
Editing distance |
Bug ID | Comment |
|---|---|---|---|---|
| Duiskas | Duiskkas | 1 | still wrong | |
| heajus- | heajos- | 1 | 431 | lowering in front of hyphen |
| lojis- | lojes- | 1 | 431 | lowering in front of hyphen |
| álkis- | álkes- | 1 | 431 | lowering in front of hyphen |
| biila-- | biila- | 1 | 458 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| nuvviid-- | nuvviid- | 1 | 458 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| niibbispiidni | niibespiidni | 2 | 489 | does not follow compound tags |
| ránubiellu | rátnobiellu | 2 | 489 | does not follow compound tags |
| balvvareaŋga | balvareaŋga | 1 | 489 | does not follow compound tags |
True negatives (124)
The correctly accepted words are listed here for easy copy&paste into Word to quick check regressions in new versions of the speller: No correctly spelled words accepted by an earlier speller should be rejected by later spellers. To check, just copy and paste these words to Word, and see if you get any red underlines. Duplicate words are not repeated, unless one is followed by some punctuation. The words are reverse sorted according to Unicode character code.
čállindárkkistanprošeavtta čállindárkkistanprográmmat čállindárkkistanprográmmaid Ávdnos Ánná Ánne Ánde Ámmon Áilu Áillun Áilloš Áillonieida Áillohaš Áillehašvádjájohka Áillehašnjunni Áili Áilegasčohkka Áilegasvárri Áilegasvuopmi Áilegasmuotki Áilegasjávri Áilegasjohka Áilegas Áile Áila Ábo vástidanproseantage váldouksa váldorádji vuosttas vuolleleksikonat sátnejuohkin sátnejuohkimis stuorraoslolaš stuoraoslolaš stohpospiidni silbageaidnu rátnobiellu ođđasitčállinnjuolggadusaid olbmoš olbmo okta nubbi niibespiidni máŋgasat máŋga muitalanbargu muitalage moktiige moktige moaddásat moadde miljovnna mašiidnakompiláhtora láibi-sánis lastagiera kompiláhtor juostá juosat juonin juoidá jna. itge inge iige ii-goallostuvvon iešdieđiheapmi hupman-, guokte guldalan-, golbma gieraolmmoš giellaguolli generatiivalaš garrasepmosit fitnodatčatnosiin entusiástajoavkku ea. duodjeregisttarskovvi diakritihkalaš boahtágo boahtáge bešš bellodat bargoolmmái bargo-, balvareaŋga bahkkejeaddjit almmuhanbargu alfaveršuvdna Unicode-standárddas Unicode-doarjja Unicode Sátnejuohkin Sámirádioi Sámevuođahistorjá SÁB Stuorraoslolaš Stuoraoslolaš Sisasáddejuvvon Prográmmerárat PPC OpenOffice.orgii NSR Máŋgasat Máretgo Macintosh Länsmanis LO Kaplan InDesign Guovdageainnu-láđđi Gobby Gažadanskovvi Finncomm Fink Bargo-, BETA Almmuhannjuolggadusat Airlines 2004:s 1983:s
Grouped by bug #
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| divtásiiid | divtásiid | 1 |
|
|
| jurdagiiid | jurdagiid | 1 |
|
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| čiekciin | čikčiin | 2 |
|
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| láibi-sánis | nomin. + hyphen comp | |||
| šaddat-sánis |
|
nomin. + hyphen comp |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| boahtáge | verb + clitic | |||
| iige | verb + clitic | |||
| inge | verb + clitic | |||
| itge | verb + clitic | |||
| muitalage | verb + clitic |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Harstadbiila | Harstad-biila | 1 |
|
name compounding |
| Oslobiila | Oslo-biila | 1 |
|
name compounding |
| guollibusse | guollebusse | 1 |
|
wrong compounding |
| oahppuneavvuid | oahpponeavvuid | 1 |
|
bad compounding |
| suopmasápmelaš | suomasápmelaš | 1 |
|
bad compounding |
| bárrastávvalsániin |
|
3-syll cons comp, act. 3-part comp | ||
| duottarguolli |
|
regular cons comp | ||
| giellaguolli | regular vowel comp | |||
| váldorádji | comp with vowel shortening | |||
| váldouksa | comp with vowel shortening |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Máŋgasat | missing numerals | |||
| golbma | missing numerals | |||
| guokte | missing numerals | |||
| miljovnna | missing numerals | |||
| moadde | missing numerals | |||
| moaddásat | missing numerals | |||
| máŋga | missing numerals | |||
| máŋgasat | missing numerals | |||
| okta | missing numerals |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Airlines | multiword names: Finncomm Airlines | |||
| Finncomm | multiword names: Finncomm Airlines |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Bargo-, | word + hyph + comma | |||
| bargo-, | word + hyph + comma | |||
| guldalan-, | word + hyph + comma | |||
| hupman-, | word + hyph + comma |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| julev- |
|
NounRoot -> Rreal not in discont. comp |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| bahkkejeaddjit | missing derivation |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| garrasepmosit | adj -> adv |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| bargguitjuogadeapmi | bargguidjuogadeapmi | 1 |
|
wrong compound form on -t |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Máretgo | nominals with clitic not accepted | |||
| boahtágo | nominals with clitic not accepted |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| muorraguollenáhkki |
|
3-part compounds |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| gievramuorra |
|
adj-nom+noun | ||
| gievrrasmuorra |
|
adj-attr+noun |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| 1983:s | infl. numerals not recognised | |||
| 2004:s | infl. numerals not recognised |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Fink | missing names | |||
| Gobby | missing names | |||
| InDesign | missing names | |||
| Kaplan | missing names | |||
| Macintosh | missing names | |||
| OpenOffice.orgii | missing names | |||
| Unicode | missing names |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| X:ii | X:i | 1 |
|
other words from Divvun.no |
| BETA | other words from Divvun.no | |||
| PPC | other words from Divvun.no | |||
| Prográmmerárat | other words from Divvun.no | |||
| alfaveršuvdna | other words from Divvun.no | |||
| diakritihkalaš | other words from Divvun.no | |||
| generatiivalaš | other words from Divvun.no | |||
| kompiláhtor | other words from Divvun.no |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Helssetvuođđoduvvon | Helsset-vuođđuduvvon | 2 |
|
comp words from Divvun.no |
| guoktedássásaš | guovttedássásaš | 2 |
|
comp words from Divvun.no |
| Geavaheaddjedokumentašuvdna | Geavaheaddjidokumentašuvdna | 1 |
|
comp words from Divvun.no |
| Romsaprošeavttat | Romsa-prošeavttat | 1 |
|
comp words from Divvun.no |
| geavaheaddjiide | geavaheddjiide | 1 |
|
comp words from Divvun.no |
| loahpahandáhtomiin | loahpahandáhtoniin | 1 |
|
comp words from Divvun.no |
| Sátnejuohkin | comp words from Divvun.no | |||
| Unicode-doarjja | comp words from Divvun.no | |||
| Unicode-standárddas | comp words from Divvun.no | |||
| alfaveršuvdna | comp words from Divvun.no | |||
| entusiástajoavkku | comp words from Divvun.no | |||
| ii-goallostuvvon | comp words from Divvun.no | |||
| mašiidnakompiláhtora | comp words from Divvun.no | |||
| ođđasitčállinnjuolggadusaid | comp words from Divvun.no | |||
| sátnejuohkimis | comp words from Divvun.no | |||
| sátnejuohkin | comp words from Divvun.no | |||
| vuolleleksikonat | comp words from Divvun.no | |||
| čállindárkkistanprográmmaid | comp words from Divvun.no | |||
| čállindárkkistanprográmmat | comp words from Divvun.no | |||
| čállindárkkistanprošeavtta | comp words from Divvun.no |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| oidni | oinnii | 2 |
|
missing suggestion |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| heajus- | heajos- | 1 | lowering in front of hyphen | |
| heajus-Oslo | heajos-Oslo | 1 |
|
lowering in front of hyphen |
| lojis- | lojes- | 1 | lowering in front of hyphen | |
| álkis- | álkes- | 1 | lowering in front of hyphen | |
| álkis-Oslo | álkes-Oslo | 1 |
|
lowering in front of hyphen |
| lojes-Oslo |
|
adj attr+proper-noun, optional lowering | ||
| lojis-Oslo |
|
adj attr+proper-noun, optional lowering |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Ábo | names beginning in Á | |||
| Áilu | names beginning in Á | |||
| Ámmon | names beginning in Á | |||
| Ánde | names beginning in Á | |||
| Ánne | names beginning in Á | |||
| Ánná | names beginning in Á | |||
| Ávdnos | names beginning in Á |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| SáMI | SÁMI | 1 |
|
wrong sugg if mostly upper case |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| suopmasápmelaš | suomasápmelaš | 1 |
|
suopmasápmelaš comps wrongly accepted |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Gažadanskovvi | verb+noun, verb+verb | |||
| Sisasáddejuvvon | verb+noun, verb+verb | |||
| vástidanproseantage | verb+noun, verb+verb |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| buvttadeaddjiide | buvttadeddjiide | 1 |
|
several lexical bugs |
| oažžuin | ožžuin | 1 |
|
several lexical bugs |
| vástideaddjiin | vástideddjiin | 1 |
|
several lexical bugs |
| Länsmanis | several lexical bugs | |||
| agimieljuohkáseapmi |
|
several lexical bugs | ||
| boasttogáldu |
|
several lexical bugs | ||
| duodjeregisttarskovvi | several lexical bugs | |||
| iešdieđiheapmi | several lexical bugs |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| biila-- | biila- | 1 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens | |
| nuvviid-- | nuvviid- | 1 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Distat | Disdat | 1 |
|
missing suggestion |
| Eanadalit | Eandalit | 1 |
|
missing suggestion |
| Julis | Julius | 1 |
|
missing suggestion |
| Odná | Otná | 1 |
|
missing suggestion |
| bidju | biedju | 1 |
|
missing suggestion |
| distat | disdat | 1 |
|
missing suggestion |
| dohkkenit | dohkkehit | 1 |
|
missing suggestion |
| dáguid | dáiguid | 1 |
|
missing suggestion |
| dáhálaš | dehálaš | 1 |
|
missing suggestion |
| guovlui | guvlui | 1 |
|
missing suggestion |
| gádnat | gávdnat | 1 |
|
missing suggestion |
| la | lea | 1 |
|
missing suggestion |
| leal | leat | 1 |
|
missing suggestion |
| leažžago | leažžágo | 1 |
|
missing suggestion |
| li | lei | 1 |
|
missing suggestion |
| láibbiit | láibbiid | 1 |
|
missing suggestion |
| maddái | maiddái | 1 |
|
missing suggestion |
| moht | mot | 1 |
|
missing suggestion |
| molssašumi | molsašumi | 1 |
|
missing suggestion |
| molssašupmi | molsašupmi | 1 |
|
missing suggestion |
| muhtim | muhtin | 1 |
|
missing suggestion |
| mutnos | munnos | 1 |
|
missing suggestion |
| málmmi | máilmmi | 1 |
|
missing suggestion |
| nuohtiin | nuhtiin | 1 |
|
missing suggestion |
| ovda | ovdal | 1 |
|
missing suggestion |
| ođđ | ođđa | 1 |
|
missing suggestion |
| romáhat | románat | 1 |
|
missing suggestion |
| ráði | ráđi | 1 |
|
missing suggestion |
| suorggiin | surggiin | 1 |
|
missing suggestion |
| sáhkki | sáhkkái | 1 |
|
missing suggestion |
| viesot | visot | 1 |
|
missing suggestion |
| vuhtui | vuhtii | 1 |
|
missing suggestion |
| moktige | missing suggestion | |||
| moktiige | missing suggestion | |||
| olbmo | missing suggestion | |||
| olbmoš | missing suggestion |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Áila | missing names | |||
| Áile | missing names | |||
| Áilegas | missing names | |||
| Áilegasjohka | missing names | |||
| Áilegasjávri | missing names | |||
| Áilegasmuotki | missing names | |||
| Áilegasvuopmi | missing names | |||
| Áilegasvárri | missing names | |||
| Áilegasčohkka | missing names | |||
| Áili | missing names | |||
| Áillehašnjunni | missing names | |||
| Áillehašvádjájohka | missing names | |||
| Áillohaš | missing names | |||
| Áillonieida | missing names | |||
| Áilloš | missing names | |||
| Áillun | missing names |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| juoidá | missing pronouns | |||
| juonin | missing pronouns | |||
| juosat | missing pronouns | |||
| juostá | missing pronouns |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| niibbispiidni | niibespiidni | 2 | does not follow compound tags | |
| ránubiellu | rátnobiellu | 2 | does not follow compound tags | |
| balvvareaŋga | balvareaŋga | 1 | does not follow compound tags | |
| niibispiidni | niibespiidni | 1 |
|
lack of stem vowel shortening |
| rátnubiellu | rátnobiellu | 1 |
|
lack of stem vowel shortening |
| stohpuspiidni | stohpospiidni | 1 |
|
lack of stem vowel shortening |
| balvareaŋga | does not follow compound tags | |||
| niibespiidni | does not follow compound tags | |||
| rátnobiellu | does not follow compound tags | |||
| stohpospiidni | does not follow compound tags |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| bellodat | regressions |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| co2:ajn | co2:in | 2 |
|
accepts smj entries |
| VG:aid | VG:id | 1 |
|
accepts SUB entries |
| VG:at | VG:t | 1 |
|
accepts SUB entries |
| VG:ii | VG:i | 1 |
|
accepts SUB entries |
| co2:j | co2:i | 1 |
|
accepts smj entries |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Stuor-Guovdageainnut | Stuoraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| Stuor-Oslolaš | Stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| StuorGuovdageainnut | Stuoraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| StuorOslolaš | Stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| Stuora-Guovdageainnut | Stuoraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| Stuora-Oslolaš | Stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| Stuorra-Guovdageainnut | Stuorraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| Stuorra-Oslolaš | Stuorraoslolaš | 2 |
|
nominal + Prop/Der compound |
| stuor-Guovdageainnut | stuoraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| stuor-Oslolaš | stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| stuorGuovdageainnut | stuoraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| stuorOslolaš | stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| stuora-Guovdageainnut | stuoraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| stuora-Oslolaš | stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| stuorra-Guovdageainnut | stuorraguovdageainnut | 2 |
|
nominal + Prop/Der compound |
| stuorra-Oslolaš | stuorraoslolaš | 2 |
|
nominal + Prop/Der compound |
| Stuor-guovdageainnut | Stuoraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| Stuor-oslolaš | Stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| Stuora-guovdageainnut | Stuoraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| Stuora-oslolaš | Stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| StuoraGuovdageainnut | Stuoraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| StuoraOslolaš | Stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| Stuorra-guovdageainnut | Stuorraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| Stuorra-oslolaš | Stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| StuorraGuovdageainnut | Stuorraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| StuorraOslolaš | Stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuor-guovdageainnut | stuoraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| stuor-oslolaš | stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuora-guovdageainnut | stuoraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| stuora-oslolaš | stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuoraGuovdageainnut | stuoraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| stuoraOslolaš | stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuorra-guovdageainnut | stuorraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| stuorra-oslolaš | stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuorraGuovdageainnut | stuorraguovdageainnut | 1 |
|
nominal + Prop/Der compound |
| stuorraOslolaš | stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| Stuoraoslolaš | nominal + Prop/Der compound | |||
| Stuorraoslolaš | nominal + Prop/Der compound | |||
| stuoraoslolaš | nominal + Prop/Der compound | |||
| stuorraoslolaš | nominal + Prop/Der compound |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| bešš | r9 and š9 not defined | |||
| interr |
|
r9 and š9 not defined |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| gieraolmmoš | strange compounding behaviour - 2-p, ACCEPTED | |||
| goahtesilba |
|
strange compounding behaviour - 2-p, ACCEPTED | ||
| goahtesilbageaidnu |
|
strange compounding behaviour - 3-p, ACCEPTED | ||
| lastagiera | strange compounding behaviour - 2-p, ACCEPTED | |||
| lastagieraolmmoš |
|
strange compounding behaviour - 3-p, REJECTED | ||
| muorralastagiera |
|
strange compounding behaviour - 3-p, ACCEPTED | ||
| muorralastagieraolmmoš |
|
strange compounding behaviour - 4-p, REJECTED | ||
| muorralastagierraolmmoš |
|
strange compounding behaviour - 4-p, ACCEPTED | ||
| silbageaidnu | strange compounding behaviour - 2-p, ACCEPTED | |||
| veahkkegoahtesilba |
|
strange compounding behaviour - 3-p, REJECTED | ||
| veahkkegoahtesilbageaidnu |
|
strange compounding behaviour - 4-p, REJECTED |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| geađgeviessoláse |
|
long compounds not accepted | ||
| hilbesrávdojávri |
|
long compounds not accepted |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| láibi-sánis | nomin. + hyphen comp should never be suggested | |||
| šaddat-sánis |
|
nomin. + hyphen comp should never be suggested |
Testpairs not in bugs
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| 1883:as | 1883:s | 1 |
|
|
| 1883as | 1883:s | 1 |
|
|
| EO:a | EO | 2 |
|
|
| ERUF:s |
|
|||
| INTERREG:ii | INTERREG:i | 1 |
|
|
| nuppogeahčái | nuppi geahčái | 2 |
|
|
| ođđahistorismma |
|
|||
| nuppogeahčái | nuppe geahčái | 2 |
|
"nuppe" is marked |
| sihkarit | sihkkarit | 1 |
|
"sihkarit" is not marked |
| 169:is | 169:s | 1 |
|
169:is does not get marked |
| 2003:as | 2003:s | 1 |
|
2003:as does not get marked |
| 1883s | 1883:s | 1 |
|
2007-06-26 speller test report |
| ERUF |
|
2007-06-26 speller test report | ||
| fitnodatčatnosiin | 2007-06-26 speller test report | |||
| Sámeálbmogiidsearvi |
|
Compounds marked PlGen | ||
| Sámevuođahistorjá | Compounds marked SgGen | |||
| Guovdageainnu-láđđi | Compounds with hyphens | |||
| Almmuhannjuolggadusat | actio compound | |||
| almmuhanbargu | actio compound | |||
| muitalanbargu | actio compound | |||
| láikiriehpu |
|
adj+noun compound | ||
| Sámedikkeáirrasin | sámediggeáirrasin | 3 |
|
does not get marked |
| eaktudáhtolaččat | eaktodáhtolaččat | 1 |
|
does not get marked |
| nazisttain | nasisttain | 1 |
|
does not get marked |
| eahpenášunálalaččat | eahpenášuvnnalaččat | 3 |
|
eahpenášunálalaččat does not get marked |
| jna. | missing abbreviations | |||
| ea. | missing abbreviations (they are in the LexC source files) | |||
| BPA |
|
missing acros - they're in the lexicon | ||
| LO | missing acros - they're in the lexicon | |||
| NSR | missing acros - they're in the lexicon | |||
| SÁB | missing acros - they're in the lexicon | |||
| nubbi | missing ordinal | |||
| vuosttas | missing ordinal | |||
| krimináláššiid | kriminálaáššiid | 1 |
|
missing suggestion |
| studentaavissas | studeantaaviissas | 2 |
|
multiple errors in compound - no sugg |
| Lulli-Eurohpas | Lulli-Eurohpás | 1 |
|
no spelling suggestion |
| oarje-eurohpai | Oarje-Eurohpái | 3 |
|
no spelling suggestion |
| Máttá-Trøndelágas | Mátta-Trøndelagas | 2 |
|
no suggestion |
| bargoolmmái | nom compounding with vowel shortening | |||
| skuvlajagin | skuvlajagiin | 1 |
|
single error in compound - no sugg |
| 2003as | 2003:s | 1 |
|
still wrong |
| Duiskas | Duiskkas | 1 | still wrong | |
| birrasit | birrasiid | 2 |
|
still wrong |
| goldasii | goldásii | 1 |
|
still wrong |
| guovttemiesát | guovttemiesat | 1 |
|
still wrong |
| guššalemiin | giššalemiin | 1 |
|
still wrong |
| ilgadeamis | ilgadeamos | 1 |
|
still wrong |
| jápmina | jápmima | 1 |
|
still wrong |
| milljovnna | miljovnna | 1 |
|
still wrong |
| skierranis | skieranis | 1 |
|
still wrong |
| sámedikkepresideanta | sámediggepresideanta | 2 |
|
still wrong |
| vakumas | vakuumas | 1 |
|
still wrong |
| vuoittohallamin | vuoittáhallamin | 1 |
|
still wrong |
| čorgedalla | čorgedallá | 1 |
|
still wrong |
| Sámirádioi | sápmi+N+SgGenCmp |