Speller Test Results for: «tmp/regression-pl-sme.txt»
Overview
Technical data
Language tested: sme
Document tested: tmp/regression-pl-sme.txt
Speller tool: Polderland command line tool
Speller lexicon version: Davvisámi, public beta 2, 2007-10-11
Test Date: 20071012
Test Type: regression
Result summary
Number of input words: 455
| Reality \ Speller view | Speller Positive (number of flagged words): 163 | Speller Negative (number of accepted words): 292 |
|---|---|---|
| Number of real errors: 175 | Number of true positives (detected real errors): 137 | Number of false negatives (unflagged spelling errors): 38 |
| Number of real correct words: 280 | Number of false positives (incorrectly flagged words): 26 | Number of true negatives (unflagged correct words): 254 |
- Precision (tp/(tp+fp)): 84.05 %
- Recall (tp/(tp+fn)): 78.29 %
- Accuracy ((tp+tn)/words): 85.93 %
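The three percentages can be recomputed from the four counts in the result summary. The following is a minimal sketch of that arithmetic; the function name `speller_metrics` is an assumption made for illustration and is not part of the test tooling.

```python
def speller_metrics(tp: int, fp: int, fn: int, tn: int) -> dict[str, float]:
    """Return precision, recall and accuracy as percentages."""
    words = tp + fp + fn + tn  # every input word falls in exactly one cell
    return {
        "precision": 100.0 * tp / (tp + fp),
        "recall": 100.0 * tp / (tp + fn),
        "accuracy": 100.0 * (tp + tn) / words,
    }


# Counts taken from the result summary of this report.
metrics = speller_metrics(tp=137, fp=26, fn=38, tn=254)
for name, value in metrics.items():
    print(f"{name}: {value:.2f} %")
```

With the counts from this run, the sketch gives the same 84.05 %, 78.29 % and 85.93 % listed in the report.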
Suggestion statistics
- Number of spelling errors with correct suggestion: 75
- Number of spelling errors with correct suggestion in top 5: 72
- Number of spelling errors with only wrong suggestions: 59
- Number of spelling errors with no suggestions at all: 3
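One way to arrive at these categories is to classify the suggestion list returned for each detected error. The sketch below assumes that "top 5" refers to the position of the expected correction in the list; the function name, the category labels and the placeholder suggestions (`w1`, `w2`, ...) are invented for illustration and are not output from the tested speller.

```python
from collections import Counter


def suggestion_outcome(expected: str, suggestions: list[str], top_n: int = 5) -> str:
    """Classify the suggestion list returned for one detected spelling error."""
    if not suggestions:
        return "no suggestions"
    if expected not in suggestions:
        return "only wrong suggestions"
    rank = suggestions.index(expected) + 1  # 1-based position of the correct form
    return "correct in top n" if rank <= top_n else "correct below top n"


# (expected correction, suggestions) pairs; the suggestion lists are invented
# placeholders, only the expected corrections come from the report.
detected = [
    ("mot", ["mot", "w1"]),
    ("visot", ["w1", "w2", "w3", "w4", "w5", "visot"]),
    ("lea", ["w1", "w2"]),
    ("ovdal", []),
]
counts = Counter(suggestion_outcome(exp, sugg) for exp, sugg in detected)
with_correct = counts["correct in top n"] + counts["correct below top n"]
print(with_correct, dict(counts))
```

Under this reading, the "correct suggestion in top 5" figure is a subset of the "correct suggestion" figure, and the three remaining categories add up to the number of true positives.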
Spelling error statistics
- Number of spelling errors : 175
- Number of simple spelling errors : 126
- Number of spelling errors with editing distance 2: 33
- Number of spelling errors with editing distance 3 or more: 15
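The editing distances in the tables below are consistent with plain Levenshtein distance (insertions, deletions and substitutions each cost 1). That the report generator uses exactly this measure is an assumption; the sketch only shows how the listed values can be reproduced for a few pairs taken from the report.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between a and b, computed row by row."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


# (input word, expected correction) pairs copied from the report.
pairs = [
    ("birrasit", "birrasiid"),
    ("jápmina", "jápmima"),
    ("addojupmimuorra", "addojupmi"),
]
for wrong, right in pairs:
    print(wrong, right, edit_distance(wrong, right))
```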
True positives (137)
Spelling errors with correct suggestions (75)
| Input word | Expected correction | Editing distance | Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| birrasit | birrasiid | 2 | | still wrong | |
| sámedikkepresideanta | sámediggepresideanta | 2 | | still wrong | |
| skuvlajagin | skuvlajagiin | 1 | | single error in compound - no sugg | |
| sihkarit | sihkkarit | 1 | | "sihkarit" is not marked | |
| vuoittohallamin | vuoittáhallamin | 1 | | still wrong | |
| jápmina | jápmima | 1 | | still wrong | |
| Lulli-Eurohpas | Lulli-Eurohpás | 1 | | no spelling suggestion | |
| nazisttain | nasisttain | 1 | | does not get marked | |
| jurdagiiid | jurdagiid | 1 | | 367 | |
| diktásiiid | diktásiid | 1 | | 367 | |
| čiekciin | čikčiin | 2 | | 376 | |
| guollibusse | guollebusse | 1 | | 397 | wrong compounding |
| Oslobiila | Oslo-biila | 1 | | 397 | name compounding |
| Harstadbiila | Harstad-biila | 1 | | 397 | name compounding |
| bargguitjuogadeapmi | bargguidjuogadeapmi | 1 | | 414 | wrong compound form on -t |
| Helssetvuođđoduvvon | Helsset-vuođđuduvvon | 2 | | 426 | comp words from Divvun.no |
| Geavaheaddjedokumentašuvdna | Geavaheaddjidokumentašuvdna | 1 | | 426 | comp words from Divvun.no |
| Romsaprošeavttat | Romsa-prošeavttat | 1 | | 426 | comp words from Divvun.no |
| geavaheaddjiide | geavaheddjiide | 1 | | 426 | comp words from Divvun.no |
| loahpahandáhtomiin | loahpahandáhtoniin | 1 | | 426 | comp words from Divvun.no |
| oidni | oinnii | 2 | | 429 | missing suggestion |
| SáMI | SÁMI | 1 | | 448 | wrong sugg if mostly upper case |
| vástideaddjiin | vástideddjiin | 1 | | 452 | several lexical bugs |
| buvttadeaddjiide | buvttadeddjiide | 1 | | 452 | several lexical bugs |
| oažžuin | ožžuin | 1 | | 452 | several lexical bugs |
| leažžago | leažžágo | 1 | | 461 | missing suggestion |
| Odná | Otná | 1 | | 461 | missing suggestion |
| láibbiit | láibbiid | 1 | | 461 | missing suggestion |
| viesot | visot | 1 | | 461 | missing suggestion |
| Distat | Disdat | 1 | | 461 | missing suggestion |
| distat | disdat | 1 | | 461 | missing suggestion |
| guovlui | guvlui | 1 | | 461 | missing suggestion |
| moht | mot | 1 | | 461 | missing suggestion |
| molssašumi | molsašumi | 1 | | 461 | missing suggestion |
| molssašupmi | molsašupmi | 1 | | 461 | missing suggestion |
| muhtim | muhtin | 1 | | 461 | missing suggestion |
| nuohtiin | nuhtiin | 1 | | 461 | missing suggestion |
| suorggiin | surggiin | 1 | | 461 | missing suggestion |
| niibbispiidni | niibespiidni | 2 | | 489 | does not follow compound tags |
| ránubiellu | rátnobiellu | 2 | | 489 | does not follow compound tags |
| stohpuspiidni | stohpospiidni | 1 | | 489 | lack of stem vowel shortening |
| balvvareaŋga | balvareaŋga | 1 | | 489 | does not follow compound tags |
| co2:ajn | co2:in | 2 | | 508 | accepts smj entries |
| VG:at | VG:t | 1 | | 508 | accepts SUB entries |
| 1883:as | 1883:s | 1 | | 508 | accepts SUB entries |
| 2003:as | 2003:s | 1 | | 508 | 2003:as does not get marked |
| 169:is | 169:s | 1 | | 508 | 169:is does not get marked |
| VG:aid | VG:id | 1 | | 508 | accepts SUB entries |
| VG:ii | VG:i | 1 | | 508 | accepts SUB entries |
| co2:j | co2:i | 1 | | 508 | accepts smj entries |
| 1883s | 1883:s | 1 | | 508 | accepts SUB entries, 2007-06-26 speller test report |
| 1883as | 1883:s | 1 | | 508 | accepts SUB entries |
| 2003as | 2003:s | 1 | | 508 | accepts SUB entries |
| milljovnna | miljovnna | 1 | | 508 | accepts SUB entries |
| stuorra-Oslolaš | stuorraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| stuora-Oslolaš | stuoraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| stuorra-Guovdageainnut | stuorraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| stuora-Guovdageainnut | stuoraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| stuor-oslolaš | stuoraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| Stuor-oslolaš | Stuoraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| stuor-guovdageainnut | stuoraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| Stuor-guovdageainnut | Stuoraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| stuorra-oslolaš | stuorraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| stuorraOslolaš | stuorraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| Stuorra-oslolaš | Stuorraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| stuora-oslolaš | stuoraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| stuoraOslolaš | stuoraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| Stuora-oslolaš | Stuoraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| stuorra-guovdageainnut | stuorraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| stuorraGuovdageainnut | stuorraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| Stuorra-guovdageainnut | Stuorraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| stuora-guovdageainnut | stuoraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| stuoraGuovdageainnut | stuoraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| Stuora-guovdageainnut | Stuoraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| beavddeguorra | beavdeguorra | 1 | | 539 | speller does not follow compound-tagging |
Spelling errors with only incorrect suggestions (59)
| Input word | Expected correction | Editing distance | Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| Sámedikkeáirrasin | sámediggeáirrasin | 3 | | does not get marked | |
| eahpenášunálalaččat | eahpenášuvnnalaččat | 3 | | eahpenášunálalaččat does not get marked | |
| oarje-eurohpai | Oarje-Eurohpái | 3 | | no spelling suggestion | |
| EO:a | EO | 2 | | | |
| nuppogeahčái | nuppe geahčái | 2 | | "nuppe" is marked | |
| nuppogeahčái | nuppi geahčái | 2 | | | |
| studentaavissas | studeantaaviissas | 2 | | multiple errors in compound - no sugg | |
| INTERREG:ii | INTERREG:i | 1 | | | |
| goldasii | goldásii | 1 | | still wrong | |
| guovttemiesát | guovttemiesat | 1 | | still wrong | |
| guššalemiin | giššalemiin | 1 | | still wrong | |
| ilgadeamis | ilgadeamos | 1 | | still wrong | |
| krimináláššiid | kriminálaáššiid | 1 | | missing suggestion | |
| skierranis | skieranis | 1 | | still wrong | |
| vakumas | vakuumas | 1 | | still wrong | |
| čorgedalla | čorgedallá | 1 | | still wrong | |
| divtásiiid | diktásiid | 2 | | 367 | SUBbed entry |
| divtásiid | diktásiid | 1 | | 367 | |
| X:ii | X:i | 1 | | 425 | other words from Divvun.no |
| heajus-Oslo | heajos-Oslo | 1 | | 431 | lowering in front of hyphen |
| álkis-Oslo | álkes-Oslo | 1 | | 431 | lowering in front of hyphen |
| Eanadalit | Eandalit | 1 | | 461 | missing suggestion |
| Julis | Julius | 1 | | 461 | missing suggestion |
| bidju | biedju | 1 | | 461 | missing suggestion |
| dohkkenit | dohkkehit | 1 | | 461 | missing suggestion |
| dáguid | dáiguid | 1 | | 461 | missing suggestion |
| dáhálaš | dehálaš | 1 | | 461 | missing suggestion |
| gádnat | gávdnat | 1 | | 461 | missing suggestion |
| la | lea | 1 | | 461 | missing suggestion |
| leal | leat | 1 | | 461 | missing suggestion |
| li | lei | 1 | | 461 | missing suggestion |
| maddái | maiddái | 1 | | 461 | missing suggestion |
| mutnos | munnos | 1 | | 461 | missing suggestion |
| málmmi | máilmmi | 1 | | 461 | missing suggestion |
| ovda | ovdal | 1 | | 461 | missing suggestion |
| ođđ | ođđa | 1 | | 461 | missing suggestion |
| romáhat | románat | 1 | | 461 | missing suggestion |
| ráði | ráđi | 1 | | 461 | missing suggestion |
| sáhkki | sáhkkái | 1 | | 461 | missing suggestion |
| vuhtui | vuhtii | 1 | | 461 | missing suggestion |
| deažžalakkis- | deažžalakkes- | 1 | | 489 | does not follow vowel shortening |
| Stuor-Guovdageainnut | Stuoraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| Stuor-Oslolaš | Stuoraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| StuorGuovdageainnut | Stuoraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| StuorOslolaš | Stuoraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| Stuora-Guovdageainnut | Stuoraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| Stuora-Oslolaš | Stuoraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| Stuorra-Guovdageainnut | Stuorraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| Stuorra-Oslolaš | Stuorraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| stuor-Guovdageainnut | stuoraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| stuor-Oslolaš | stuoraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| stuorGuovdageainnut | stuoraguovdageainnut | 2 | | 518 | nominal + Prop/Der compound |
| stuorOslolaš | stuoraoslolaš | 2 | | 518 | nominal + Prop/Der compound |
| StuoraGuovdageainnut | Stuoraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| StuoraOslolaš | Stuoraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| StuorraGuovdageainnut | Stuorraguovdageainnut | 1 | | 518 | nominal + Prop/Der compound |
| StuorraOslolaš | Stuorraoslolaš | 1 | | 518 | nominal + Prop/Der compound |
| addojupmimuorra | addojupmi | 6 | | 540 | more "impossible" compounds accepted |
| anistupmimuorra | anistupmi | 6 | | 540 | more "impossible" compounds accepted |
Spelling errors without suggestions (3)
False positives (26)
False positives with suggestions (25)
| Input word | Suggestions | Bug ID | Comment |
|---|---|---|---|
| BPA | | missing acros - they're in the lexicon | |
| ERUF | | 2007-06-26 speller test report | |
| ERUF:s | | | |
| šaddat-sánis | | 380 | nomin. + hyphen comp |
| julev- | | 408 | NounRoot -> Rreal not in discont. comp |
| muorraguollenáhkki | | 419 | 3-part compounds |
| X:i | | 425 | other words from Divvun.no |
| heajos-Oslo | | 431 | lowering in front of hyphen |
| lojes-Oslo | | 431 | adj attr+proper-noun, optional lowering |
| lojis-Oslo | | 431 | adj attr+proper-noun, optional lowering |
| álkes-Oslo | | 431 | lowering in front of hyphen |
| agimieljuohkáseapmi | | 452 | several lexical bugs |
| Eandalit | | 461 | missing suggestion |
| deažžalakkes- | | 489 | does not follow vowel shortening |
| suoluboazu | | 489 | does not follow vowel shortening |
| veahkkegoahtesilba | | 522 | strange compounding behaviour - 3-p, REJECTED |
| geađgeviessoláse | | 524 | long compounds not accepted |
| hilbesrávdojávri | | 524 | long compounds not accepted |
| šaddat-sánis | | 525 | nomin. + hyphen comp should never be suggested |
| almmustusaš | | 529 | diminutive of JOHTALAT words |
| erohusaš | | 529 | diminutive of JOHTALAT words |
| gonagasaš | | 529 | diminutive of JOHTALAT words |
| niibbidahkan | | 535 | Left-compound-tags in sme-lex do not work |
| suotjalanveahkki | | 536 | speller accepts "impossible" compound-forms |
| oktahatge | | 542 | does not accepts pronouns + clitic |
False positives without suggestions (1)
veahkkegoahtesilbageaidnu
False negatives (38)
| Input word | Expected correction | Editing distance | Bug ID | Comment |
|---|---|---|---|---|
| Duiskas | Duiskkas | 1 | still wrong | |
| eaktudáhtolaččat | eaktodáhtolaččat | 1 | does not get marked | |
| oahppuneavvuid | oahpponeavvuid | 1 | 397 | bad compounding |
| suopmasápmelaš | suomasápmelaš | 1 | 397 | bad compounding |
| guoktedássásaš | guovttedássásaš | 2 | 426 | comp words from Divvun.no |
| heajus- | heajos- | 1 | 431 | lowering in front of hyphen |
| lojis- | lojes- | 1 | 431 | lowering in front of hyphen |
| álkis- | álkes- | 1 | 431 | lowering in front of hyphen |
| suopmasápmelaš | suomasápmelaš | 1 | 449 | suopmasápmelaš comps wrongly accepted |
| biila-- | biila- | 1 | 458 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| nuvviid-- | nuvviid- | 1 | 458 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| beaivi- | beaive- | 1 | 489 | does not follow vowel shortening |
| boazu- | boazo- | 1 | 489 | does not follow vowel shortening |
| bákti- | bákte- | 1 | 489 | does not follow vowel shortening |
| bálddis- | bálddes- | 1 | 489 | does not follow vowel shortening |
| bálggis- | bálgges- | 1 | 489 | does not follow vowel shortening |
| doaris- | doares- | 1 | 489 | does not follow vowel shortening |
| goahti- | goahte- | 1 | 489 | does not follow vowel shortening |
| mális- | máles- | 1 | 489 | does not follow vowel shortening |
| niibispiidni | niibespiidni | 1 | 489 | lack of stem vowel shortening |
| oapmi- | oapme- | 1 | 489 | does not follow vowel shortening |
| ráhtis- | ráhtes- | 1 | 489 | does not follow vowel shortening |
| rátnubiellu | rátnobiellu | 1 | 489 | lack of stem vowel shortening |
| seamu- | seamo- | 1 | 489 | does not follow vowel shortening |
| suoloboazu | suoluboazu | 1 | 489 | does not follow vowel shortening |
| čuru- | čuro- | 1 | 489 | does not follow vowel shortening |
| loddi- | loddi- | 0 | 489 | does not follow vowel shortening |
| buoiddusteapmiveahkki | buoiddustanveahkki | 4 | 536 | speller accepts "impossible" compound-forms |
| geažideapmigárvu | geažidangárvu | 4 | 536 | speller accepts "impossible" compound-forms |
| giddesteapmisággi | giddestansággi | 4 | 536 | speller accepts "impossible" compound-forms |
| gárvodeapmimuorra | gárvodanmuorra | 4 | 536 | speller accepts "impossible" compound-forms |
| hárjáneapmisáhka | hárjánansáhka | 4 | 536 | speller accepts "impossible" compound-forms |
| lonuheapmibiila | lonuhanbiila | 4 | 536 | speller accepts "impossible" compound-forms |
| suotnjaleapmiveahkki | suotnjalanveahkki | 4 | 536 | speller accepts "impossible" compound-forms |
| sákkasteapmifierbmi | sákkastanfierbmi | 4 | 536 | speller accepts "impossible" compound-forms |
| bastilvuohtadovdu | bastilvuođadovdu | 2 | 536 | speller accepts "impossible" compound-forms |
| oktovuohtadovdu | oktovuođadovdu | 2 | 536 | speller accepts "impossible" compound-forms |
| luondugáldu | luonddugáldu | 1 | 536 | speller accepts "impossible" compound-forms |
True negatives (254)
The correctly accepted words are listed here for easy copy and paste into Word, to quickly check for regressions in new versions of the speller: no correctly spelled word accepted by an earlier speller should be rejected by a later one. To check, copy and paste these words into Word and see whether any of them get a red underline. Duplicate words are not repeated, unless one is followed by some punctuation. The words are reverse sorted according to Unicode character code.
čállindárkkistanprošeavtta čállindárkkistanprográmmat čállindárkkistanprográmmaid čuro- čikčiin Ávdnos Ánná Ánne Ánde Ámmon Áilu Áillun Áilloš Áillonieida Áillohaš Áillehašvádjájohka Áillehašnjunni Áili Áilegasčohkka Áilegasvárri Áilegasvuopmi Áilegasmuotki Áilegasjávri Áilegasjohka Áilegas Áile Áila Ábo vástideddjiin vástidanproseantage váldouksa váldorádji vuosttas vuolleleksikonat vuhtii visot sátnejuohkin sátnejuohkimis sákkastanfierbmi sáhkkái surggiin suomasápmelaš stuorraoslolaš stuorraguovdageainnut stuoraoslolaš stuoraguovdageainnut stohpospiidni silbageaidnu seamo- ráđi rátnobiellu ráhtes- románat ožžuin ođđasitčállinnjuolggadusaid ođđahistorismma ođđa ovdal olbmoš olbmo oktovuođadovdu okta oapme- oahpponeavvuid nuhtiin nubbi niibespiidni niibbiplánen niibbidilli nieiddaš máŋgasat máŋga mánáš máles- máilmmi muorralastagierraolmmoš muorralastagieraolmmoš muorralastagiera muorraguorra muoraidguorra munnos muitalanbargu muitalage muhtin mot monge molsašupmi molsašumi moktiige moktige moaddásat moadde miljovnna mašiidnakompiláhtora maiddái láikiriehpu láibi-sánis láibbiid luonddugáldu lonuhanbiila loahpahandáhtoniin lei leažžágo leat lea lastagieraolmmoš lastagiera lagusgoahti kompiláhtor jurdagiid juostá juosat juonin juoidá jna. itge interr. inge iige ii-goallostuvvon ihtanmuorra ihtaluddandávda iežáge iešguđetge iešguhtetge iešdieđiheapmi hárjánansáhka hupman-, gávdnat gárvodanmuorra guvlui guovttedássásaš guollebusse guokte guldalan-, golbma goahtesilbageaidnu goahtesilba goahte- gievrrasmuorra gievramuorra gieraolmmoš giellaguolli giddestansággi generatiivalaš geažidangárvu geavaheddjiide garrasepmosit fitnodatčatnosiin entusiástajoavkku ea. 
dáiguid dutnjege duottarguolli duodjeregisttarskovvi dohkkehit doares- disdat diktásiid diakritihkalaš dehálaš cugŋuiddávda co2:in co2:i bártnáš bárrastávvalsániin bálgges- báldáiddávda bálddes- bákte- buvttadeddjiide buoiddustanveahkki boazo- boasttogáldu boahtágo boahtáge biiggáš biedju bešš bellodat beavdeguorra beavddelokten beavddeahki beatnagaš beaive- bastilvuođadovdu bargoolmmái bargo-, balvareaŋga bajusdovdu bahkkejeaddjit arvvosmahttojupmi anistupmi almmuhanbargu alfaveršuvdna albmáidviessu albmabuktojupmi addojupmi VG:t VG:id VG:i Unicode-standárddas Unicode-doarjja Unicode Sátnejuohkin Sámirádioi Sámeálbmogiidsearvi Sámevuođahistorjá SÁMI SÁB Stuorraoslolaš Stuorraguovdageainnut Stuoraoslolaš Stuoraguovdageainnut Sisasáddejuvvon Romsa-prošeavttat Prográmmerárat PPC Otná Oslo-biila OpenOffice.orgii NSR Máŋgasat Máretgo Macintosh Länsmanis LO Kaplan Julius InDesign Helsset-vuođđuduvvon Harstad-biila Guovdageainnu-láđđi Gobby Geavaheaddjidokumentašuvdna Gažadanskovvi Finncomm Fink Disdat Bargo-, BETA Almmuhannjuolggadusat Airlines 2004:s 2003:s 1983:s 1883:s 169:s
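The same regression check can be scripted instead of pasted into Word. The sketch below assumes the accepted words of two speller runs are available as plain lists; the function name `find_regressions` and the "later" list are hypothetical, and only the three sample words come from the list above. Python compares strings by Unicode code point, so `reverse=True` gives the same ordering as this report.

```python
def find_regressions(accepted_before: list[str], accepted_now: list[str]) -> list[str]:
    """Words the earlier speller accepted that the later one no longer accepts."""
    lost = set(accepted_before) - set(accepted_now)
    return sorted(lost, reverse=True)  # reverse Unicode code point order


earlier = ["ovdal", "visot", "vuhtii"]  # sample of the true negatives above
later = ["visot"]                       # hypothetical result of a newer speller
print(find_regressions(earlier, later))
```

An empty result means no previously accepted word has been lost.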
Grouped by bug #
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| divtásiiid | diktásiid | 2 | | SUBbed entry |
| diktásiiid | diktásiid | 1 | | |
| divtásiid | diktásiid | 1 | | |
| jurdagiiid | jurdagiid | 1 | | |
| diktásiid | | | | |
| jurdagiid | | | | |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| čiekciin | čikčiin | 2 | | |
| čikčiin | | | | |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| láibi-sánis | | | | nomin. + hyphen comp |
| šaddat-sánis | | | | nomin. + hyphen comp |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| boahtáge | | | | verb + clitic |
| iige | | | | verb + clitic |
| inge | | | | verb + clitic |
| itge | | | | verb + clitic |
| muitalage | | | | verb + clitic |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Harstadbiila | Harstad-biila | 1 | | name compounding |
| Oslobiila | Oslo-biila | 1 | | name compounding |
| guollibusse | guollebusse | 1 | | wrong compounding |
| oahppuneavvuid | oahpponeavvuid | 1 | | bad compounding |
| suopmasápmelaš | suomasápmelaš | 1 | | bad compounding |
| Harstad-biila | | | | name compounding |
| Oslo-biila | | | | name compounding |
| bárrastávvalsániin | | | | 3-syll cons comp, act. 3-part comp |
| duottarguolli | | | | regular cons comp |
| giellaguolli | | | | regular vowel comp |
| guollebusse | | | | wrong compounding |
| oahpponeavvuid | | | | bad compounding |
| suomasápmelaš | | | | bad compounding |
| váldorádji | | | | comp with vowel shortening |
| váldouksa | | | | comp with vowel shortening |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Máŋgasat | | | | missing numerals |
| golbma | | | | missing numerals |
| guokte | | | | missing numerals |
| miljovnna | | | | missing numerals |
| moadde | | | | missing numerals |
| moaddásat | | | | missing numerals |
| máŋga | | | | missing numerals |
| máŋgasat | | | | missing numerals |
| okta | | | | missing numerals |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Airlines | | | | multiword names: Finncomm Airlines |
| Finncomm | | | | multiword names: Finncomm Airlines |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Bargo-, | | | | word + hyph + comma |
| bargo-, | | | | word + hyph + comma |
| guldalan-, | | | | word + hyph + comma |
| hupman-, | | | | word + hyph + comma |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| julev- | | | | NounRoot -> Rreal not in discont. comp |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bahkkejeaddjit | | | | missing derivation |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| garrasepmosit | | | | adj -> adv |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bargguitjuogadeapmi | bargguidjuogadeapmi | 1 | | wrong compound form on -t |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Máretgo | | | | nominals with clitic not accepted |
| boahtágo | | | | nominals with clitic not accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| muorraguollenáhkki | | | | 3-part compounds |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gievramuorra | | | | adj-nom+noun |
| gievrrasmuorra | | | | adj-attr+noun |
| láikiriehpu | | | | adj+noun compound |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| 1983:s | | | | infl. numerals not recognised |
| 2004:s | | | | infl. numerals not recognised |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Fink | | | | missing names |
| Gobby | | | | missing names |
| InDesign | | | | missing names |
| Kaplan | | | | missing names |
| Macintosh | | | | missing names |
| OpenOffice.orgii | | | | missing names |
| Unicode | | | | missing names |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| X:ii | X:i | 1 | | other words from Divvun.no |
| BETA | | | | other words from Divvun.no |
| PPC | | | | other words from Divvun.no |
| Prográmmerárat | | | | other words from Divvun.no |
| X:i | | | | other words from Divvun.no |
| alfaveršuvdna | | | | other words from Divvun.no |
| diakritihkalaš | | | | other words from Divvun.no |
| generatiivalaš | | | | other words from Divvun.no |
| kompiláhtor | | | | other words from Divvun.no |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Helssetvuođđoduvvon | Helsset-vuođđuduvvon | 2 | | comp words from Divvun.no |
| guoktedássásaš | guovttedássásaš | 2 | | comp words from Divvun.no |
| Geavaheaddjedokumentašuvdna | Geavaheaddjidokumentašuvdna | 1 | | comp words from Divvun.no |
| Romsaprošeavttat | Romsa-prošeavttat | 1 | | comp words from Divvun.no |
| geavaheaddjiide | geavaheddjiide | 1 | | comp words from Divvun.no |
| loahpahandáhtomiin | loahpahandáhtoniin | 1 | | comp words from Divvun.no |
| Geavaheaddjidokumentašuvdna | | | | comp words from Divvun.no |
| Helsset-vuođđuduvvon | | | | comp words from Divvun.no |
| Romsa-prošeavttat | | | | comp words from Divvun.no |
| Sátnejuohkin | | | | comp words from Divvun.no |
| Unicode-doarjja | | | | comp words from Divvun.no |
| Unicode-standárddas | | | | comp words from Divvun.no |
| alfaveršuvdna | | | | comp words from Divvun.no |
| entusiástajoavkku | | | | comp words from Divvun.no |
| geavaheddjiide | | | | comp words from Divvun.no |
| guovttedássásaš | | | | comp words from Divvun.no |
| ii-goallostuvvon | | | | comp words from Divvun.no |
| loahpahandáhtoniin | | | | comp words from Divvun.no |
| mašiidnakompiláhtora | | | | comp words from Divvun.no |
| ođđasitčállinnjuolggadusaid | | | | comp words from Divvun.no |
| sátnejuohkimis | | | | comp words from Divvun.no |
| sátnejuohkin | | | | comp words from Divvun.no |
| vuolleleksikonat | | | | comp words from Divvun.no |
| čállindárkkistanprográmmaid | | | | comp words from Divvun.no |
| čállindárkkistanprográmmat | | | | comp words from Divvun.no |
| čállindárkkistanprošeavtta | | | | comp words from Divvun.no |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| oidni | oinnii | 2 | | missing suggestion |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| heajus- | heajos- | 1 | | lowering in front of hyphen |
| heajus-Oslo | heajos-Oslo | 1 | | lowering in front of hyphen |
| lojis- | lojes- | 1 | | lowering in front of hyphen |
| álkis- | álkes- | 1 | | lowering in front of hyphen |
| álkis-Oslo | álkes-Oslo | 1 | | lowering in front of hyphen |
| heajos-Oslo | | | | lowering in front of hyphen |
| lojes-Oslo | | | | adj attr+proper-noun, optional lowering |
| lojis-Oslo | | | | adj attr+proper-noun, optional lowering |
| álkes-Oslo | | | | lowering in front of hyphen |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Ábo | | | | names beginning in Á |
| Áilu | | | | names beginning in Á |
| Ámmon | | | | names beginning in Á |
| Ánde | | | | names beginning in Á |
| Ánne | | | | names beginning in Á |
| Ánná | | | | names beginning in Á |
| Ávdnos | | | | names beginning in Á |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| SáMI | SÁMI | 1 | | wrong sugg if mostly upper case |
| SÁMI | | | | wrong sugg if mostly upper case |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| suopmasápmelaš | suomasápmelaš | 1 | | suopmasápmelaš comps wrongly accepted |
| suomasápmelaš | | | | suopmasápmelaš comps wrongly accepted |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Gažadanskovvi | | | | verb+noun, verb+verb |
| Sisasáddejuvvon | | | | verb+noun, verb+verb |
| vástidanproseantage | | | | verb+noun, verb+verb |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| buvttadeaddjiide | buvttadeddjiide | 1 | | several lexical bugs |
| oažžuin | ožžuin | 1 | | several lexical bugs |
| vástideaddjiin | vástideddjiin | 1 | | several lexical bugs |
| Länsmanis | | | | several lexical bugs |
| agimieljuohkáseapmi | | | | several lexical bugs |
| boasttogáldu | | | | several lexical bugs |
| buvttadeddjiide | | | | several lexical bugs |
| duodjeregisttarskovvi | | | | several lexical bugs |
| iešdieđiheapmi | | | | several lexical bugs |
| ožžuin | | | | several lexical bugs |
| vástideddjiin | | | | several lexical bugs |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| biila-- | biila- | 1 | | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| nuvviid-- | nuvviid- | 1 | | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Distat | Disdat | 1 | | missing suggestion |
| Eanadalit | Eandalit | 1 | | missing suggestion |
| Julis | Julius | 1 | | missing suggestion |
| Odná | Otná | 1 | | missing suggestion |
| bidju | biedju | 1 | | missing suggestion |
| distat | disdat | 1 | | missing suggestion |
| dohkkenit | dohkkehit | 1 | | missing suggestion |
| dáguid | dáiguid | 1 | | missing suggestion |
| dáhálaš | dehálaš | 1 | | missing suggestion |
| guovlui | guvlui | 1 | | missing suggestion |
| gádnat | gávdnat | 1 | | missing suggestion |
| la | lea | 1 | | missing suggestion |
| leal | leat | 1 | | missing suggestion |
| leažžago | leažžágo | 1 | | missing suggestion |
| li | lei | 1 | | missing suggestion |
| láibbiit | láibbiid | 1 | | missing suggestion |
| maddái | maiddái | 1 | | missing suggestion |
| moht | mot | 1 | | missing suggestion |
| molssašumi | molsašumi | 1 | | missing suggestion |
| molssašupmi | molsašupmi | 1 | | missing suggestion |
| muhtim | muhtin | 1 | | missing suggestion |
| mutnos | munnos | 1 | | missing suggestion |
| málmmi | máilmmi | 1 | | missing suggestion |
| nuohtiin | nuhtiin | 1 | | missing suggestion |
| ovda | ovdal | 1 | | missing suggestion |
| ođđ | ođđa | 1 | | missing suggestion |
| romáhat | románat | 1 | | missing suggestion |
| ráði | ráđi | 1 | | missing suggestion |
| suorggiin | surggiin | 1 | | missing suggestion |
| sáhkki | sáhkkái | 1 | | missing suggestion |
| viesot | visot | 1 | | missing suggestion |
| vuhtui | vuhtii | 1 | | missing suggestion |
| Disdat | | | | missing suggestion |
| Eandalit | | | | missing suggestion |
| Julius | | | | missing suggestion |
| Otná | | | | missing suggestion |
| biedju | | | | missing suggestion |
| dehálaš | | | | missing suggestion |
| disdat | | | | missing suggestion |
| dohkkehit | | | | missing suggestion |
| dáiguid | | | | missing suggestion |
| guvlui | | | | missing suggestion |
| gávdnat | | | | missing suggestion |
| lea | | | | missing suggestion |
| leat | | | | missing suggestion |
| leažžágo | | | | missing suggestion |
| lei | | | | missing suggestion |
| láibbiid | | | | missing suggestion |
| maiddái | | | | missing suggestion |
| moktige | | | | missing suggestion |
| moktiige | | | | missing suggestion |
| molsašumi | | | | missing suggestion |
| molsašupmi | | | | missing suggestion |
| mot | | | | missing suggestion |
| muhtin | | | | missing suggestion |
| munnos | | | | missing suggestion |
| máilmmi | | | | missing suggestion |
| nuhtiin | | | | missing suggestion |
| olbmo | | | | missing suggestion |
| olbmoš | | | | missing suggestion |
| ovdal | | | | missing suggestion |
| ođđa | | | | missing suggestion |
| románat | | | | missing suggestion |
| ráđi | | | | missing suggestion |
| surggiin | | | | missing suggestion |
| sáhkkái | | | | missing suggestion |
| visot | | | | missing suggestion |
| vuhtii | | | | missing suggestion |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Áila | | | | missing names |
| Áile | | | | missing names |
| Áilegas | | | | missing names |
| Áilegasjohka | | | | missing names |
| Áilegasjávri | | | | missing names |
| Áilegasmuotki | | | | missing names |
| Áilegasvuopmi | | | | missing names |
| Áilegasvárri | | | | missing names |
| Áilegasčohkka | | | | missing names |
| Áili | | | | missing names |
| Áillehašnjunni | | | | missing names |
| Áillehašvádjájohka | | | | missing names |
| Áillohaš | | | | missing names |
| Áillonieida | | | | missing names |
| Áilloš | | | | missing names |
| Áillun | | | | missing names |
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| juoidá | | | | missing pronouns |
| juonin | | | | missing pronouns |
| juosat | | | | missing pronouns |
| juostá | | | | missing pronouns |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| niibbispiidni | niibespiidni | 2 | | does not follow compound tags |
| ránubiellu | rátnobiellu | 2 | | does not follow compound tags |
| balvvareaŋga | balvareaŋga | 1 | | does not follow compound tags |
| beaivi- | beaive- | 1 | | does not follow vowel shortening |
| boazu- | boazo- | 1 | | does not follow vowel shortening |
| bákti- | bákte- | 1 | | does not follow vowel shortening |
| bálddis- | bálddes- | 1 | | does not follow vowel shortening |
| bálggis- | bálgges- | 1 | | does not follow vowel shortening |
| deažžalakkis- | deažžalakkes- | 1 | | does not follow vowel shortening |
| doaris- | doares- | 1 | | does not follow vowel shortening |
| goahti- | goahte- | 1 | | does not follow vowel shortening |
| mális- | máles- | 1 | | does not follow vowel shortening |
| niibispiidni | niibespiidni | 1 | | lack of stem vowel shortening |
| oapmi- | oapme- | 1 | | does not follow vowel shortening |
| ráhtis- | ráhtes- | 1 | | does not follow vowel shortening |
| rátnubiellu | rátnobiellu | 1 | | lack of stem vowel shortening |
| seamu- | seamo- | 1 | | does not follow vowel shortening |
| stohpuspiidni | stohpospiidni | 1 | | lack of stem vowel shortening |
| suoloboazu | suoluboazu | 1 | | does not follow vowel shortening |
| čuru- | čuro- | 1 | | does not follow vowel shortening |
| loddi- | loddi- | 0 | | does not follow vowel shortening |
| balvareaŋga | | | | does not follow compound tags |
| beaive- | | | | does not follow vowel shortening |
| boazo- | | | | does not follow vowel shortening |
| bákte- | | | | does not follow vowel shortening |
| bálddes- | | | | does not follow vowel shortening |
| bálgges- | | | | does not follow vowel shortening |
| deažžalakkes- | | | | does not follow vowel shortening |
| doares- | | | | does not follow vowel shortening |
| goahte- | | | | does not follow vowel shortening |
| loddi- | | | | does not follow vowel shortening |
| máles- | | | | does not follow vowel shortening |
| niibespiidni | | | | does not follow compound tags |
| oapme- | | | | does not follow vowel shortening |
| ráhtes- | | | | does not follow vowel shortening |
| rátnobiellu | | | | does not follow compound tags |
| seamo- | | | | does not follow vowel shortening |
| stohpospiidni | | | | does not follow compound tags |
| suoluboazu | | | | does not follow vowel shortening |
| čuro- | | | | does not follow vowel shortening |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bellodat | | | | regressions |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| co2:ajn | co2:in | 2 | | accepts smj entries |
| 169:is | 169:s | 1 | | 169:is does not get marked |
| 1883:as | 1883:s | 1 | | accepts SUB entries |
| 1883as | 1883:s | 1 | | accepts SUB entries |
| 1883s | 1883:s | 1 | | accepts SUB entries, 2007-06-26 speller test report |
| 2003:as | 2003:s | 1 | | 2003:as does not get marked |
| 2003as | 2003:s | 1 | | accepts SUB entries |
| VG:aid | VG:id | 1 | | accepts SUB entries |
| VG:at | VG:t | 1 | | accepts SUB entries |
| VG:ii | VG:i | 1 | | accepts SUB entries |
| co2:j | co2:i | 1 | | accepts smj entries |
| milljovnna | miljovnna | 1 | | accepts SUB entries |
| 169:s | | | | 169:is does not get marked |
| 1883:s | | | | accepts SUB entries, 2007-06-26 speller test report |
| 2003:s | | | | 2003:as does not get marked |
| VG:i | | | | accepts SUB entries |
| VG:id | | | | accepts SUB entries |
| VG:t | | | | accepts SUB entries |
| co2:i | | | | accepts smj entries |
| co2:in | | | | accepts smj entries |
| miljovnna | | | | accepts SUB entries |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Stuor-Guovdageainnut | Stuoraguovdageainnut | 2 | | nominal + Prop/Der compound |
| Stuor-Oslolaš | Stuoraoslolaš | 2 | | nominal + Prop/Der compound |
| StuorGuovdageainnut | Stuoraguovdageainnut | 2 | | nominal + Prop/Der compound |
| StuorOslolaš | Stuoraoslolaš | 2 | | nominal + Prop/Der compound |
| Stuora-Guovdageainnut | Stuoraguovdageainnut | 2 | | nominal + Prop/Der compound |
| Stuora-Oslolaš | Stuoraoslolaš | 2 | | nominal + Prop/Der compound |
| Stuorra-Guovdageainnut | Stuorraguovdageainnut | 2 | | nominal + Prop/Der compound |
| Stuorra-Oslolaš | Stuorraoslolaš | 2 | | nominal + Prop/Der compound |
| stuor-Guovdageainnut | stuoraguovdageainnut | 2 | | nominal + Prop/Der compound |
| stuor-Oslolaš | stuoraoslolaš | 2 | | nominal + Prop/Der compound |
| stuorGuovdageainnut | stuoraguovdageainnut | 2 | | nominal + Prop/Der compound |
| stuorOslolaš | stuoraoslolaš | 2 | | nominal + Prop/Der compound |
| stuora-Guovdageainnut | stuoraguovdageainnut | 2 | | nominal + Prop/Der compound |
| stuora-Oslolaš | stuoraoslolaš | 2 | | nominal + Prop/Der compound |
| stuorra-Guovdageainnut | stuorraguovdageainnut | 2 | | nominal + Prop/Der compound |
| stuorra-Oslolaš | stuorraoslolaš | 2 | | nominal + Prop/Der compound |
| Stuor-guovdageainnut | Stuoraguovdageainnut | 1 | | nominal + Prop/Der compound |
| Stuor-oslolaš | Stuoraoslolaš | 1 | | nominal + Prop/Der compound |
| Stuora-guovdageainnut | Stuoraguovdageainnut | 1 | | nominal + Prop/Der compound |
| Stuora-oslolaš | Stuoraoslolaš | 1 | | nominal + Prop/Der compound |
| StuoraGuovdageainnut | Stuoraguovdageainnut | 1 | | nominal + Prop/Der compound |
| StuoraOslolaš | Stuoraoslolaš | 1 | | nominal + Prop/Der compound |
| Stuorra-guovdageainnut | Stuorraguovdageainnut | 1 | | nominal + Prop/Der compound |
| Stuorra-oslolaš | Stuorraoslolaš | 1 | | nominal + Prop/Der compound |
| StuorraGuovdageainnut | Stuorraguovdageainnut | 1 | | nominal + Prop/Der compound |
| StuorraOslolaš | Stuorraoslolaš | 1 | | nominal + Prop/Der compound |
| stuor-guovdageainnut | stuoraguovdageainnut | 1 | | nominal + Prop/Der compound |
| stuor-oslolaš | stuoraoslolaš | 1 | | nominal + Prop/Der compound |
| stuora-guovdageainnut | stuoraguovdageainnut | 1 | | nominal + Prop/Der compound |
| stuora-oslolaš | stuoraoslolaš | 1 | | nominal + Prop/Der compound |
| stuoraGuovdageainnut | stuoraguovdageainnut | 1 | | nominal + Prop/Der compound |
| stuoraOslolaš | stuoraoslolaš | 1 | | nominal + Prop/Der compound |
| stuorra-guovdageainnut | stuorraguovdageainnut | 1 | | nominal + Prop/Der compound |
| stuorra-oslolaš | stuorraoslolaš | 1 | | nominal + Prop/Der compound |
| stuorraGuovdageainnut | stuorraguovdageainnut | 1 | | nominal + Prop/Der compound |
| stuorraOslolaš | stuorraoslolaš | 1 | | nominal + Prop/Der compound |
| Stuoraguovdageainnut | | | | nominal + Prop/Der compound |
| Stuoraoslolaš | | | | nominal + Prop/Der compound |
| Stuorraguovdageainnut | | | | nominal + Prop/Der compound |
| Stuorraoslolaš | | | | nominal + Prop/Der compound |
| stuoraguovdageainnut | | | | nominal + Prop/Der compound |
| stuoraoslolaš | | | | nominal + Prop/Der compound |
| stuorraguovdageainnut | | | | nominal + Prop/Der compound |
| stuorraoslolaš | | | | nominal + Prop/Der compound |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bešš | | | | r9 and š9 not defined |
| interr. | | | | r9 and š9 not defined |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| gieraolmmoš | | | | strange compounding behaviour - 2-p, ACCEPTED |
| goahtesilba | | | | strange compounding behaviour - 2-p, ACCEPTED |
| goahtesilbageaidnu | | | | strange compounding behaviour - 3-p, ACCEPTED |
| lastagiera | | | | strange compounding behaviour - 2-p, ACCEPTED |
| lastagieraolmmoš | | | | strange compounding behaviour - 3-p, REJECTED |
| muorralastagiera | | | | strange compounding behaviour - 3-p, ACCEPTED |
| muorralastagieraolmmoš | | | | strange compounding behaviour - 4-p, REJECTED |
| muorralastagierraolmmoš | | | | strange compounding behaviour - 4-p, ACCEPTED |
| silbageaidnu | | | | strange compounding behaviour - 2-p, ACCEPTED |
| veahkkegoahtesilba | | | | strange compounding behaviour - 3-p, REJECTED |
| veahkkegoahtesilbageaidnu | | | | strange compounding behaviour - 4-p, REJECTED |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| geađgeviessoláse | | | | long compounds not accepted |
| hilbesrávdojávri | | | | long compounds not accepted |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| láibi-sánis | | | | nomin. + hyphen comp should never be suggested |
| šaddat-sánis | | | | nomin. + hyphen comp should never be suggested |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| almmustusaš | | | | diminutive of JOHTALAT words |
| beatnagaš | | | | diminutive of JOHTALAT words |
| biiggáš | | | | diminutive of JOHTALAT words |
| bártnáš | | | | diminutive of JOHTALAT words |
| erohusaš | | | | diminutive of JOHTALAT words |
| gonagasaš | | | | diminutive of JOHTALAT words |
| mánáš | | | | diminutive of JOHTALAT words |
| nieiddaš | | | | diminutive of JOHTALAT words |
| olbmoš | | | | diminutive of JOHTALAT words |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| beavddeahki | | | | Left-compound-tags in sme-lex do not work |
| beavddelokten | | | | Left-compound-tags in sme-lex do not work |
| niibbidahkan | | | | Left-compound-tags in sme-lex do not work |
| niibbidilli | | | | Left-compound-tags in sme-lex do not work |
| niibbiplánen | | | | Left-compound-tags in sme-lex do not work |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| buoiddusteapmiveahkki | buoiddustanveahkki | 4 | | speller accepts "impossible" compound-forms |
| geažideapmigárvu | geažidangárvu | 4 | | speller accepts "impossible" compound-forms |
| giddesteapmisággi | giddestansággi | 4 | | speller accepts "impossible" compound-forms |
| gárvodeapmimuorra | gárvodanmuorra | 4 | | speller accepts "impossible" compound-forms |
| hárjáneapmisáhka | hárjánansáhka | 4 | | speller accepts "impossible" compound-forms |
| lonuheapmibiila | lonuhanbiila | 4 | | speller accepts "impossible" compound-forms |
| suotnjaleapmiveahkki | suotnjalanveahkki | 4 | | speller accepts "impossible" compound-forms |
| sákkasteapmifierbmi | sákkastanfierbmi | 4 | | speller accepts "impossible" compound-forms |
| bastilvuohtadovdu | bastilvuođadovdu | 2 | | speller accepts "impossible" compound-forms |
| oktovuohtadovdu | oktovuođadovdu | 2 | | speller accepts "impossible" compound-forms |
| luondugáldu | luonddugáldu | 1 | | speller accepts "impossible" compound-forms |
| bastilvuođadovdu | | | | speller accepts "impossible" compound-forms |
| buoiddustanveahkki | | | | speller accepts "impossible" compound-forms |
| geažidangárvu | | | | speller accepts "impossible" compound-forms |
| giddestansággi | | | | speller accepts "impossible" compound-forms |
| gárvodanmuorra | | | | speller accepts "impossible" compound-forms |
| hárjánansáhka | | | | speller accepts "impossible" compound-forms |
| lonuhanbiila | | | | speller accepts "impossible" compound-forms |
| luonddugáldu | | | | speller accepts "impossible" compound-forms |
| oktovuođadovdu | | | | speller accepts "impossible" compound-forms |
| suotjalanveahkki | | | | speller accepts "impossible" compound-forms |
| sákkastanfierbmi | | | | speller accepts "impossible" compound-forms |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| beavddeguorra | beavdeguorra | 1 | | speller does not follow compound-tagging |
| beavdeguorra | | | | speller does not follow compound-tagging |
| muoraidguorra | | | | speller does not follow compound-tagging |
| muorraguorra | | | | speller does not follow compound-tagging |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| addojupmimuorra | addojupmi | 6 | | more "impossible" compounds accepted |
| albmabuktojupmimuorra | albmabuktojupmi | 6 | | more "impossible" compounds accepted |
| anistupmimuorra | anistupmi | 6 | | more "impossible" compounds accepted |
| arvvosmahttojupmimuorra | arvvosmahttojupmi | 6 | | more "impossible" compounds accepted |
| addojupmi | | | | more "impossible" compounds accepted |
| albmabuktojupmi | | | | more "impossible" compounds accepted |
| anistupmi | | | | more "impossible" compounds accepted |
| arvvosmahttojupmi | | | | more "impossible" compounds accepted |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| bajusdovdu | | | | does not recognize compounds |
| lagusgoahti | | | | does not recognize compounds |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| dutnjege | | | | does not accept pronouns + clitic |
| iešguhtetge | | | | does not accept pronouns + clitic |
| iešguđetge | | | | does not accept pronouns + clitic |
| iežáge | | | | does not accept pronouns + clitic |
| monge | | | | does not accept pronouns + clitic |
| oktahatge | | | | does not accept pronouns + clitic |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| ihtaluddandávda | | | | plural actios do not form compounds |
| ihtanmuorra | | | | plural actios do not form compounds |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| albmáidviessu | | | | does not follow cmp-tagging, contracted nouns |
| báldáiddávda | | | | does not follow cmp-tagging, contracted nouns |
| cugŋuiddávda | | | | does not follow cmp-tagging, contracted nouns |

Testpairs not in bugs

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| EO:a | EO | 2 | | |
| ERUF:s | | | | |
| INTERREG:ii | INTERREG:i | 1 | | |
| nuppogeahčái | nuppi geahčái | 2 | | |
| ođđahistorismma | | | | |
| nuppogeahčái | nuppe geahčái | 2 | | "nuppe" is marked |
| sihkarit | sihkkarit | 1 | | "sihkarit" is not marked |
| ERUF | | | | 2007-06-26 speller test report |
| fitnodatčatnosiin | | | | 2007-06-26 speller test report |
| Sámeálbmogiidsearvi | | | | Compounds marked PlGen |
| Sámevuođahistorjá | | | | Compounds marked SgGen |
| Guovdageainnu-láđđi | | | | Compounds with hyphens |
| Almmuhannjuolggadusat | | | | actio compound |
| almmuhanbargu | | | | actio compound |
| muitalanbargu | | | | actio compound |
| Sámedikkeáirrasin | sámediggeáirrasin | 3 | | does not get marked |
| eaktudáhtolaččat | eaktodáhtolaččat | 1 | | does not get marked |
| nazisttain | nasisttain | 1 | | does not get marked |
| eahpenášunálalaččat | eahpenášuvnnalaččat | 3 | | eahpenášunálalaččat does not get marked |
| jna. | | | | missing abbreviations |
| ea. | | | | missing abbreviations (they are in the LexC source files) |
| BPA | | | | missing acros - they're in the lexicon |
| LO | | | | missing acros - they're in the lexicon |
| NSR | | | | missing acros - they're in the lexicon |
| SÁB | | | | missing acros - they're in the lexicon |
| nubbi | | | | missing ordinal |
| vuosttas | | | | missing ordinal |
| krimináláššiid | kriminálaáššiid | 1 | | missing suggestion |
| studentaavissas | studeantaaviissas | 2 | | multiple errors in compound - no sugg |
| Lulli-Eurohpas | Lulli-Eurohpás | 1 | | no spelling suggestion |
| oarje-eurohpai | Oarje-Eurohpái | 3 | | no spelling suggestion |
| Máttá-Trøndelágas | Mátta-Trøndelagas | 2 | | no suggestion |
| bargoolmmái | | | | nom compounding with vowel shortening |
| skuvlajagin | skuvlajagiin | 1 | | single error in compound - no sugg |
| Duiskas | Duiskkas | 1 | | still wrong |
| birrasit | birrasiid | 2 | | still wrong |
| goldasii | goldásii | 1 | | still wrong |
| guovttemiesát | guovttemiesat | 1 | | still wrong |
| guššalemiin | giššalemiin | 1 | | still wrong |
| ilgadeamis | ilgadeamos | 1 | | still wrong |
| jápmina | jápmima | 1 | | still wrong |
| skierranis | skieranis | 1 | | still wrong |
| sámedikkepresideanta | sámediggepresideanta | 2 | | still wrong |
| vakumas | vakuumas | 1 | | still wrong |
| vuoittohallamin | vuoittáhallamin | 1 | | still wrong |
| čorgedalla | čorgedallá | 1 | | still wrong |
| Sámirádioi | sápmi+N+SgGenCmp |