Speller Test Results for: «tmp/sme-regression.txt»
Overview
Technical data
Language tested: sme
Document tested: tmp/sme-regression.txt
Speller tool: Polderland command line tool
Speller lexicon version: Davvisámi, public beta 2, 2007-09-19
Test Date: 20070920
Test Type: regression
Result summary
Number of input words: 261
| Speller view | |||
|---|---|---|---|
| Speller Positive (number of flagged words): 146 |
Speller Negative (number of accepted words): 115 |
||
| Reality | Number of real errors: 124 |
Number of true positives (detected real errors): 101 |
Number of false negatives (unflagged spelling errors): 23 |
| Number of real correct words: 137 |
Number of false positives (incorrectly flagged words): 45 |
Number of true negatives (unflagged correct words): 92 |
|
- Precision (tp/(tp+fp)):
- 69.18 %
- Recall (tp/(tp+fn)):
- 81.45 %
- Accuracy ((tp+tn)/words):
- 73.95 %
Suggestion statistics
- Number of spelling errors with correct suggestion: 28
- Number of spelling errors with correct suggestion in top 5: 27
- Number of spelling errors with only wrong suggestions: 69
- Number of spelling errors with no suggestions at all: 2
Spelling error statistics
- Number of spelling errors : 124
- Number of simple spelling errors : 100
- Number of spelling errors with editing distance 2: 21
- Number of spelling errors with editing distance 3 or more: 3
True positives (101)
Spelling errors with correct suggestions (28)
| Input word |
Expected correction |
Editing distance |
Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| guššalemiin | giššalemiin | 1 |
|
still wrong | |
| 169:is | 169:s | 1 |
|
169:is does not get marked | |
| 1883s | 1883:s | 1 |
|
2007-06-26 speller test report | |
| 1883as | 1883:s | 1 |
|
||
| 1883:as | 1883:s | 1 |
|
||
| skuvlajagin | skuvlajagiin | 1 |
|
single error in compound - no sugg | |
| 2003as | 2003:s | 1 |
|
still wrong | |
| sihkarit | sihkkarit | 1 |
|
"sihkarit" is not marked | |
| vuoittohallamin | vuoittáhallamin | 1 |
|
still wrong | |
| jápmina | jápmima | 1 |
|
still wrong | |
| milljovnna | miljovnna | 1 |
|
still wrong | |
| 2003:as | 2003:s | 1 |
|
2003:as does not get marked | |
| eaktudáhtolaččat | eaktodáhtolaččat | 1 |
|
does not get marked | |
| jurdagiiid | jurdagiid | 1 |
|
367 | |
| Harstadbiila | Harstad-biila | 1 |
|
397 | name compounding |
| oahppuneavvuid | oahpponeavvuid | 1 |
|
397 | bad compounding |
| bargguitjuogadeapmi | bargguidjuogadeapmi | 1 |
|
414 | wrong compound form on -t |
| Helssetvuođđoduvvon | Helsset-vuođđuduvvon | 2 |
|
426 | comp words from Divvun.no |
| guoktedássásaš | guovttedássásaš | 2 |
|
426 | comp words from Divvun.no |
| Romsaprošeavttat | Romsa-prošeavttat | 1 |
|
426 | comp words from Divvun.no |
| geavaheaddjiide | geavaheddjiide | 1 |
|
426 | comp words from Divvun.no |
| SáMI | SÁMI | 1 |
|
448 | wrong sugg if mostly upper case |
| Distat | Disdat | 1 |
|
461 | missing suggestion |
| distat | disdat | 1 |
|
461 | missing suggestion |
| co2:ajn | co2:in | 2 |
|
508 | accepts smj entries |
| VG:at | VG:t | 1 |
|
508 | accepts SUB entries |
| VG:ii | VG:i | 1 |
|
508 | accepts SUB entries |
| co2:j | co2:i | 1 |
|
508 | accepts smj entries |
Spelling errors with only incorrect suggestions (71)
| Input word |
Expected correction |
Editing distance |
Suggestions | Bug ID | Comment |
|---|---|---|---|---|---|
| eahpenášunálalaččat | eahpenášuvnnalaččat | 3 |
|
eahpenášunálalaččat does not get marked | |
| oarje-eurohpai | Oarje-Eurohpái | 3 |
|
no spelling suggestion | |
| EO:a | EO | 2 |
|
||
| birrasit | birrasiid | 2 |
|
still wrong | |
| nuppogeahčái | nuppe geahčái | 2 |
|
"nuppe" is marked | |
| nuppogeahčái | nuppi geahčái | 2 |
|
||
| studentaavissas | studeantaaviissas | 2 |
|
multiple errors in compound - no sugg | |
| INTERREG:ii | INTERREG:i | 1 |
|
||
| goldasii | goldásii | 1 |
|
still wrong | |
| guovttemiesát | guovttemiesat | 1 |
|
still wrong | |
| ilgadeamis | ilgadeamos | 1 |
|
still wrong | |
| krimináláššiid | kriminálaáššiid | 1 |
|
missing suggestion | |
| skierranis | skieranis | 1 |
|
still wrong | |
| vakumas | vakuumas | 1 |
|
still wrong | |
| čorgedalla | čorgedallá | 1 |
|
still wrong | |
| divtásiiid | divtásiid | 1 |
|
367 | |
| čiekciin | čiekcin | 1 |
|
376 | |
| Oslobiila | Oslo-biila | 1 |
|
397 | name compounding |
| guollibusse | guollebusse | 1 |
|
397 | wrong compounding |
| loahpahandáhtomiin | loahpahandáhtoniin | 1 |
|
426 | comp words from Divvun.no |
| oidni | oinnii | 2 |
|
429 | missing suggestion |
| heajus-Oslo | heajos-Oslo | 1 |
|
431 | lowering in front of hyphen |
| lojis-Oslo | lojes-Oslo | 1 |
|
431 | lowering in front of hyphen |
| álkis-Oslo | álkes-Oslo | 1 |
|
431 | lowering in front of hyphen |
| Dienna | Dienne | 1 |
|
461 | missing suggestion |
| Eanadalit | Eandalit | 1 |
|
461 | missing suggestion |
| Julis | Julius | 1 |
|
461 | missing suggestion |
| Odná | Odne | 1 |
|
461 | missing suggestion |
| bidju | biedju | 1 |
|
461 | missing suggestion |
| dohkkenit | dohkkehit | 1 |
|
461 | missing suggestion |
| dáguid | dáiguid | 1 |
|
461 | missing suggestion |
| dáhálaš | dehálaš | 1 |
|
461 | missing suggestion |
| guovlui | guvlui | 1 |
|
461 | missing suggestion |
| gádnat | gávdnat | 1 |
|
461 | missing suggestion |
| la | lea | 1 |
|
461 | missing suggestion |
| leal | leat | 1 |
|
461 | missing suggestion |
| leažžago | leažžágo | 1 |
|
461 | missing suggestion |
| li | lei | 1 |
|
461 | missing suggestion |
| láibbiit | láibbiid | 1 |
|
461 | missing suggestion |
| maddái | maiddái | 1 |
|
461 | missing suggestion |
| moht | mot | 1 |
|
461 | missing suggestion |
| molssašumi | molsašumi | 1 |
|
461 | missing suggestion |
| molssašupmi | molsašupmi | 1 |
|
461 | missing suggestion |
| muhtim | muhtin | 1 |
|
461 | missing suggestion |
| mutnos | munnos | 1 |
|
461 | missing suggestion |
| málmmi | máilmmi | 1 |
|
461 | missing suggestion |
| nuohtiin | nuhtiin | 1 |
|
461 | missing suggestion |
| ovda | ovdal | 1 |
|
461 | missing suggestion |
| ođđ | ođđa | 1 |
|
461 | missing suggestion |
| romáhat | románat | 1 |
|
461 | missing suggestion |
| ráði | ráđi | 1 |
|
461 | missing suggestion |
| suorggiin | surggiin | 1 |
|
461 | missing suggestion |
| sáhkki | sáhkkái | 1 |
|
461 | missing suggestion |
| viesot | visot | 1 |
|
461 | missing suggestion |
| vuhtui | vuhtii | 1 |
|
461 | missing suggestion |
| niibispiidni | niibespiidni | 1 |
|
489 | lack of stem vowel shortening |
| rátnubiellu | rátnobiellu | 1 |
|
489 | lack of stem vowel shortening |
| stohpuspiidni | stohpospiidni | 1 |
|
489 | lack of stem vowel shortening |
| VG:aid | VG:id | 1 |
|
508 | accepts SUB entries |
| StuorOslolaš | Stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| stuorOslolaš | stuoraoslolaš | 2 |
|
518 | nominal + Prop/Der compound |
| Stuor-oslolaš | Stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| Stuora-oslolaš | Stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| StuoraOslolaš | Stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| Stuorra-oslolaš | Stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| StuorraOslolaš | Stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuor-oslolaš | stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuora-oslolaš | stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuoraOslolaš | stuoraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuorra-oslolaš | stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
| stuorraOslolaš | stuorraoslolaš | 1 |
|
518 | nominal + Prop/Der compound |
Spelling errors without suggestions (2)
False positives (45)
False positives with suggestions (41)
| Input word |
Suggestions | Bug ID | Comment |
|---|---|---|---|
| BPA |
|
missing acros - they're in the lexicon | |
| ERUF |
|
2007-06-26 speller test report | |
| ERUF:s |
|
||
| Sámeálbmogiidsearvi |
|
Compounds marked PlGen | |
| láikiriehpu |
|
adj+noun compound | |
| ođđahistorismma |
|
||
| čaddat-sánis |
|
380 | nomin. + hyphen comp |
| julev- |
|
408 | NounRoot -> Rreal not in discont. comp |
| muorraguollenáhkki |
|
419 | 3-part compounds |
| gievrrasmuorra |
|
421 | adj-attr+noun |
| Unicode |
|
424 | missing names |
| PPC |
|
425 | other words from Divvun.no |
| X:ii |
|
425 | other words from Divvun.no |
| ii-goallostuvvon |
|
426 | comp words from Divvun.no |
| Áilu |
|
445 | names beginning in Á |
| Ámmon |
|
445 | names beginning in Á |
| Ánde |
|
445 | names beginning in Á |
| Ánne |
|
445 | names beginning in Á |
| Ánná |
|
445 | names beginning in Á |
| Ávdnos |
|
445 | names beginning in Á |
| agimieljuohkáseapmi |
|
452 | several lexical bugs |
| boasttogáldu |
|
452 | several lexical bugs |
| buvttadeaddjiide |
|
452 | several lexical bugs |
| oažžuin |
|
452 | several lexical bugs |
| vástideaddjiin |
|
452 | several lexical bugs |
| moktige |
|
461 | missing suggestion |
| Áila |
|
484 | missing names |
| Áile |
|
484 | missing names |
| Áilegas |
|
484 | missing names |
| Áilegasjohka |
|
484 | missing names |
| Áilegasjávri |
|
484 | missing names |
| Áilegasvuopmi |
|
484 | missing names |
| Áilegasvárri |
|
484 | missing names |
| Áilegasčohkka |
|
484 | missing names |
| Áili |
|
484 | missing names |
| Áillehašnjunni |
|
484 | missing names |
| Áillohaš |
|
484 | missing names |
| Áillonieida |
|
484 | missing names |
| Áilloš |
|
484 | missing names |
| Áillun |
|
484 | missing names |
| vuostamuttjan |
|
486 | missing pronouns |
False positives without suggestions (4)
Unicode-standárddas Unicode-doarjja Áillehašvádjájohka Áilegasmuotki
False negatives (23)
| Input word |
Expected correction |
Editing distance |
Bug ID | Comment |
|---|---|---|---|---|
| Sámedikkeáirrasin | sámediggeáirrasin | 3 | does not get marked | |
| sámedikkepresideanta | sámediggepresideanta | 2 | still wrong | |
| Duiskas | Duiskkas | 1 | still wrong | |
| nazisttain | nasisttain | 1 | does not get marked | |
| suopmasápmelaš | suomasápmelaš | 1 | 397 | bad compounding |
| viessudihtor | viessodihtor | 1 | 397 | wrong compounding |
| Geavaheaddjedokumentašuvdna | Geavaheaddjidokumentašuvdna | 1 | 426 | comp words from Divvun.no |
| heajus- | heajos- | 1 | 431 | lowering in front of hyphen |
| lojis- | lojes- | 1 | 431 | lowering in front of hyphen |
| álkis- | álkes- | 1 | 431 | lowering in front of hyphen |
| suopmasápmelaš | suomasápmelaš | 1 | 449 | suopmasápmelaš comps wrongly accepted |
| biila-- | biila- | 1 | 458 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| nuvviid-- | nuvviid- | 1 | 458 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| olbmo | olbmoš | 1 | 461 | missing suggestion |
| niibbispiidni | niibespiidni | 2 | 489 | does not follow compound tags |
| ránubiellu | rátnobiellu | 2 | 489 | does not follow compound tags |
| balvvareaŋga | balvareaŋga | 1 | 489 | does not follow compound tags |
| Stuor-Oslolaš | Stuoraoslolaš | 2 | 518 | nominal + Prop/Der compound |
| Stuora-Oslolaš | Stuoraoslolaš | 2 | 518 | nominal + Prop/Der compound |
| Stuorra-Oslolaš | Stuorraoslolaš | 2 | 518 | nominal + Prop/Der compound |
| stuor-Oslolaš | stuoraoslolaš | 2 | 518 | nominal + Prop/Der compound |
| stuora-Oslolaš | stuoraoslolaš | 2 | 518 | nominal + Prop/Der compound |
| stuorra-Oslolaš | stuorraoslolaš | 2 | 518 | nominal + Prop/Der compound |
True negatives (92)
The correctly accepted words are listed here for easy copy&paste into Word to quick check regressions in new versions of the speller: No correctly spelled words accepted by an earlier speller should be rejected by later spellers. To check, just copy and paste these words to Word, and see if you get any red underlines. Duplicate words are not repeated, unless one is followed by some punctuation. The words are reverse sorted according to Unicode character code.
čállindárkkistanprošeavtta čállindárkkistanprográmmat čállindárkkistanprográmmaid Ábo vástidanproseantage váldouksa váldorádji vuosttas vuolleleksikonat sátnejuohkin sátnejuohkimis stuorraoslolaš stuoraoslolaš stohpospiidni rátnobiellu ođđasitčállinnjuolggadusaid okta nubbi niibespiidni máŋgasat máŋga muitalanbargu muitalage moktiige moaddásat moadde miljovnna mašiidnakompiláhtora láibi-sánis kompiláhtor juostá juosat juonin juoidá jna. itge inge iige iešdieđiheapmi hupman-, guokte guldalan-, golbma gievramuorra giellaguolli generatiivalaš garrasepmosit fitnodatčatnosiin entusiástajoavkku ea. duottarguolli duodjeregisttarskovvi diakritihkalaš bárrastávvalsániin boahtágo boahtáge bellodat bargoolmmái bargo-, balvareaŋga bahkkejeaddjit almmuhanbargu alfaveršuvdna Sátnejuohkin Sámirádioi Sámevuođahistorjá SÁB Stuorraoslolaš Stuoraoslolaš Sisasáddejuvvon Prográmmerárat OpenOffice.orgii NSR Máŋgasat Máretgo Macintosh Länsmanis LO Kaplan InDesign Guovdageainnu-láđđi Gobby Gažadanskovvi Finncomm Fink Bargo-, BETA Almmuhannjuolggadusat Airlines 2004:s 1983:s
Grouped by bug #
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| divtásiiid | divtásiid | 1 |
|
|
| jurdagiiid | jurdagiid | 1 |
|
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| čiekciin | čiekcin | 1 |
|
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| láibi-sánis | nomin. + hyphen comp | |||
| čaddat-sánis |
|
nomin. + hyphen comp |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| boahtáge | verb + clitic | |||
| iige | verb + clitic | |||
| inge | verb + clitic | |||
| itge | verb + clitic | |||
| muitalage | verb + clitic |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Harstadbiila | Harstad-biila | 1 |
|
name compounding |
| Oslobiila | Oslo-biila | 1 |
|
name compounding |
| guollibusse | guollebusse | 1 |
|
wrong compounding |
| oahppuneavvuid | oahpponeavvuid | 1 |
|
bad compounding |
| suopmasápmelaš | suomasápmelaš | 1 | bad compounding | |
| viessudihtor | viessodihtor | 1 | wrong compounding | |
| bárrastávvalsániin | 3-syll cons comp, act. 3-part comp | |||
| duottarguolli | regular cons comp | |||
| giellaguolli | regular vowel comp | |||
| váldorádji | comp with vowel shortening | |||
| váldouksa | comp with vowel shortening |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Máŋgasat | missing numerals | |||
| golbma | missing numerals | |||
| guokte | missing numerals | |||
| miljovnna | missing numerals | |||
| moadde | missing numerals | |||
| moaddásat | missing numerals | |||
| máŋga | missing numerals | |||
| máŋgasat | missing numerals | |||
| okta | missing numerals |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Airlines | multiword names: Finncomm Airlines | |||
| Finncomm | multiword names: Finncomm Airlines |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Bargo-, | word + hyph + comma | |||
| bargo-, | word + hyph + comma | |||
| guldalan-, | word + hyph + comma | |||
| hupman-, | word + hyph + comma |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| julev- |
|
NounRoot -> Rreal not in discont. comp |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| bahkkejeaddjit | missing derivation |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| garrasepmosit | adj -> adv |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| bargguitjuogadeapmi | bargguidjuogadeapmi | 1 |
|
wrong compound form on -t |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Máretgo | nominals with clitic not accepted | |||
| boahtágo | nominals with clitic not accepted |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| muorraguollenáhkki |
|
3-part compounds |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| gievramuorra | adj-nom+noun | |||
| gievrrasmuorra |
|
adj-attr+noun |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| 1983:s | infl. numerals not recognised | |||
| 2004:s | infl. numerals not recognised |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Fink | missing names | |||
| Gobby | missing names | |||
| InDesign | missing names | |||
| Kaplan | missing names | |||
| Macintosh | missing names | |||
| OpenOffice.orgii | missing names | |||
| Unicode |
|
missing names |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| BETA | other words from Divvun.no | |||
| PPC |
|
other words from Divvun.no | ||
| Prográmmerárat | other words from Divvun.no | |||
| X:ii |
|
other words from Divvun.no | ||
| alfaveršuvdna | other words from Divvun.no | |||
| diakritihkalaš | other words from Divvun.no | |||
| generatiivalaš | other words from Divvun.no | |||
| kompiláhtor | other words from Divvun.no |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Helssetvuođđoduvvon | Helsset-vuođđuduvvon | 2 |
|
comp words from Divvun.no |
| guoktedássásaš | guovttedássásaš | 2 |
|
comp words from Divvun.no |
| Geavaheaddjedokumentašuvdna | Geavaheaddjidokumentašuvdna | 1 | comp words from Divvun.no | |
| Romsaprošeavttat | Romsa-prošeavttat | 1 |
|
comp words from Divvun.no |
| geavaheaddjiide | geavaheddjiide | 1 |
|
comp words from Divvun.no |
| loahpahandáhtomiin | loahpahandáhtoniin | 1 |
|
comp words from Divvun.no |
| Sátnejuohkin | comp words from Divvun.no | |||
| Unicode-doarjja |
|
comp words from Divvun.no | ||
| Unicode-standárddas |
|
comp words from Divvun.no | ||
| alfaveršuvdna | comp words from Divvun.no | |||
| entusiástajoavkku | comp words from Divvun.no | |||
| ii-goallostuvvon |
|
comp words from Divvun.no | ||
| mašiidnakompiláhtora | comp words from Divvun.no | |||
| ođđasitčállinnjuolggadusaid | comp words from Divvun.no | |||
| sátnejuohkimis | comp words from Divvun.no | |||
| sátnejuohkin | comp words from Divvun.no | |||
| vuolleleksikonat | comp words from Divvun.no | |||
| čállindárkkistanprográmmaid | comp words from Divvun.no | |||
| čállindárkkistanprográmmat | comp words from Divvun.no | |||
| čállindárkkistanprošeavtta | comp words from Divvun.no |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| oidni | oinnii | 2 |
|
missing suggestion |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| heajus- | heajos- | 1 | lowering in front of hyphen | |
| heajus-Oslo | heajos-Oslo | 1 |
|
lowering in front of hyphen |
| lojis- | lojes- | 1 | lowering in front of hyphen | |
| lojis-Oslo | lojes-Oslo | 1 |
|
lowering in front of hyphen |
| álkis- | álkes- | 1 | lowering in front of hyphen | |
| álkis-Oslo | álkes-Oslo | 1 |
|
lowering in front of hyphen |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Ábo | names beginning in Á | |||
| Áilu |
|
names beginning in Á | ||
| Ámmon |
|
names beginning in Á | ||
| Ánde |
|
names beginning in Á | ||
| Ánne |
|
names beginning in Á | ||
| Ánná |
|
names beginning in Á | ||
| Ávdnos |
|
names beginning in Á |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| SáMI | SÁMI | 1 |
|
wrong sugg if mostly upper case |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| suopmasápmelaš | suomasápmelaš | 1 | suopmasápmelaš comps wrongly accepted |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Gažadanskovvi | verb+noun, verb+verb | |||
| Sisasáddejuvvon | verb+noun, verb+verb | |||
| vástidanproseantage | verb+noun, verb+verb |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Länsmanis | several lexical bugs | |||
| agimieljuohkáseapmi |
|
several lexical bugs | ||
| boasttogáldu |
|
several lexical bugs | ||
| buvttadeaddjiide |
|
several lexical bugs | ||
| duodjeregisttarskovvi | several lexical bugs | |||
| iešdieđiheapmi | several lexical bugs | |||
| oažžuin |
|
several lexical bugs | ||
| vástideaddjiin |
|
several lexical bugs |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| biila-- | biila- | 1 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens | |
| nuvviid-- | nuvviid- | 1 | double hyphens after some words - can really only be tested manually, the command line speller will ignore superfluous hyphens |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Dienna | Dienne | 1 |
|
missing suggestion |
| Distat | Disdat | 1 |
|
missing suggestion |
| Eanadalit | Eandalit | 1 |
|
missing suggestion |
| Julis | Julius | 1 |
|
missing suggestion |
| Odná | Odne | 1 |
|
missing suggestion |
| bidju | biedju | 1 |
|
missing suggestion |
| distat | disdat | 1 |
|
missing suggestion |
| dohkkenit | dohkkehit | 1 |
|
missing suggestion |
| dáguid | dáiguid | 1 |
|
missing suggestion |
| dáhálaš | dehálaš | 1 |
|
missing suggestion |
| guovlui | guvlui | 1 |
|
missing suggestion |
| gádnat | gávdnat | 1 |
|
missing suggestion |
| la | lea | 1 |
|
missing suggestion |
| leal | leat | 1 |
|
missing suggestion |
| leažžago | leažžágo | 1 |
|
missing suggestion |
| li | lei | 1 |
|
missing suggestion |
| láibbiit | láibbiid | 1 |
|
missing suggestion |
| maddái | maiddái | 1 |
|
missing suggestion |
| moht | mot | 1 |
|
missing suggestion |
| molssašumi | molsašumi | 1 |
|
missing suggestion |
| molssašupmi | molsašupmi | 1 |
|
missing suggestion |
| muhtim | muhtin | 1 |
|
missing suggestion |
| mutnos | munnos | 1 |
|
missing suggestion |
| málmmi | máilmmi | 1 |
|
missing suggestion |
| nuohtiin | nuhtiin | 1 |
|
missing suggestion |
| olbmo | olbmoš | 1 | missing suggestion | |
| ovda | ovdal | 1 |
|
missing suggestion |
| ođđ | ođđa | 1 |
|
missing suggestion |
| romáhat | románat | 1 |
|
missing suggestion |
| ráði | ráđi | 1 |
|
missing suggestion |
| suorggiin | surggiin | 1 |
|
missing suggestion |
| sáhkki | sáhkkái | 1 |
|
missing suggestion |
| viesot | visot | 1 |
|
missing suggestion |
| vuhtui | vuhtii | 1 |
|
missing suggestion |
| moktige |
|
missing suggestion | ||
| moktiige | missing suggestion |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Áila |
|
missing names | ||
| Áile |
|
missing names | ||
| Áilegas |
|
missing names | ||
| Áilegasjohka |
|
missing names | ||
| Áilegasjávri |
|
missing names | ||
| Áilegasmuotki |
|
missing names | ||
| Áilegasvuopmi |
|
missing names | ||
| Áilegasvárri |
|
missing names | ||
| Áilegasčohkka |
|
missing names | ||
| Áili |
|
missing names | ||
| Áillehašnjunni |
|
missing names | ||
| Áillehašvádjájohka |
|
missing names | ||
| Áillohaš |
|
missing names | ||
| Áillonieida |
|
missing names | ||
| Áilloš |
|
missing names | ||
| Áillun |
|
missing names |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| juoidá | missing pronouns | |||
| juonin | missing pronouns | |||
| juosat | missing pronouns | |||
| juostá | missing pronouns | |||
| vuostamuttjan |
|
missing pronouns |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| niibbispiidni | niibespiidni | 2 | does not follow compound tags | |
| ránubiellu | rátnobiellu | 2 | does not follow compound tags | |
| balvvareaŋga | balvareaŋga | 1 | does not follow compound tags | |
| niibispiidni | niibespiidni | 1 |
|
lack of stem vowel shortening |
| rátnubiellu | rátnobiellu | 1 |
|
lack of stem vowel shortening |
| stohpuspiidni | stohpospiidni | 1 |
|
lack of stem vowel shortening |
| balvareaŋga | does not follow compound tags | |||
| niibespiidni | does not follow compound tags | |||
| rátnobiellu | does not follow compound tags | |||
| stohpospiidni | does not follow compound tags |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| bellodat | regressions |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| co2:ajn | co2:in | 2 |
|
accepts smj entries |
| VG:aid | VG:id | 1 |
|
accepts SUB entries |
| VG:at | VG:t | 1 |
|
accepts SUB entries |
| VG:ii | VG:i | 1 |
|
accepts SUB entries |
| co2:j | co2:i | 1 |
|
accepts smj entries |
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| Stuor-Oslolaš | Stuoraoslolaš | 2 | nominal + Prop/Der compound | |
| StuorOslolaš | Stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| Stuora-Oslolaš | Stuoraoslolaš | 2 | nominal + Prop/Der compound | |
| Stuorra-Oslolaš | Stuorraoslolaš | 2 | nominal + Prop/Der compound | |
| stuor-Oslolaš | stuoraoslolaš | 2 | nominal + Prop/Der compound | |
| stuorOslolaš | stuoraoslolaš | 2 |
|
nominal + Prop/Der compound |
| stuora-Oslolaš | stuoraoslolaš | 2 | nominal + Prop/Der compound | |
| stuorra-Oslolaš | stuorraoslolaš | 2 | nominal + Prop/Der compound | |
| Stuor-oslolaš | Stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| Stuora-oslolaš | Stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| StuoraOslolaš | Stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| Stuorra-oslolaš | Stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| StuorraOslolaš | Stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuor-oslolaš | stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuora-oslolaš | stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuoraOslolaš | stuoraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuorra-oslolaš | stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| stuorraOslolaš | stuorraoslolaš | 1 |
|
nominal + Prop/Der compound |
| Stuoraoslolaš | nominal + Prop/Der compound | |||
| Stuorraoslolaš | nominal + Prop/Der compound | |||
| stuoraoslolaš | nominal + Prop/Der compound | |||
| stuorraoslolaš | nominal + Prop/Der compound |
Testpairs not in bugs
| Input word |
Expected correction |
Editing distance |
Suggestions | Comment |
|---|---|---|---|---|
| 1883:as | 1883:s | 1 |
|
|
| 1883as | 1883:s | 1 |
|
|
| EO:a | EO | 2 |
|
|
| ERUF:s |
|
|||
| INTERREG:ii | INTERREG:i | 1 |
|
|
| nuppogeahčái | nuppi geahčái | 2 |
|
|
| ođđahistorismma |
|
|||
| nuppogeahčái | nuppe geahčái | 2 |
|
"nuppe" is marked |
| sihkarit | sihkkarit | 1 |
|
"sihkarit" is not marked |
| 169:is | 169:s | 1 |
|
169:is does not get marked |
| 2003:as | 2003:s | 1 |
|
2003:as does not get marked |
| 1883s | 1883:s | 1 |
|
2007-06-26 speller test report |
| ERUF |
|
2007-06-26 speller test report | ||
| fitnodatčatnosiin | 2007-06-26 speller test report | |||
| Sámeálbmogiidsearvi |
|
Compounds marked PlGen | ||
| Sámevuođahistorjá | Compounds marked SgGen | |||
| Guovdageainnu-láđđi | Compounds with hyphens | |||
| Almmuhannjuolggadusat | actio compound | |||
| almmuhanbargu | actio compound | |||
| muitalanbargu | actio compound | |||
| láikiriehpu |
|
adj+noun compound | ||
| Sámedikkeáirrasin | sámediggeáirrasin | 3 | does not get marked | |
| eaktudáhtolaččat | eaktodáhtolaččat | 1 |
|
does not get marked |
| nazisttain | nasisttain | 1 | does not get marked | |
| eahpenášunálalaččat | eahpenášuvnnalaččat | 3 |
|
eahpenášunálalaččat does not get marked |
| jna. | missing abbreviations | |||
| ea. | missing abbreviations (they are in the LexC source files) | |||
| BPA |
|
missing acros - they're in the lexicon | ||
| LO | missing acros - they're in the lexicon | |||
| NSR | missing acros - they're in the lexicon | |||
| SÁB | missing acros - they're in the lexicon | |||
| nubbi | missing ordinal | |||
| vuosttas | missing ordinal | |||
| krimináláššiid | kriminálaáššiid | 1 |
|
missing suggestion |
| studentaavissas | studeantaaviissas | 2 |
|
multiple errors in compound - no sugg |
| Lulli-Eurohpas | Lulli-Eurohpás | 1 |
|
no spelling suggestion |
| oarje-eurohpai | Oarje-Eurohpái | 3 |
|
no spelling suggestion |
| Máttá-Trøndelágas | Mátta-Trøndelagas | 2 |
|
no suggestion |
| bargoolmmái | nom compounding with vowel shortening | |||
| skuvlajagin | skuvlajagiin | 1 |
|
single error in compound - no sugg |
| 2003as | 2003:s | 1 |
|
still wrong |
| Duiskas | Duiskkas | 1 | still wrong | |
| birrasit | birrasiid | 2 |
|
still wrong |
| goldasii | goldásii | 1 |
|
still wrong |
| guovttemiesát | guovttemiesat | 1 |
|
still wrong |
| guššalemiin | giššalemiin | 1 |
|
still wrong |
| ilgadeamis | ilgadeamos | 1 |
|
still wrong |
| jápmina | jápmima | 1 |
|
still wrong |
| milljovnna | miljovnna | 1 |
|
still wrong |
| skierranis | skieranis | 1 |
|
still wrong |
| sámedikkepresideanta | sámediggepresideanta | 2 | still wrong | |
| vakumas | vakuumas | 1 |
|
still wrong |
| vuoittohallamin | vuoittáhallamin | 1 |
|
still wrong |
| čorgedalla | čorgedallá | 1 |
|
still wrong |
| Sámirádioi | sápmi+N+SgGenCmp |