Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
27 commits
Select commit Hold shift + click to select a range
04aef43
Add correspomding->corresponding and friends
peternewman Aug 14, 2020
2a2bef6
Fix the sort, add cyrstal->crystal
peternewman Aug 14, 2020
40f4c97
Add accessoires->accessories
peternewman Aug 15, 2020
17dc4fb
Add openscource->open-source, open source, opensource, and friends
peternewman Aug 15, 2020
5e19368
Add some more suggestions to #1330
peternewman Aug 18, 2020
98a56ed
Move dur as a popular abbreviation for duration
peternewman Aug 19, 2020
c97735b
Dur is often used for duration in code
peternewman Aug 19, 2020
1c68969
Merge pull request #1673 from codespell-project/master
peternewman Sep 2, 2020
c1807ef
Suggestions from @lurch and @sebweb3r
peternewman Sep 2, 2020
1fcda3f
Sort the dictionary
peternewman Sep 2, 2020
157a6e1
Fix a missing comma
peternewman Sep 2, 2020
0094f12
More suggestions
peternewman Sep 3, 2020
1e94350
Move calender(s) to rare
peternewman Sep 4, 2020
ec001b6
Add calender(s) to rare
peternewman Sep 4, 2020
1b7b365
Add splig->split, splign, and friends
peternewman Sep 5, 2020
25e8fc2
Add bale->able to rare
peternewman Sep 12, 2020
819fb19
Add procedre->procedure and friend
peternewman Sep 16, 2020
3584dff
Add nax->max, nin->min and long versions
peternewman Sep 17, 2020
a51e812
Address lots of comments
peternewman Sep 21, 2020
33164ad
Sort the dictionary
peternewman Sep 21, 2020
fa3029e
More sorting!
peternewman Sep 21, 2020
ed89a55
Remove specialties->specialities from rare as it's a US word
peternewman Oct 5, 2020
490a979
Add specialities->specialties and friends to GB to US
peternewman Oct 5, 2020
396190b
Add coverge->coverage
peternewman Oct 12, 2020
1d2c8e4
Add some more multi* typos
peternewman Oct 30, 2020
837ce67
Remove a legitimate word, add multiplier and some other variants per …
peternewman Nov 2, 2020
ccfb5ad
Merge branch 'master' into peternewman-correspomding
peternewman Nov 2, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
90 changes: 75 additions & 15 deletions codespell_lib/data/dictionary.txt
Original file line number Diff line number Diff line change
Expand Up @@ -297,6 +297,9 @@ accessiblilty->accessibility
accessile->accessible
accessintg->accessing
accessisble->accessible
accessoire->accessory
accessoires->accessories, accessorise,
accessoirez->accessorize, accessories,
accessort->accessor
accesss->access
accesssor->accessor
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could also suggest accesssor->access or (if that's not too far outside the scope of this PR)
Should accessor->access or maybe be in the code dictionary?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could also suggest accesssor->access or (if that's not too far outside the scope of this PR)

I think that should be a separate PR if it goes in, but you're saying you typed an s instead of a space seems rather unlikely to be. Some real world examples would be good again.

Should accessor->access or maybe be in the code dictionary?

Yes if it was to go in. There seem to be enough examples of those words together that it's plausible. But probably one for a separate PR given this one is getting big. Why don't you do the honours?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry if I made too many suggestions, I'm happy for you to just ignore any comments I made, as you see fit.

Expand Down Expand Up @@ -4229,8 +4232,6 @@ calculuations->calculations
calcurate->calculate
calcutated->calculated
caleed->called
calender->calendar
calenders->calendars
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh wow, new one for me

caler->caller
calescing->coalescing
caliased->aliased
Expand Down Expand Up @@ -4846,7 +4847,7 @@ childen->children
childern->children
childres->children
childs->children, child's,
chiled->child
chiled->child, chilled,
chiledren->children
chilren->children
chineese->chinese
Expand Down Expand Up @@ -4877,7 +4878,7 @@ choser->chooser
chosing->choosing
chossen->chosen
chould->should, could,
chouse->chose
chouse->choose, chose, choux,
chracter->character
chracters->characters
chractor->character
Expand Down Expand Up @@ -7118,6 +7119,10 @@ correllation->correlation
correllations->correlations
correnspond->correspond
corrensponded->corresponded
correnspondence->correspondence
correnspondences->correspondences
correnspondent->correspondent
correnspondents->correspondents
corrensponding->corresponding
corrensponds->corresponds
corrent->correct, current,
Expand All @@ -7137,6 +7142,14 @@ correspnding->corresponding
correspodence->correspondence
correspoding->corresponding
correspoinding->corresponding
correspomd->correspond
correspomded->corresponded
correspomdence->correspondence
correspomdences->correspondences
correspomdent->correspondent
correspomdents->correspondents
correspomding->corresponding
correspomds->corresponds
correspondance->correspondence
correspondances->correspondences
correspondant->correspondent
Expand Down Expand Up @@ -7302,6 +7315,7 @@ covarege->coverage
covention->convention
covere->cover
coveres->covers
coverge->coverage
coverred->covered
coverted->converted, covered, coveted,
coverting->converting
Expand Down Expand Up @@ -7420,7 +7434,13 @@ crowkay->croquet
crowm->crown
crrespond->correspond
crsytal->crystal
crsytalline->crystalline
crsytallisation->crystallisation
crsytallise->crystallise
crsytallization->crystallization
crsytallize->crystallize
crsytallographic->crystallographic
crsytals->crystals
crtical->critical
crticised->criticised
crucialy->crucially
Expand Down Expand Up @@ -7550,6 +7570,13 @@ Cyrllic->Cyrillic
cyrpto->crypto
cyrrent->current
cyrrilic->Cyrillic
cyrstal->crystal
cyrstalline->crystalline
cyrstallisation->crystallisation
cyrstallise->crystallise
cyrstallization->crystallization
cyrstallize->crystallize
cyrstals->crystals
cyrto->crypto
cywgin->Cygwin
daa->data
Expand Down Expand Up @@ -10051,7 +10078,6 @@ dupplicates->duplicates
dupplicating->duplicating
dupplication->duplication
dupplications->duplications
dur->due
durationm->duration
durectories->directories
durectory->directory
Expand Down Expand Up @@ -10208,7 +10234,7 @@ ehancements->enhancements
ehough->enough
ehr->her
ehternet->Ethernet
ehther->ether
ehther->ether, either,
ehthernet->ethernet
eighter->either
eigth->eighth, eight,
Expand Down Expand Up @@ -16147,7 +16173,7 @@ jumpt->jumped, jump,
juristiction->jurisdiction
juristictions->jurisdictions
jus->just
juse->just
juse->just, juice, Jude, June,
justfied->justified
justication->justification
justifed->justified
Expand Down Expand Up @@ -17730,13 +17756,14 @@ mudering->murdering
mudule->module
mudules->modules
muext->mutex
muhammadan->muslim
mulithread->multithread
mulitpart->multipart
mulitpath->multipath
mulitple->multiple
mulitplicative->multiplicative
mulitplied->multiplied
mulitplier->multiplier
mulitpliers->multipliers
multi-dimenional->multi-dimensional
multi-dimenionsal->multi-dimensional
multi-langual->multi-lingual
Expand All @@ -17759,8 +17786,10 @@ multilpe->multiple
multipe->multiple
multipes->multiples
multipiler->multiplier
multipilers->multipliers
multipl->multiple, multiply,
multipled->multiplied
multipler->multiplier, multiple,
multiplers->multipliers
multipliciaton->multiplication
multiplicites->multiplicities
Expand All @@ -17775,8 +17804,17 @@ multitute->multitude
multivriate->multivariate
multixsite->multisite
multliple->multiple
multliples->multiples
multliplied->multiplied
multliplier->multiplier
multlipliers->multipliers
multliplies->multiplies
multliply->multiply
multliplying->multiplying
multple->multiple
multplied->multiplied
multplier->multiplier
multpliers->multipliers
multplies->multiplies
multply->multiply
multplying->multiplying
Expand Down Expand Up @@ -17817,8 +17855,12 @@ mutli->multi
mutli-threaded->multi-threaded
mutlipart->multipart
mutliple->multiple
mutlipler->multiplier
mutlipler->multiplier, multiple,
mutliples->multiples
mutliplication->multiplication
mutliplicites->multiplicities
mutliplier->multiplier
mutlipliers->multipliers
mutliply->multiply
mutully->mutually
mutux->mutex
Expand Down Expand Up @@ -17882,6 +17924,10 @@ navagate->navigate
navagating->navigating
navagation->navigation
navagitation->navigation
nax->max, nad,
naxima->maxima
naximal->maximal
naximum->maximum
Nazereth->Nazareth
nclude->include
nd->and, 2nd,
Expand Down Expand Up @@ -18412,11 +18458,15 @@ nighboring->neighboring
nightfa;;->nightfall
nightime->nighttime
nimutes->minutes
nin->inn, min, bin, nine,
nineth->ninth
ninima->minima
ninimal->minimal
ninimum->minimum
ninjs->ninja
ninteenth->nineteenth
ninties->1990s
ninty->ninety
ninties->nineties, 1990s,
ninty->ninety, minty,
nither->neither
nknown->unknown
nkow->know
Expand Down Expand Up @@ -18951,6 +19001,8 @@ openess->openness
openin->opening
openned->opened
openning->opening
openscource->open-source, open source, opensource,
openscourced->open-sourced, open sourced, opensourced,
operaand->operand
operaands->operands
operaion->operation
Expand Down Expand Up @@ -19488,7 +19540,7 @@ Palistinians->Palestinians
pallete->palette
pallette->palette
palletted->paletted
paln->plan
paln->plan, pain, palm,
paltette->palette
paltform->platform
pamflet->pamphlet
Expand Down Expand Up @@ -20854,6 +20906,8 @@ procedes->proceeds, precedes,
procedger->procedure
proceding->proceeding, preceding,
procedings->proceedings
procedre->procedure
procedres->procedures
proceedure->procedure
proceedures->procedures
proceeed->proceed
Expand Down Expand Up @@ -23892,6 +23946,10 @@ scoll->scroll
scolling->scrolling
scopeing->scoping
scorebord->scoreboard
scource->source, scouse,
scourced->sourced, scoured,
scourcer->scourer, sorcerer, scouser,
scources->sources
scrach->scratch
scrached->scratched
scraches->scratches
Expand Down Expand Up @@ -25322,8 +25380,10 @@ spile->spite, spiral,
spiltting->splitting
spinlcok->spinlock
spinock->spinlock
splig->split, splign,
spligs->splits
spliitting->splitting
splite->split
splite->split, splits, splice,
spliting->splitting
splitted->split
splittng->splitting
Expand Down Expand Up @@ -25438,7 +25498,7 @@ standartized->standardized
standarts->standards
standatd->standard
standtard->standard
standy->standby
standy->standby, sandy, standee,
stange->strange
stanp->stamp
staration->starvation
Expand Down Expand Up @@ -29093,7 +29153,7 @@ vunerable->vulnerable
vyer->very
vyre->very
waht->what
wakeus->wakeups
wakeus->wakeups, wake us, walrus,
wakup->wakeup
wallthickness->wall thickness
wan't->want, wasn't,
Expand Down
1 change: 1 addition & 0 deletions codespell_lib/data/dictionary_code.txt
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,7 @@ copyable->copiable
define'd->defined
dof->of, doff,
dont->don't
dur->due
endcode->encode
errorstring->error string
files'->file's
Expand Down
2 changes: 2 additions & 0 deletions codespell_lib/data/dictionary_en-GB_to_en-US.txt
Original file line number Diff line number Diff line change
Expand Up @@ -224,6 +224,8 @@ specialise->specialize
specialised->specialized
specialises->specializes
specialising->specializing
specialities->specialties
speciality->specialty
splendour->splendor
standardisation->standardization
standardise->standardize
Expand Down
4 changes: 3 additions & 1 deletion codespell_lib/data/dictionary_rare.txt
Original file line number Diff line number Diff line change
Expand Up @@ -6,13 +6,16 @@ automatized->automated
automatizes->automates
backword->backward
backwords->backwards
bale->able
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe pale?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And kale, dale, male etc. None of those letters are close to B though. I really just wanted to cover the transposing of two characters from one perfectly valid word ( https://en.wikipedia.org/wiki/Baler ) to another more likely one. I suspect male might be more likely than pale anyway if we went down that route.

Copy link
Contributor

@sebweb3r sebweb3r Sep 14, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pale just came to my mind, because there is a german dialect, which doesn't distinguish between b or p, and d or t 😄
Close on the keyboard would be vale. I just wanted to prevent people from auto-correcting something they didn't want to (yes i know interactive mode).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could also suggest bale->bald ?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Personally I don't think vaguely infrequent rare valid words should suggest similar but unrelated ones unless there are very strong arguments for it, bald seems the only likely typo we've come up with. I'd suggest vale is as infrequent as bale.

I think also the frequency stats are very telling (I wasn't too far off with vale; bald is still pretty rare):
http://app.aspell.net/lookup?dict=en_US&words=bale%0D%0Aable%0D%0Avale%0D%0Abald%0D%0Apale%0D%0Amale%0D%0Adale%0D%0Akale%0D%0Asale

Between those and the fact it's just a transposition, so not even a real typo, I'd really like to leave it as is.

I think the auto-correct solution is to add what we dropped when we went to multi dictionary and have everything in rare (and names and informal) offer corrections to itself automatically). I can pull it out to a separate PR until that's been done if that would keep you two happy.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You obviously understand this project much better than I do; I don't have strong opinions either way.

bellow->below
bloc->block
blocs->blocks
bodgy->body
buss->bus
busses->buses
calculatable->calculable
calender->calendar
calenders->calendars
cant->can't
chack->check, chalk, cheque,
chancel->cancel
Expand Down Expand Up @@ -115,7 +118,6 @@ sightly->slightly
singe->single
singed->signed, singled,
slippy->slippery
specialties->specialities
specif->specify, specific,
steams->streams
sting->string
Expand Down