Wiktionary:Spell check/likely misspellings

T1+ASCII from 2022-04-20 dump excluding quotations

Most of the likely misspellings from this section have been cleaned up. Some of the remainder are alternate forms for which simple entries could be created. -- Beland (talk) 19:48, 2 May 2023 (UTC)

2-5

1 (n-z)

1 (a-m)

More likely typos from 2023-04-20 dump

ME

Probable English-language compounds (with and without hyphen).

ME 2+

ME 1 (q-z)

ME 1 (g-p)

ME 1 (a-f)

TE

Possible English-language typos.

C

Chemistry-related words.

TS

These are suspected whitespace errors.

N+DOUBLEDOT

Please ignore this section; there seems to be a parser bug which is causing false alarms.

Comment: do these represent a bug in the script generating the list? I don't see them in the displayed text or the written wikicode of any of these pages, even if I look through old revisions from months ago. Looking for commonalities, maybe it's a bug that arises when a string that lacks an entry is followed by a period and then a line break? - -sche (discuss) 23:29, 2 May 2023 (UTC)
Hmm, yes, it looks like periods from before and after templates are somehow being jammed together. Not sure if that's a bug in my suppression code or a bug in the NLTK tokenizer. I'll check into that, but in the meantime this section can be ignored. I'll just drop it from future reports since there's nothing useful in it. -- Beland (talk) 23:42, 2 May 2023 (UTC)
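
For illustration only, here is a minimal Python sketch of one way periods on either side of a template could end up jammed together. It is not the report script, which is not shown on this page; the regex, the function names, and the sample wikitext are all hypothetical, and the real cause may lie elsewhere (for example in the tokenizer).

```python
import re

# Hypothetical pattern for a simple, non-nested {{...}} template.
# The actual suppression code is not shown in this thread.
TEMPLATE_RE = re.compile(r"\{\{[^{}]*\}\}")


def strip_templates_naive(wikitext: str) -> str:
    """Delete templates outright, so the text on both sides ends up touching."""
    return TEMPLATE_RE.sub("", wikitext)


def strip_templates_spaced(wikitext: str) -> str:
    """Replace each template with a space, so neighbouring text stays separate."""
    return TEMPLATE_RE.sub(" ", wikitext)


# Made-up sample: a period, then a template, then another period and a line break.
sample = "A rare word.{{rfv-sense|en}}.\nNext line"

naive = strip_templates_naive(sample)
spaced = strip_templates_spaced(sample)

print(repr(naive))
print(repr(spaced))
```

In the naive version the two periods become adjacent, which is the kind of string a double-dot check would flag even though it never appears in the wikicode or the rendered page. Substituting a space is one possible mitigation, assuming the downstream tokenizer treats whitespace as a boundary.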