Jump to content

User talk:Trey314159/homoglyphHunter.js

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

Addition

[edit]

Hi, you might to add the following to your Latin-to-Cyrillic map: 'ḯ':'ї́', 'Ḯ':'Ї́',. I was using the map to correct Ukrainian words with the wrong script and just encountered the lowercase letter in a Ukrainian word in Reconstruction:Proto-Slavic/dojiti. — Eru·tuon 23:16, 17 April 2019 (UTC)[reply]

Actually, a way to avoid having to add more characters is to convert to canonically decompose and then do the replacing (and recompose because that is the normalization used in wikitext). Then all the diacriticked letters could be removed, if they decompose to a letter and combining diacritics. At the moment I can't think of any weird effects that would have in this case. — Eru·tuon 04:52, 18 April 2019 (UTC)[reply]

Yep, there is a catch. The grapheme с̧ (Cyrillic small letter es, combining cedilla), which would result from the decomposition method, is not canonically equivalent to ҫ (Cyrillic small letter es with descender). Sigh. — Eru·tuon 08:58, 18 April 2019 (UTC)[reply]