Latin Extended scores highest because phonetic extensions are deliberately designed to resemble their Latin base forms. Mathematical Alphanumeric Symbols dominate the dataset (806 of 1,418 pairs, about 57%) but score low: ornate mathematical letterforms (script, fraktur, double-struck) look nothing like plain Latin set in a different font. Arabic scores lowest because its letterforms are structurally different from Latin even when confusables.txt maps them as confusable.
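A per-block comparison like this can be sketched as a simple group-and-average step. The snippet below is a minimal illustration, not the pipeline used for the numbers above: the `BLOCKS` table is a hand-picked subset of Unicode block ranges, `block_of` and `mean_score_by_block` are hypothetical helper names, and the scores in `sample` are placeholders rather than measured values.

```python
from collections import defaultdict
from statistics import mean

# Hand-picked subset of Unicode block ranges (inclusive code points).
# A full analysis would load every range from the Unicode Blocks.txt file.
BLOCKS = [
    ("Latin Extended-A", 0x0100, 0x017F),
    ("Arabic", 0x0600, 0x06FF),
    ("Phonetic Extensions", 0x1D00, 0x1D7F),
    ("Mathematical Alphanumeric Symbols", 0x1D400, 0x1D7FF),
]


def block_of(ch):
    """Return the block name for a single character, or "Other"."""
    cp = ord(ch)
    for name, lo, hi in BLOCKS:
        if lo <= cp <= hi:
            return name
    return "Other"


def mean_score_by_block(pairs):
    """Average similarity per source-character block.

    pairs: iterable of (source_char, target_char, score) tuples.
    """
    grouped = defaultdict(list)
    for source, _target, score in pairs:
        grouped[block_of(source)].append(score)
    return {name: mean(scores) for name, scores in grouped.items()}


# Placeholder scores for illustration only; these are NOT measured results.
sample = [
    ("\U0001D400", "A", 0.50),  # MATHEMATICAL BOLD CAPITAL A
    ("\U0001D4D0", "A", 0.25),  # MATHEMATICAL BOLD SCRIPT CAPITAL A
    ("\u1D00", "A", 0.75),      # LATIN LETTER SMALL CAPITAL A
    ("\u0627", "l", 0.125),     # ARABIC LETTER ALEF
]

for block, score in sorted(mean_score_by_block(sample).items()):
    print(f"{block}: {score:.3f} mean similarity")
```

Grouping by the source character's block (rather than the target's) matches the framing above, where the target is always a plain Latin letter and the source block is what varies.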
# Export as CSV