Currently the confusable data doesn't fully include CJK ideographs related info.
Unihan database provides these:
kZVariant which describes logically unifiable ideographs that are separately encoded for other reasons
kSpoofingVariant which describes confusable ideograph relationships
EquivalentUnifiedIdeograph.txt which describes CJK Radical/Stroke and their corresponding Ideograph.
Procedurally i'd recommend Unicode security standards adopt these as confusables upstream, either by data or by algorithm, and then this crate can support them automatically.
cc @Manishearth
Currently the confusable data doesn't fully include CJK ideographs related info.
Unihan database provides these:
kZVariantwhich describes logically unifiable ideographs that are separately encoded for other reasonskSpoofingVariantwhich describes confusable ideograph relationshipsEquivalentUnifiedIdeograph.txtwhich describes CJK Radical/Stroke and their corresponding Ideograph.Procedurally i'd recommend Unicode security standards adopt these as confusables upstream, either by data or by algorithm, and then this crate can support them automatically.
cc @Manishearth