ICU: What does NFD and NFC mean?
Asked Answered
W

1

7

I found a snippet which reads

Any-Latin; NFD; [:Nonspacing Mark:] Remove; NFC; [:Punctuation:] Remove; Lower();

...and is supposed to make arbitrary string well fitting into a URL.

So, I guess the things between the semicolons are something like "commands" which are to be executed, but what particularly does NDF and NFC stand for? I really did find nothing even in the official documentation...

Woodie answered 14/5, 2016 at 12:52 Comment(5)
w3.org/TR/charmod-norm/#normalization_formsCrisper
oy thanks; you should have posted that as an answer for I can accept it.. what should I do?Giovannagiovanni
If that link does not help you write your own answer then I'll have to apologize.Crisper
@HansPassant that's a good page, though.Donielle
@AdrianFöder where in the official documentation did you look? Asking so I can improve it.Donielle
D
5

See ICU transliterators and the linked page on TR15 normalization forms for complete examples.

Normalization Form D (NFD) Canonical Decomposition

Normalization Form C (NFC) Canonical Decomposition, followed by Canonical Composition

Normalization Form KD (NFKD) Compatibility Decomposition

Normalization Form KC (NFKC) Compatibility Decomposition, followed by Canonical Composition

Donielle answered 19/7, 2016 at 20:10 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.