Normalization token filters
Several token filters are available that try to normalize the special characters of a particular language.
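As a rough illustration of how one of these filters might be used, the sketch below builds an index-settings body that references a normalization filter by name inside a custom analyzer. The analyzer name `my_normalizing_analyzer`, the helper `build_index_settings`, and the pairing with the `standard` tokenizer and `lowercase` filter are assumptions made for this example, not something this page prescribes.

```python
import json


def build_index_settings(normalization_filter: str) -> dict:
    """Return an index-settings body whose custom analyzer applies the
    given normalization token filter after lowercasing.

    The analyzer name "my_normalizing_analyzer" is hypothetical.
    """
    return {
        "settings": {
            "analysis": {
                "analyzer": {
                    "my_normalizing_analyzer": {
                        "type": "custom",
                        "tokenizer": "standard",
                        # Filters run in the order listed.
                        "filter": ["lowercase", normalization_filter],
                    }
                }
            }
        }
    }


# Example: use the German normalization filter from the list on this page.
body = build_index_settings("german_normalization")
print(json.dumps(body, indent=2))
```

The printed JSON could be sent as the body of an index-creation request. Swapping in another filter name from this page changes only the argument passed to the helper. The filters are grouped by language as follows.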
- Arabic
arabic_normalization
- German
german_normalization
- Hindi
hindi_normalization
- Indic
indic_normalization
- Kurdish (Sorani)
sorani_normalization
- Persian
persian_normalization
- Scandinavian
scandinavian_normalization, scandinavian_folding
- Serbian