Currently used string simplification process is simple:
- Normalise all character with NFKD, which removes accents from characters and create 2 different chars.
- Remove all characters that are not in ASCII range.
This process works just fine for almost all cases, but in some situtation it fails, such as the letter ı does not have any accent, but used a lot in Turkish language and clearly corresponds to the letter i in ASCII, but since it does not have accent, it gets lost in the process.
We need to find a way to support such characters.
Currently used string simplification process is simple:
This process works just fine for almost all cases, but in some situtation it fails, such as the letter
ıdoes not have any accent, but used a lot in Turkish language and clearly corresponds to the letteriin ASCII, but since it does not have accent, it gets lost in the process.We need to find a way to support such characters.