Skip to content

Simplification On Chars That Does Not Have Accent #1

@erthium

Description

@erthium

Currently used string simplification process is simple:

  • Normalise all character with NFKD, which removes accents from characters and create 2 different chars.
  • Remove all characters that are not in ASCII range.

This process works just fine for almost all cases, but in some situtation it fails, such as the letter ı does not have any accent, but used a lot in Turkish language and clearly corresponds to the letter i in ASCII, but since it does not have accent, it gets lost in the process.

We need to find a way to support such characters.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workinggood first issueGood for newcomers

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions