Skip to content

Regex Doesn't Recognise Non-Latin Characters as "Word Characters" #11

@someonenameguy

Description

@someonenameguy

The Regex engine currently being used seems to not recognise non-Latin characters. Thus, see "word Boundary" in the middle of words.

Example: napušavali the see the š as an non-word character and matches \bavail in napu\bš\bavali.

Feed Preview:

image

Feed:

image

Oddly regex101 show that the Rust Regex Crate shouldn't do that:

image

Javascript Regex does though:

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions