Skip to content

Newer tlds not supported #5057

@armijnhemel

Description

@armijnhemel

Description

The current e-mail regular expression cannot catch the latest (and greatest?) TLDs. There are many these days:

https://en.wikipedia.org/wiki/List_of_Internet_top-level_domains

but scancode limits it to something either 2, 3 or 4 characters:

def emails_regex():
    return re.compile('\\b[A-Z0-9._%-]+@[A-Z0-9.-]+\\.[A-Z]{2,4}\\b', re.IGNORECASE)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions