@kyleconroy
Collaborator

Add isClickHouseWhitespace() to recognize characters that ClickHouse
treats as whitespace but Go's unicode.IsSpace() does not: U+FEFF BOM,
U+180E Mongolian Vowel Separator, U+200B Zero Width Space, U+200C Zero
Width Non-Joiner, U+200D Zero Width Joiner, and U+2060 Word Joiner.

This fixes the 01280_unicode_whitespaces_lexer test case stmt3.

@kyleconroy kyleconroy merged commit 3adedd2 into main Dec 28, 2025
1 check passed
3 participants