$ cat 'fooಠbar' > test
$ ack 'foo.bar' test
# ack 'foo..bar' test
$ ack 'foo...bar' test
fooಠbar
It would appear that the file is treated as UTF-8 bytes, so attempts to match a non-ascii character against . are not successful if the utf8 representation uses more than one octet.
Should files be assumed to be UTF-8 encoded? Or perhaps just when characters are seen that are not in the [\x00-\x7F] range? Or should a command-line argument be required to specify the encoding?
It would appear that the file is treated as UTF-8 bytes, so attempts to match a non-ascii character against
.are not successful if the utf8 representation uses more than one octet.Should files be assumed to be UTF-8 encoded? Or perhaps just when characters are seen that are not in the
[\x00-\x7F]range? Or should a command-line argument be required to specify the encoding?