UTF-8 files are not decoded before searching

```bash
$ cat 'fooಠbar' > test
$ ack 'foo.bar' test
# ack 'foo..bar' test
$ ack 'foo...bar' test
fooಠbar
```

It would appear that the file is treated as UTF-8 bytes, so attempts to match a non-ascii character against `.` are not successful if the utf8 representation uses more than one octet.

Should files be assumed to be UTF-8 encoded? Or perhaps just when characters are seen that are not in the `[\x00-\x7F]` range?  Or should a command-line argument be required to specify the encoding?




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UTF-8 files are not decoded before searching #396

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

UTF-8 files are not decoded before searching #396

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions