Skip to content

fix: read validator text files as utf-8#3306

Open
Ghraven wants to merge 1 commit into
openai:mainfrom
Ghraven:fix/validator-txt-utf8
Open

fix: read validator text files as utf-8#3306
Ghraven wants to merge 1 commit into
openai:mainfrom
Ghraven:fix/validator-txt-utf8

Conversation

@Ghraven
Copy link
Copy Markdown

@Ghraven Ghraven commented May 24, 2026

Summary

  • read .txt files in the fine-tuning data validator with explicit UTF-8 encoding

Before / after

Before, the .txt conversion path used the platform default text encoding when reading user-provided text files.
After, it reads text files as UTF-8 explicitly, matching the validator's JSONL/CSV-style text data expectations and avoiding locale-dependent behavior.

Verification

  • python -m py_compile src/openai/lib/_validators.py
  • git diff --check

@Ghraven Ghraven requested a review from a team as a code owner May 24, 2026 17:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant