GH-35806: [R] Improve error message for null type inference with sparse CSV data by thisisnic · Pull Request #49338 · apache/arrow

thisisnic · 2026-02-19T09:31:51Z

Rationale for this change

When reading a CSV with sparse data (many missing values followed by actual values), Arrow can infer a column type as null based on the first block of data. When non-null values appear later, the error message incorrectly suggests using skip = 1 for header rows, which is misleading.

What changes are included in this PR?

Adds a specific check for "conversion error to null" that provides a helpful message explaining the cause (type inference from sparse data) and the solution (specify column types explicitly via col_types or schema).

Are these changes tested?

Yes, added a test in test-dataset-csv.R.

Are there any user-facing changes?

Yes, improved error message when CSV type inference fails due to sparse data.

This PR was authored by Claude (Opus 4.5) and reviewed by @thisisnic.

🤖 Generated with Claude Code

GitHub Issue: [R] Error message caused by reading sparsely populated data is misleading #35806

…h sparse CSV data When a CSV column contains only missing values in the first block of data, Arrow infers the type as null. If a non-null value appears later, the conversion fails with an unhelpful error suggesting `skip = 1`. This change adds a specific check for "conversion error to null" and provides a more helpful message explaining the cause (type inference from sparse data) and the solution (specify column types explicitly). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

github-actions · 2026-02-19T09:32:16Z

⚠️ GitHub issue #35806 has been automatically assigned in GitHub to PR creator.

thisisnic · 2026-02-19T13:50:27Z

I'm not totally happy with the error message, will rewrite before marking ready for review

thisisnic requested a review from jonkeane as a code owner February 19, 2026 09:31

github-actions bot added Component: R awaiting committer review Awaiting committer review labels Feb 19, 2026

thisisnic marked this pull request as draft February 19, 2026 13:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-35806: [R] Improve error message for null type inference with sparse CSV data#49338

GH-35806: [R] Improve error message for null type inference with sparse CSV data#49338
thisisnic wants to merge 1 commit intoapache:mainfrom
thisisnic:GH-35806-null-type-error-message

thisisnic commented Feb 19, 2026 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

thisisnic commented Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

thisisnic commented Feb 19, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

thisisnic commented Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

thisisnic commented Feb 19, 2026 •

edited by github-actions bot

Loading