Skip to content

Add Korean alternative date expressions#1289

Merged
serhii73 merged 3 commits into
scrapinghub:masterfrom
seungwoo-ji-03:feat/add_Korean_alternative_date_expressions
Jun 9, 2026
Merged

Add Korean alternative date expressions#1289
serhii73 merged 3 commits into
scrapinghub:masterfrom
seungwoo-ji-03:feat/add_Korean_alternative_date_expressions

Conversation

@seungwoo-ji-03

Copy link
Copy Markdown
Contributor

This PR adds support for formal and colloquial Korean date expressions to improve parsing coverage.

Formal expressions (격식체):

  • 금일 - formal variant of 오늘 (today)
  • 작일 - formal variant of 어제 (yesterday)
  • 명일 - formal variant of 내일 (tomorrow)

Colloquial expressions (구어체):

  • - colloquial alternative to 개월 (month)
    • e.g., 3달 전 (3 months ago), 2달 후 (in 2 months)

@Gallaecio

Gallaecio commented Oct 27, 2025

Copy link
Copy Markdown
Contributor

I see no YAML file edited, which makes me think you might have not seen https://dateparser.readthedocs.io/en/latest/contributing.html#guidelines-for-editing-translation-data.

@seungwoo-ji-03 seungwoo-ji-03 force-pushed the feat/add_Korean_alternative_date_expressions branch from 2a72767 to 9175252 Compare October 27, 2025 15:15
@seungwoo-ji-03

seungwoo-ji-03 commented Oct 27, 2025

Copy link
Copy Markdown
Contributor Author

I see no YAML file edited, which makes me think you might have not seen https://dateparser.readthedocs.io/en/latest/contributing.html#guidelines-for-editing-translation-data.

My bad, I updated files

Resolve conflict in the generated dateparser/data/date_translation_data/ko.py
by regenerating it from the supplementary ko.yaml with the current generator.
Master moved the relative-type-regex digit quantifiers to the possessive form
(\d++[.,]?\d*+), so the new "달" (month) patterns are now emitted in that same
form, keeping the generated data consistent with the rest of master.

Verified:
- generator reproduces all other committed data files byte-for-byte
- test_dateparser_data_integrity passes (committed data == generator output)
- full test suite passes (24055 passed, 18 skipped, 1 xfailed)
- ruff check / format clean

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@codecov

codecov Bot commented Jun 9, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 97.11%. Comparing base (98b9c32) to head (99e2021).

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #1289   +/-   ##
=======================================
  Coverage   97.11%   97.11%           
=======================================
  Files         235      235           
  Lines        2909     2909           
=======================================
  Hits         2825     2825           
  Misses         84       84           

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@serhii73

serhii73 commented Jun 9, 2026

Copy link
Copy Markdown
Collaborator

Thanks!

@serhii73 serhii73 merged commit a049fd1 into scrapinghub:master Jun 9, 2026
15 checks passed
@serhii73 serhii73 mentioned this pull request Jun 15, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants