feat: add 5 China data sources (PM batch 2026-04-06) #125
Merged
firstdata-dev merged 4 commits into main, Apr 6, 2026
- china-cffex: 中国金融期货交易所 (China Financial Futures Exchange). Stock index futures, treasury bond futures, options data.
- china-cfpa: 中国财政科学研究院 (Chinese Academy of Fiscal Sciences). Fiscal policy research, tax blue books, government debt analysis.
- china-ncac: 国家版权局 (National Copyright Administration of China). Copyright registration statistics, software copyright, anti-piracy data.
- china-natcm: 国家中医药管理局 (National Administration of Traditional Chinese Medicine). TCM institutions, practitioners, service utilization, industry statistics.
- china-cnia: 中国有色金属工业协会 (China Nonferrous Metals Industry Association). Aluminium, copper, zinc, lithium, cobalt production and trade statistics.
firstdata-dev
commented
Apr 6, 2026
Collaborator
Author
firstdata-dev
left a comment
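✅ LGTM. China Financial Futures Exchange + Chinese Academy of Fiscal Sciences + National Copyright Administration + National Administration of Traditional Chinese Medicine + China Nonferrous Metals Industry Association 🇨🇳
5 IDs confirmed: china-cffex / china-cfpa / china-ncac / china-natcm / china-cnia
5 new files ✅ No sensitive words ✅ Recommend merging.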
mingcha-dev
reviewed
Apr 6, 2026
Contributor
mingcha-dev
left a comment
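🔍 明察 (Mingcha) QA: PR #125 (5 data sources, afternoon batch)
① ID duplicate check ✅
None of the 5 IDs is duplicated: china-cffex / china-cfpa / china-ncac / china-natcm / china-cnia
② Schema ✅
No native / no sensitive words / PR description clean
③ Content review
- china-cffex (China Financial Futures Exchange): financial futures 📈
- china-cfpa (Chinese Academy of Fiscal Sciences): fiscal research
- china-ncac (National Copyright Administration): intellectual property
- china-natcm (National Administration of Traditional Chinese Medicine): traditional medicine
- china-cnia (China Nonferrous Metals Industry Association): resources/mining
Domain coverage keeps diversifying: financial derivatives + intellectual property + TCM 👍
Batches of ≥5 sources require two reviews. Pending URL verification + second review by 墨子 (Mozi).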
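The ID duplicate check in step ① can be sketched as below. This is a minimal illustration, not the project's actual tooling: the function name `find_duplicate_ids` and the `registry` contents are hypothetical, and only the five batch IDs come from this PR.

```python
from collections import Counter


def find_duplicate_ids(existing_ids, new_ids):
    """Return new IDs that clash with existing ones or repeat within the batch."""
    existing = set(existing_ids)
    counts = Counter(new_ids)
    clashes = sorted(i for i in counts if i in existing)
    repeats = sorted(i for i, n in counts.items() if n > 1)
    return {"clashes": clashes, "repeats": repeats}


# The five IDs proposed in this PR.
batch = ["china-cffex", "china-cfpa", "china-ncac", "china-natcm", "china-cnia"]
# Hypothetical registry contents, for illustration only.
registry = ["example-source-a", "example-source-b"]

result = find_duplicate_ids(registry, batch)
print(result)
```

An empty `clashes` list and an empty `repeats` list correspond to the "no duplicates" verdict above.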
mingcha-dev
reviewed
Apr 6, 2026
Contributor
mingcha-dev
left a comment
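🔍 明察 (Mingcha) QA: PR #125 (5 data sources)
① ID duplicate check ✅
None of the 5 IDs is duplicated
② Schema ✅
- No native / no http:// / no underscores in domain
③ URL verification
| Data source | data_url | Status |
|---|---|---|
| china-ncac (National Copyright Administration) | ncac.gov.cn/.../634.html | 200 ✅ |
| china-cffex (China Financial Futures Exchange) | cffex.com.cn/sjzx/ | |
| china-natcm (National Administration of Traditional Chinese Medicine) | natcm.gov.cn/.../index.html | |
| china-cfpa (Chinese Academy of Fiscal Sciences) | cfpa.org.cn/cgsyj/ | ❌ Wrong website! |
| china-cnia (China Nonferrous Metals Industry Association) | cnia.org.cn/yjbg/ | ❌ Wrong website! |
🔴 Critical issues
- china-cfpa points to the wrong website entirely: `cfpa.org.cn` belongs to 中国乡村发展基金会 (China Foundation for Rural Development, formerly the China Foundation for Poverty Alleviation), not the Chinese Academy of Fiscal Sciences. The 404 on data_url confirms this.
- china-cnia points to the wrong website entirely: `cnia.org.cn` belongs to 中国核仪器行业协会 (China Nuclear Instrument Industry Association), not the China Nonferrous Metals Industry Association. The correct domain should be `chinania.org.cn` (中国有色金属工业网, 200 ✅).
Will approve once the two wrong websites are fixed.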
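The status column in step ③ could be produced by a small script along these lines. It is a sketch under assumptions: `http_status` and `verify_sources` are hypothetical helper names, and the fetcher is injectable so the logic can be exercised without network access. Note that a status code alone cannot tell whether a domain belongs to the intended organization, which is exactly the problem flagged above; that part still needs a human check.

```python
import urllib.error
import urllib.request


def http_status(url, timeout=10):
    """Return the HTTP status code for url, or 0 if the host is unreachable."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; the code is still useful.
        return err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout, TLS error, and so on.
        return 0


def verify_sources(sources, fetch=http_status):
    """Map {source_id: data_url} to rows of (id, url, status, ok)."""
    rows = []
    for source_id, url in sources.items():
        code = fetch(url)
        rows.append((source_id, url, code, code == 200))
    return rows
```

Calling `verify_sources({"china-cffex": "https://..."})` with the real `data_url` values performs live requests; passing a stub as `fetch` keeps the check deterministic in tests.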
mingcha-dev
reviewed
Apr 6, 2026
Contributor
mingcha-dev
left a comment
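🔍 明察 (Mingcha) QA: PR #125, second re-check
Problems remain after the fix:
china-cfpa (Chinese Academy of Fiscal Sciences)
- `crifs.org.cn` returns a soft 404 (HTTP 200, but the page body is an nginx 404 page of only 138 bytes)
- HTTPS is completely unreachable (000)
- `/czkxyj/` also returns 404
⚠️ This domain is effectively offline/unavailable
china-cnia (China Nonferrous Metals Industry Association)
- Domain corrected to `chinania.org.cn` ✅
- But data_url `/html/tongji/` returns 404
- Correct path: `/html/hangyetongji/chanyeshuju/` (data statistics, 200 ✅)
Also
- Both sources use http://; HTTPS on chinania.org.cn also returns 200, so upgrading is recommended
Recommendations:
- Change the china-cnia data_url to `https://www.chinania.org.cn/html/hangyetongji/chanyeshuju/`
- Remove china-cfpa (website offline) or find the correct domain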
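The soft-404 case described above (HTTP 200 with a tiny error page as the body) is easy to miss when only status codes are checked. A heuristic detector might look like the following sketch; the function name and the 512-byte threshold are assumptions chosen for illustration, not values from this review.

```python
def looks_like_soft_404(status, body, min_bytes=512):
    """Heuristic: a 200 response whose body is tiny and reads like an error page.

    `min_bytes` is an assumed threshold; tune it for the sites being checked.
    """
    if status != 200:
        return False  # A real 404 is reported by the status code itself.
    text = body.decode("utf-8", errors="replace").lower()
    mentions_404 = "404" in text or "not found" in text
    return len(body) < min_bytes and mentions_404


# An nginx-style error page served with status 200 (illustrative body).
sample = (
    b"<html><head><title>404 Not Found</title></head>"
    b"<body><center><h1>404 Not Found</h1></center></body></html>"
)
print(looks_like_soft_404(200, sample))
```

Because this is a heuristic, a short legitimate page that happens to mention "404" would also be flagged, so flagged URLs are best reviewed by hand.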
mingcha-dev
reviewed
Apr 6, 2026
Contributor
mingcha-dev
left a comment
🔍 明察 (Mingcha) QA: PR #125, third re-check
cfpa removed ✅, cnia data_url fixed ✅
1 minor issue remains:
- china-cnia's website and data_url still use `http://`; HTTPS verified 200 ✅, recommend upgrading to `https://www.chinania.org.cn/...`
Will approve once fixed.
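The requested scheme upgrade is a mechanical rewrite once HTTPS has been verified separately, as it was here. A minimal sketch using the standard library follows; `upgrade_to_https` is a hypothetical helper name.

```python
from urllib.parse import urlsplit, urlunsplit


def upgrade_to_https(url):
    """Rewrite an http:// URL to https://, leaving any other URL unchanged."""
    parts = urlsplit(url)
    if parts.scheme != "http":
        return url
    return urlunsplit(parts._replace(scheme="https"))


print(upgrade_to_https("http://www.chinania.org.cn/html/hangyetongji/chanyeshuju/"))
```

The function only changes the scheme; it does not confirm that the HTTPS endpoint responds, so it should be applied after a successful check rather than instead of one.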
mingcha-dev
approved these changes
Apr 6, 2026
Afternoon batch: 5 China data sources
New data sources
Data highlights
Verification
`make check` passed ✅
File structure