What China's Vaccine Procurement Data Reveals: A Province‑Level Analysis
This article documents the collection, cleaning, and statistical analysis of publicly released second‑category vaccine procurement data from 28 Chinese provinces, highlighting data sources, processing steps with pandas, top manufacturers, regional market shares, and the challenges encountered during the effort.
Background
Public procurement data for second‑category vaccines were collected from the official websites of 28 Chinese provinces (excluding Hong Kong, Macau, Taiwan, Shandong, Qinghai, and Tibet). All records were uploaded to a public GitHub repository for reuse.
Data‑collection methodology
Search terms such as “省名 第二类疫苗 采购” (province name plus “second‑category vaccine procurement”) and “省名 第二类疫苗 公示” (province name plus “second‑category vaccine public notice”) were used on provincial procurement portals. Common obstacles included login‑required sites, broken links, image‑only pages, and missing documents for several regions. The extracted records were saved as CSV files with the following columns:
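通用名 (name): generic name of the vaccine
来源 (src): source of the record
生产企业 (create_company): manufacturer
申报企业 (report_company): declaring company; defaults to the manufacturer if empty
省 (prov): province
中标年份 (year): year the bid was awarded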
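The column layout above, including the rule that an empty report_company falls back to the manufacturer, could be applied at load time roughly as follows. This is a minimal sketch, not the author's code: it assumes the CSV headers are the short English names in parentheses, the helper `load_procurement_csv` is hypothetical, and the sample rows are invented purely for illustration.

```python
import io

import pandas as pd

# Assumed header names, taken from the parenthesised labels in the column list.
COLUMNS = ["name", "src", "create_company", "report_company", "prov", "year"]


def load_procurement_csv(source):
    """Read one province's CSV and apply the report_company default (hypothetical helper)."""
    # Read everything as text and keep empty cells as "" rather than NaN.
    df = pd.read_csv(source, dtype=str, keep_default_na=False)
    missing = [c for c in COLUMNS if c not in df.columns]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    # An empty declaring company defaults to the manufacturer.
    blank = df["report_company"].str.strip() == ""
    df.loc[blank, "report_company"] = df.loc[blank, "create_company"]
    # Unparseable years become <NA> instead of raising.
    df["year"] = pd.to_numeric(df["year"], errors="coerce").astype("Int64")
    return df[COLUMNS]


# Invented sample rows, not taken from the real dataset.
sample = io.StringIO(
    "name,src,create_company,report_company,prov,year\n"
    "Vaccine A,http://example.com/a,Maker X,,Province 1,2017\n"
    "Vaccine B,http://example.com/b,Maker Y,Agent Z,Province 1,2018\n"
)
df = load_procurement_csv(sample)
print(df[["create_company", "report_company", "year"]])
```

Reading every column as text first avoids pandas silently turning blank cells into NaN, which would make the empty-string check miss them.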
Data processing and descriptive statistics
All CSV files were merged with pandas and visualised using ECharts. The combined dataset contains 1,529 rows covering 167 distinct manufacturers. Descriptive statistics for the number of entries per manufacturer are:
count 167.000000
mean 9.143713
std 15.839281
min 1.000000
25% 1.000000
50% 3.000000
75% 9.000000
max 98.000000
The distribution is heavily right‑skewed: the median manufacturer has 3 entries while the mean is about 9.1, so a small number of large manufacturers account for a disproportionate share of the procurement entries.
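A summary of this shape can be produced by stacking the per-province tables and describing the per-manufacturer entry counts. The following is a minimal sketch under stated assumptions: the manufacturer column is assumed to be named create_company, the helper names are hypothetical, and the toy frames stand in for the real CSV files.

```python
import pandas as pd


def merge_frames(frames):
    """Stack per-province DataFrames into one table with a fresh index."""
    return pd.concat(frames, ignore_index=True)


def entries_per_manufacturer(df):
    """Count rows per manufacturer, most frequent first."""
    return df["create_company"].value_counts()


# Toy frames standing in for per-province CSV files (invented values).
frames = [
    pd.DataFrame(
        {"create_company": ["Maker X", "Maker X", "Maker Y"], "prov": "Province 1"}
    ),
    pd.DataFrame({"create_company": ["Maker X", "Maker Z"], "prov": "Province 2"}),
]

merged = merge_frames(frames)
counts = entries_per_manufacturer(merged)

# describe() reports count, mean, std, min, quartiles and max of the counts.
summary = counts.describe()
print(summary)
```

In the summary, `count` is the number of distinct manufacturers, not the number of rows, and a mean well above the median (`50%`) is the signature of a right-skewed distribution.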
Top manufacturers (by number of vaccine entries)
北京科兴生物制品有限公司 – 98
长春长生生物科技有限责任公司 – 91
华兰生物疫苗有限公司 – 72
玉溪沃森生物技术有限公司 – 57
上海生物制品研究所有限责任公司 – 56
大连雅立峰生物制药有限公司 – 51
长春生物制品研究所有限责任公司 – 50
华北制药金坦生物技术股份有限公司 – 46
辽宁成大生物股份有限公司 – 44
北京智飞绿竹生物制药有限公司 – 43
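A ranking like the one above can be read directly off the per-manufacturer counts. This sketch assumes the manufacturer column is named create_company; `top_manufacturers` is a hypothetical helper and the data is invented.

```python
import pandas as pd


def top_manufacturers(df, n=10):
    """Return the n manufacturers with the most entries (hypothetical helper)."""
    # value_counts() sorts by frequency in descending order.
    return df["create_company"].value_counts().head(n)


# Invented toy data, not the real dataset.
toy = pd.DataFrame(
    {
        "create_company": [
            "Maker X", "Maker X", "Maker X",
            "Maker Y", "Maker Y",
            "Maker Z",
        ]
    }
)

top = top_manufacturers(toy, n=2)
for company, entries in top.items():
    print(f"{company}: {entries}")
```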
Provincial market share of major suppliers
The share of leading manufacturers (e.g., 长春长生) in the 14 provinces with available data ranges from roughly 10% to 18%. The values below are percentages:
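The published figures follow below as a list of name/value pairs. The source does not spell out how they were calculated, so the following is only a sketch of one plausible reading: the share is taken to be the percentage of a province's entries that belong to a given manufacturer. The column names create_company and prov are assumed, `provincial_share` is a hypothetical helper, and the data is invented.

```python
import pandas as pd


def provincial_share(df, company):
    """Percentage of each province's entries that come from `company`."""
    is_company = df["create_company"] == company
    # The mean of a boolean column within each province is the fraction of matches.
    share = is_company.groupby(df["prov"]).mean() * 100
    # Emit name/value pairs, the shape used by the list in the article.
    return [{"name": prov, "value": float(value)} for prov, value in share.items()]


# Invented toy data, not the real dataset.
toy = pd.DataFrame(
    {
        "prov": ["Province 1"] * 4 + ["Province 2"] * 4,
        "create_company": [
            "Maker X", "Maker X", "Maker Y", "Maker Y",
            "Maker X", "Maker Z", "Maker Z", "Maker Z",
        ],
    }
)

shares = provincial_share(toy, "Maker X")
print(shares)
```

A province where the manufacturer has no entries still appears with a value of 0 under this approach, which keeps the result aligned with the full province list.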
[{'name': '天津', 'value': 14.0625},
{'name': '辽宁', 'value': 17.647058823529413},
{'name': '黑龙江', 'value': 13.274336283185843},
{'name': '安徽', 'value': 9.75609756097561},
{'name': '新疆', 'value': 16.94915254237288},
{'name': '江苏', 'value': 14.563106796116504},
{'name': '江西', 'value': 12.179487179487179},
{'name': '山西', 'value': 16.52173913043478},
{'name': '吉林', 'value': 17.857142857142858},
{'name': '贵州', 'value': 14.17910447761194},
{'name': '广东', 'value': 13.445378151260504},
{'name': '四川', 'value': 16.93548387096774},
{'name': '宁夏', 'value': 17.5},
{'name': '海南', 'value': 14.285714285714285}]
Limitations
The dataset is incomplete; many provinces lack data because their procurement portals were inaccessible or the documents were unavailable. The current release represents roughly half of the intended coverage.
Further work
Additional provinces will be processed and the updated CSV files will be pushed to the GitHub repository when they become available.
ITPUB
