Logo
Gary's VAERS effort

Gary's early efforts to uncover hidden and alarming data relating to COVID-19 era vaccination schedules / VAERS reporting and omissions...


univaers.com — Annotated File Inventory

Source: https://vaersthedata.org/Downloads

Context: A researcher-built consolidation of weekly VAERS (Vaccine Adverse Event Reporting System) public data drops from HHS/CDC. Tracks changes between drops — deletions, restorations, cell edits, field imputation, and reports submitted to VAERS that may have been later removed or changed.


Core Flatfile (download/flatfile/)

File Size Records/Lines Description What You Can Find
CSV_VAERS_FLATFILE.zip 328 MB ~900K+ reports Full consolidated VAERS dataset, all drops merged into one CSV Complete adverse event records: symptoms, outcomes, demographics, lot numbers, dates — the master dataset for any VAERS analysis
XLSX_VAERS_FLATFILE.zip 429 MB Same Same flatfile in Excel format Same as above; useful for tools that don't handle CSV well
stats.csv 32 KB ~148 drops Per-drop change statistics: deletions, restorations, gap-fills, never-published counts, cell edits by field How VAERS data has changed week-over-week; quantify deletions and edits across the full publication history
all_ever_seen.txt 16 MB ~1.8M lines Every VAERS ID ever observed across all drops Reconstruct which report IDs existed at any point; identify reports that appeared then vanished
never_published.txt 282 KB 32,222 IDs VAERS IDs received by CDC but never appearing in any public release Size and identity of the "dark pool" of suppressed/withheld reports
symptoms_deduped.txt 2.2 MB ~253K lines De-duplicated MedDRA symptom strings extracted from the flatfile Canonical symptom vocabulary; use for NLP, search, or frequency analysis
2023-12-29_run_output.txt 81 KB Latest build log (Dec 2023) Data provenance, what changed in the final captured drop
HTM_run_output_vaers_flatfile_large_earlier_main.htm 5.0 MB HTML-rendered run output for earlier (pre-Jul 2023) drops Historical change log for the earlier collection period
py_vaers_flatfile_build.txt 140 KB Python source code for building the flatfile Methodology transparency; reproduce or extend the consolidation pipeline

Weekly Run Outputs (download/flatfile/run_outputs_other/)

File Size Description
2023-07-28_run_output.txt 15 KB Build log for Jul 28 2023 drop
2023-08-04_run_output.txt 14 KB Build log for Aug 4 2023 drop
2023-08-11_run_output.txt 15 KB Build log for Aug 11 2023 drop
2023-08-18_run_output.txt 15 KB Build log for Aug 18 2023 drop
2023-08-25_run_output.txt 16 KB Build log for Aug 25 2023 drop
2023-09-01_run_output.txt 19 KB Build log for Sep 1 2023 drop
2023-09-08_run_output.txt 10 KB Build log for Sep 8 2023 drop
2023-09-15_run_output.txt 16 KB Build log for Sep 15 2023 drop
2023-09-22_run_output.txt 9.7 KB Build log for Sep 22 2023 drop
2023-09-29_run_output.txt 8.5 KB Build log for Sep 29 2023 drop
2023-10-27_run_output.txt 11 KB Build log for Oct 27 2023 drop
2023-11-24_run_output.txt 53 KB Build log for Nov 24 2023 drop

Enrichment / Lookup Tables (download/)

File Size Records Description What You Can Find
country_codes.csv 3.7 KB 253 ISO 2-letter country codes Decode the foreign-country entries in the VAERS STATE field (many international reports use country codes)
expiration_dates.csv 222 KB 3,937 Vaccine lot → expiry date, manufacturer, source Cross-reference lot numbers against known expiry windows; flag expired-product administration events
filled_genders.csv 55 KB 3,486 VAERS IDs where gender was U (unknown) but was imputed from free-text narrative Recover sex for otherwise-unknown records; reduces bias in sex-stratified analyses
filled_states.csv 34 MB 888,625 VAERS IDs where state was parsed from the SPLTTYPE field (manufacturer split-type codes) Recover geographic location for records missing the STATE field — critical for geographic clustering

Ages Sample (download/ages/)

File Size Description What You Can Find
vaers_jonf_garyha_sample_32k_v1.xlsx 19 MB 32K-record sample with age analysis (collaborative work by researchers "jonf" and "garyha") Age-distribution patterns in adverse event reports; methodology for age imputation

Lot Cleaning (download/lot_cleaning/)

File Size Description What You Can Find
hawk_lots_cleaned_2023-07-16.zip 63 MB Normalized/cleaned vaccine lot number strings from the full VAERS dataset Standardized lot numbers for lot-level signal detection; raw VAERS lot strings are inconsistently formatted and this resolves them

Special Analyses (download/special/)

Drop-Specific Event Files

File Size Records Description What You Can Find
2023-07-28_new_reports.csv 1.8 MB 2,657 Reports appearing for the first time in the Jul 28 2023 drop Late-added reports; characterize the lag between event and VAERS entry
2023-08-18_new_reports.csv 1.5 MB 2,502 New reports in the Aug 18 2023 drop Same; compare new-report volume across drops
2023-10-27_deleted.csv 382 KB 114 Full record detail for reports deleted from VAERS on Oct 27 2023 What kinds of reports get deleted — severity, symptoms, outcomes of removed records
2023-10-27_symptom_entry_counts.xlsx 337 KB Symptom frequency counts for the Oct 2023 drop Ranked symptom occurrence; identify top signals in the most recent complete drop

Lot-Specific Deep Dives

File Size Records Description What You Can Find
hawk_FD0810.csv 858 KB 942 All VAERS reports referencing Pfizer lot FD0810 Lot-level adverse event profile; signal assessment for a single production batch
hawk_EL9262_and_EL9264.xlsx 2.3 MB Reports for Moderna lots EL9262 and EL9264 Comparative lot signal; two Moderna batches side by side
032H20A.xlsx 102 KB Reports for Moderna lot 032H20A Single-lot event profile
212C21A_and_213C21A.xlsx 197 KB Reports for Moderna lots 212C21A and 213C21A Sequential lot comparison
SV40_FL0007.csv 1.3 MB 1,953 Reports for Pfizer lot FL0007, framed in SV40 contamination context Adverse events associated with a lot flagged in SV40 promoter sequence contamination discussions
XLSX_Moderna_FOIA_Missing_Lots.zip 6.4 MB Moderna lots identified via FOIA that are absent from public VAERS releases Lots with adverse events that appear in manufacturer safety files but not in public VAERS

Clinical / Condition-Focused

File Size Records Description What You Can Find
myocard_thru_march_2022.xlsx 25 MB Myocarditis/pericarditis VAERS reports through March 2022 Cardiac inflammatory signal over time; age/sex breakdown; dose correlation
2023_10-27_cancer.xlsx 15 MB Cancer-coded VAERS reports through Oct 2023 Malignancy signal in VAERS; identify cancer-related MedDRA terms and their temporal patterns
2023-10-27_fetus_all_likely.xlsx 8.4 MB Fetal/pregnancy-outcome reports through Oct 2023 Miscarriage, stillbirth, fetal abnormality signals; pregnancy outcome data
foetal_cytopenia_etc.xlsx 1.8 MB Fetal cytopenia, thrombocytopenia, and related hematologic events Neonatal blood disorder signals; rare but specific adverse events
vaers_symptoms_female_fertility_and_baby.xlsx 11 KB Symptom list for female reproductive / infant adverse events Curated MedDRA term list for reproductive signal queries
XLSX_univaers_female_and_baby_reports.xslx.zip 118 MB Full report dataset: female reproductive and infant adverse events Large-scale reproductive/neonatal signal dataset
hawk_endotoxins.csv 7.7 MB 1,193 Reports with endotoxin-related symptom patterns Contamination-type signal; febrile reactions potentially consistent with endotoxin exposure
sv40.csv 390 KB ~143 Reports mentioning SV40 or related terms (CSV format) Subset of reports tied to SV40 promoter contamination concern in Pfizer bivalent lots
sv40.xlsx 102 KB ~143 Same SV40-flagged reports in Excel format Same as above; easier to filter/pivot in Excel
hawk_vaers_other_serious.xlsx 204 MB All serious non-death VAERS reports Hospitalization, disability, life-threatening events excluding deaths — the broadest serious-outcomes dataset here
vaers_hawk_serious_v_not.xlsx 204 MB Reports classified as serious vs. non-serious Train classifiers; analyze what features predict seriousness; compare outcomes across severity strata
plane_crashes_from_1-2020_uncompleted.xlsx 203 KB Cross-reference to aviation incidents from Jan 2020; marked uncompleted Occupational/transportation safety signals; may cross-reference pilot or aircrew VAERS reports

2022-11-11 Purge Details (download/special/2022-11-11_purge_details/)

File Size Description What You Can Find
vaers_with_blankings_2022-11-04_to_2022-11-11.xlsx 219 MB Cell-level diff between two consecutive drops — every field that was blanked or changed Which specific data fields CDC removed or overwrote in a single week; the scale and pattern of data suppression
df_11_deleted.csv 4.6 MB Full records for reports deleted in the Nov 11 2022 drop Profile of what was removed: were deleted reports more severe? Different demographics or lots?
run_output.txt 2.4 KB Summary output of the purge analysis run Quick stats on the Nov 11 2022 purge event
PY_rename_it_vaers_purging_info.txt 17 KB Python source for the purge analysis Replicate the field-blanking detection methodology

Never-Published Reports (download/special/never_published/)

File Size Description What You Can Find
any_never_published.zip 100 KB All report IDs received by VAERS but never appearing in any public drop The suppressed-report ID list; 32,222 reports confirmed received but not released
any_ever_seen.zip 4.2 MB All IDs observed in any drop The full universe of observed VAERS IDs for comparison against the never-published list
per_drop_never_published.csv 2.7 KB Count of never-published reports per weekly drop How many reports were withheld per release cycle; trend over time
run_output_NEVER_PUBLISHED_analysis.txt 364 KB Detailed analysis log of the never-published detection methodology Full methodological audit; quantified breakdown by drop
PY_code_vaers_NEVER_PUBLISHED.txt 35 KB Python source code for the never-published analysis Reproduce or extend the never-published detection independently

Summary: Research Questions and Key Files

Research Question Key Files
Overall signal detection across all vaccines CSV_VAERS_FLATFILE.zip, symptoms_deduped.txt
Week-over-week data integrity / deletion tracking stats.csv, run_outputs_other/, 2023-10-27_deleted.csv
Geographic distribution of adverse events filled_states.csv, country_codes.csv
Lot-level safety signals hawk_lots_cleaned_2023-07-16.zip, hawk_FD0810.csv, hawk_EL9262_and_EL9264.xlsx, expiration_dates.csv
Myocarditis / cardiac outcomes myocard_thru_march_2022.xlsx, hawk_vaers_other_serious.xlsx
Reproductive / fetal outcomes 2023-10-27_fetus_all_likely.xlsx, foetal_cytopenia_etc.xlsx, XLSX_univaers_female_and_baby_reports.xslx.zip
Cancer signals 2023_10-27_cancer.xlsx
Reports withheld from public release any_never_published.zip, per_drop_never_published.csv
Scale and nature of data purges vaers_with_blankings_2022-11-04_to_2022-11-11.xlsx, df_11_deleted.csv
SV40 contamination concern sv40.csv, sv40.xlsx, SV40_FL0007.csv
Moderna FOIA-disclosed missing lots XLSX_Moderna_FOIA_Missing_Lots.zip
Sex/gender imputation for unknown records filled_genders.csv
Age-distribution analysis vaers_jonf_garyha_sample_32k_v1.xlsx
Endotoxin / contamination-type reactions hawk_endotoxins.csv
Comments

Leave a Comment


Approved Comments (0)

No comments yet.