|
1 | | -# GP2-Genome-Browser |
| 1 | +# The Global Parkinson’s Disease Genetics (GP2) Genome Browser |
| 2 | + |
| 3 | +`GP2 ❤️ Open Science 😍` |
| 4 | + |
| 5 | +[](https://opensource.org/licenses/MIT) |
| 6 | + |
| 7 | + |
| 8 | +**Last Updated:** December 2025 |
| 9 | + |
| 10 | +--- |
| 11 | + |
| 12 | +## Summary |
| 13 | + |
| 14 | +This repository accompanies the brief report **“The Global Parkinson’s Disease Genetics (GP2) Genome Browser.”** |
| 15 | + |
| 16 | +The GP2 Genome Browser is an open-access platform that provides gene- and variant-level information from one of the largest ancestry-diverse sequencing datasets assembled for Parkinson’s disease (PD) research. |
| 17 | + |
| 18 | +The browser integrates: |
| 19 | + |
| 20 | +- **31,665 whole-genome sequences (WGS)** |
| 21 | +- **9,559 clinical exomes (CES)** |
| 22 | +- **11 genetically determined ancestries** |
| 23 | + |
| 24 | +using a unified joint-calling, QC, and annotation pipeline (DeepVariant → GLnexus → GenoTools → VEP). It enables: |
| 25 | + |
| 26 | +- Gene-level pages with annotations, resources, and constraint metrics |
| 27 | +- Variant-level pages showing functional consequence, allele frequencies, ClinVar, dbSNP, and CADD |
| 28 | +- Frequency tables stratified by **ancestry** and **phenotype** (Case / Control / Other) |
| 29 | + |
| 30 | +The GP2 Genome Browser is openly available at: |
| 31 | +**https://gp2.broadinstitute.org** |
| 32 | + |
| 33 | +This repository contains the script used to produce the browser-ready Hail tables and frequency outputs used in the manuscript. |
| 34 | + |
| 35 | +--- |
| 36 | + |
| 37 | +## Citation |
| 38 | + |
| 39 | +If you use this repository or find it helpful, please cite: |
| 40 | + |
| 41 | +> **Fang Z-H., Grant R.H., Vitale D., Hernandez C.F., Hong S., Leonard H.L., Makarious M.B., Lange L.M., Solomonson M., Heutink P., Dilliott A., Ghosh Galvelis K., Nalls M.A., Singleton A.B., Blauwendraat C., and the Global Parkinson’s Genetics Program (GP2).** |
| 42 | +> *The Global Parkinson’s Disease Genetics (GP2) Genome Browser* (2025). |
| 43 | +> |
| 44 | +> **Manuscript DOI:** coming soon |
| 45 | +> **GitHub DOI:** xx |
| 46 | +
|
| 47 | +--- |
| 48 | + |
| 49 | +## Data Statement |
| 50 | + |
| 51 | +- **GP2 WGS Data** |
| 52 | + - GP2 Release 10 (WGS): DOI **10.5281/zenodo.15748014** |
| 53 | + - Accessible via AMP-PD: https://amp-pd.org |
| 54 | + |
| 55 | +- **GP2 CES Data** |
| 56 | + - GP2 Release 8 (PDGENEration CES): DOI **10.5281/zenodo.13755496** |
| 57 | + |
| 58 | +- **Additional Controls** |
| 59 | + - Alzheimer’s Disease Sequencing Project (ADSP) WGS: DOI **10.60859/z6z9-9692**, accessed via NIAGADS |
| 60 | + |
| 61 | +- **Variant Processing Pipeline** |
| 62 | + - Single-sample calling: **DeepVariant v1.6.1** |
| 63 | + - Joint-genotyping: **GLnexus v1.4.3** |
| 64 | + - Quality control: genotype-, sample-, and variant-level filters from AMP-PD & GP2 |
| 65 | + - Genetic ancestry assignment: **GenoTools v1.2.3** |
| 66 | + - Annotation: **Ensembl VEP v111** (adds ClinVar, CADD, dbSNP, functional consequence) |
| 67 | + - Intergenic variants excluded from browser display |
| 68 | + - Allele frequencies computed by **genetic ancestry** and **phenotype** |
| 69 | + |
| 70 | +--- |
| 71 | + |
| 72 | +## Helpful Links |
| 73 | + |
| 74 | +- **GP2 Website:** https://gp2.org |
| 75 | + - GP2 Cohort Dashboard: https://gp2.org/cohort-dashboard-advanced |
| 76 | +- **AMP-PD:** https://amp-pd.org |
| 77 | +- **PDGENEration:** https://www.parkinson.org/PDGENEration |
| 78 | +- **ADSP / NIAGADS:** https://www.niagads.org |
| 79 | +- **GP2 Overview:** https://movementdisorders.onlinelibrary.wiley.com/doi/10.1002/mds.28494 |
| 80 | +- **GP2 Manuscripts:** |
| 81 | + https://pubmed.ncbi.nlm.nih.gov/?term=%22global+parkinson%27s+genetics+program%22 |
| 82 | + |
| 83 | +--- |
| 84 | + |
| 85 | +# Repository Orientation |
| 86 | + |
| 87 | +``` |
| 88 | +├── LICENSE.txt |
| 89 | +├── README.md |
| 90 | +└── analyses/ |
| 91 | + └── gp2_genome_browser_hail_table.ipynb |
| 92 | +``` |
| 93 | + |
| 94 | + |
| 95 | +--- |
| 96 | + |
| 97 | +## Analysis Notebook |
| 98 | + |
| 99 | +**Languages:** Python (Hail), and bash |
| 100 | + |
| 101 | +**Description:** |
| 102 | +This notebook performs all steps needed to generate the tables consumed by the GP2 Genome Browser, including: |
| 103 | + |
| 104 | +- Importing GP2, AMP-PD, and ADSP Hail tables |
| 105 | +- Applying genotype-, sample-, and variant-level QC |
| 106 | +- Merging WGS and CES high-quality datasets |
| 107 | +- Annotating variants with VEP, ClinVar, dbSNP, and CADD |
| 108 | +- Assigning genetic ancestry groups (via precomputed metadata) |
| 109 | +- Calculating allele frequencies stratified by ancestry & phenotype |
| 110 | +- Exporting browser-ready tables for *gene* and *variant* pages |
| 111 | + |
| 112 | + |
| 113 | +--- |
| 114 | + |
| 115 | +# Software |
| 116 | + |
| 117 | +| Software | Version | URL | RRID | Notes | |
| 118 | +|---------|---------|-----|------|-------| |
| 119 | +| **Python** | 3.9 / 3.10 | http://python.org | RRID:SCR_008394 | Main analysis environment | |
| 120 | +| **R** | 4.2 | http://www.r-project.org | RRID:SCR_001905 | Used for summary figures | |
| 121 | +| **Hail** | 0.2.x | https://hail.is | — | Variant processing, QC, tables | |
| 122 | +| **DeepVariant** | 1.6.1 | https://github.com/google/deepvariant | — | Single-sample variant calling | |
| 123 | +| **GLnexus** | 1.4.3 | https://github.com/dnanexus-rnd/GLnexus | — | Joint-genotyping (DV-WGS / DV-WES presets) | |
| 124 | +| **GenoTools** | 1.2.3 | https://github.com/GP2code/GenoTools | — | Genetic ancestry assignment | |
| 125 | +| **VEP** | v111 | https://ensembl.org/info/docs/tools/vep/ | RRID:SCR_007931 | Variant consequence + annotation | |
| 126 | +| **KING** | 2.3.0 | https://kingrelatedness.com | RRID:SCR_009251 | Relatedness inference | |
| 127 | + |
| 128 | + |
| 129 | +--- |
| 130 | + |
| 131 | +# Data & Code Availability |
| 132 | + |
| 133 | +- GP2 and AMP-PD datasets are available through application via **AMP-PD** (https://amp-pd.org). |
| 134 | +- ADSP data are available via **NIAGADS** (NG00067; DOI: 10.60859/z6z9-9692). |
| 135 | +- Code in this repository reproduces browser-ready data tables and manuscript figures. |
| 136 | +- A permanent Zenodo archive of this repository will be linked via the DOI badge above. |
| 137 | + |
| 138 | +--- |
| 139 | + |
0 commit comments