Skip to content

Commit add62ba

Browse files
Update README.md
Co-Authored-By: Zih-Hua Fang <7847344+zihhuafang@users.noreply.github.com>
1 parent 62ba4e9 commit add62ba

1 file changed

Lines changed: 139 additions & 1 deletion

File tree

README.md

Lines changed: 139 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1,139 @@
1-
# GP2-Genome-Browser
1+
# The Global Parkinson’s Disease Genetics (GP2) Genome Browser
2+
3+
`GP2 ❤️ Open Science 😍`
4+
5+
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
6+
7+
8+
**Last Updated:** December 2025
9+
10+
---
11+
12+
## Summary
13+
14+
This repository accompanies the brief report **“The Global Parkinson’s Disease Genetics (GP2) Genome Browser.”**
15+
16+
The GP2 Genome Browser is an open-access platform that provides gene- and variant-level information from one of the largest ancestry-diverse sequencing datasets assembled for Parkinson’s disease (PD) research.
17+
18+
The browser integrates:
19+
20+
- **31,665 whole-genome sequences (WGS)**
21+
- **9,559 clinical exomes (CES)**
22+
- **11 genetically determined ancestries**
23+
24+
using a unified joint-calling, QC, and annotation pipeline (DeepVariant → GLnexus → GenoTools → VEP). It enables:
25+
26+
- Gene-level pages with annotations, resources, and constraint metrics
27+
- Variant-level pages showing functional consequence, allele frequencies, ClinVar, dbSNP, and CADD
28+
- Frequency tables stratified by **ancestry** and **phenotype** (Case / Control / Other)
29+
30+
The GP2 Genome Browser is openly available at:
31+
**https://gp2.broadinstitute.org**
32+
33+
This repository contains the script used to produce the browser-ready Hail tables and frequency outputs used in the manuscript.
34+
35+
---
36+
37+
## Citation
38+
39+
If you use this repository or find it helpful, please cite:
40+
41+
> **Fang Z-H., Grant R.H., Vitale D., Hernandez C.F., Hong S., Leonard H.L., Makarious M.B., Lange L.M., Solomonson M., Heutink P., Dilliott A., Ghosh Galvelis K., Nalls M.A., Singleton A.B., Blauwendraat C., and the Global Parkinson’s Genetics Program (GP2).**
42+
> *The Global Parkinson’s Disease Genetics (GP2) Genome Browser* (2025).
43+
>
44+
> **Manuscript DOI:** coming soon
45+
> **GitHub DOI:** xx
46+
47+
---
48+
49+
## Data Statement
50+
51+
- **GP2 WGS Data**
52+
- GP2 Release 10 (WGS): DOI **10.5281/zenodo.15748014**
53+
- Accessible via AMP-PD: https://amp-pd.org
54+
55+
- **GP2 CES Data**
56+
- GP2 Release 8 (PDGENEration CES): DOI **10.5281/zenodo.13755496**
57+
58+
- **Additional Controls**
59+
- Alzheimer’s Disease Sequencing Project (ADSP) WGS: DOI **10.60859/z6z9-9692**, accessed via NIAGADS
60+
61+
- **Variant Processing Pipeline**
62+
- Single-sample calling: **DeepVariant v1.6.1**
63+
- Joint-genotyping: **GLnexus v1.4.3**
64+
- Quality control: genotype-, sample-, and variant-level filters from AMP-PD & GP2
65+
- Genetic ancestry assignment: **GenoTools v1.2.3**
66+
- Annotation: **Ensembl VEP v111** (adds ClinVar, CADD, dbSNP, functional consequence)
67+
- Intergenic variants excluded from browser display
68+
- Allele frequencies computed by **genetic ancestry** and **phenotype**
69+
70+
---
71+
72+
## Helpful Links
73+
74+
- **GP2 Website:** https://gp2.org
75+
- GP2 Cohort Dashboard: https://gp2.org/cohort-dashboard-advanced
76+
- **AMP-PD:** https://amp-pd.org
77+
- **PDGENEration:** https://www.parkinson.org/PDGENEration
78+
- **ADSP / NIAGADS:** https://www.niagads.org
79+
- **GP2 Overview:** https://movementdisorders.onlinelibrary.wiley.com/doi/10.1002/mds.28494
80+
- **GP2 Manuscripts:**
81+
https://pubmed.ncbi.nlm.nih.gov/?term=%22global+parkinson%27s+genetics+program%22
82+
83+
---
84+
85+
# Repository Orientation
86+
87+
```
88+
├── LICENSE.txt
89+
├── README.md
90+
└── analyses/
91+
└── gp2_genome_browser_hail_table.ipynb
92+
```
93+
94+
95+
---
96+
97+
## Analysis Notebook
98+
99+
**Languages:** Python (Hail), and bash
100+
101+
**Description:**
102+
This notebook performs all steps needed to generate the tables consumed by the GP2 Genome Browser, including:
103+
104+
- Importing GP2, AMP-PD, and ADSP Hail tables
105+
- Applying genotype-, sample-, and variant-level QC
106+
- Merging WGS and CES high-quality datasets
107+
- Annotating variants with VEP, ClinVar, dbSNP, and CADD
108+
- Assigning genetic ancestry groups (via precomputed metadata)
109+
- Calculating allele frequencies stratified by ancestry & phenotype
110+
- Exporting browser-ready tables for *gene* and *variant* pages
111+
112+
113+
---
114+
115+
# Software
116+
117+
| Software | Version | URL | RRID | Notes |
118+
|---------|---------|-----|------|-------|
119+
| **Python** | 3.9 / 3.10 | http://python.org | RRID:SCR_008394 | Main analysis environment |
120+
| **R** | 4.2 | http://www.r-project.org | RRID:SCR_001905 | Used for summary figures |
121+
| **Hail** | 0.2.x | https://hail.is || Variant processing, QC, tables |
122+
| **DeepVariant** | 1.6.1 | https://github.com/google/deepvariant || Single-sample variant calling |
123+
| **GLnexus** | 1.4.3 | https://github.com/dnanexus-rnd/GLnexus || Joint-genotyping (DV-WGS / DV-WES presets) |
124+
| **GenoTools** | 1.2.3 | https://github.com/GP2code/GenoTools || Genetic ancestry assignment |
125+
| **VEP** | v111 | https://ensembl.org/info/docs/tools/vep/ | RRID:SCR_007931 | Variant consequence + annotation |
126+
| **KING** | 2.3.0 | https://kingrelatedness.com | RRID:SCR_009251 | Relatedness inference |
127+
128+
129+
---
130+
131+
# Data & Code Availability
132+
133+
- GP2 and AMP-PD datasets are available through application via **AMP-PD** (https://amp-pd.org).
134+
- ADSP data are available via **NIAGADS** (NG00067; DOI: 10.60859/z6z9-9692).
135+
- Code in this repository reproduces browser-ready data tables and manuscript figures.
136+
- A permanent Zenodo archive of this repository will be linked via the DOI badge above.
137+
138+
---
139+

0 commit comments

Comments
 (0)