Skip to content

Commit c66e80c

Browse files
github-actions[bot]kelly-sovacool
authored andcommitted
ci: 🤖 format everything with pre-commit
1 parent 180af48 commit c66e80c

65 files changed

Lines changed: 1003 additions & 832 deletions

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

.test/README.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,8 @@
11
## Test data
2+
23
2 mouse samples:
34

4-
* iCre_D0
5-
* D4_Meso_iCre_Dox
5+
- iCre_D0
6+
- D4_Meso_iCre_Dox
67

7-
Each sample has 2 replicates. Each replicate has been subsampled to only include reads aligning to chr19:10,000,000-20,000,000
8+
Each sample has 2 replicates. Each replicate has been subsampled to only include reads aligning to chr19:10,000,000-20,000,000

README.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# ASPEN
22

3-
**A**tac **S**eq **P**ip**E**li**N**e :
3+
**A**tac **S**eq **P**ip**E**li**N**e :
44

55
[CCBR](https://bioinformatics.ccr.cancer.gov/ccbr/) recommends ASPEN to effectively analyze ATAC-seq datasets on the [BIOWULF](https://hpc.nih.gov) HPC system at the [NIH](https://www.nih.gov/).
66

@@ -12,6 +12,7 @@
1212
```bash
1313
module load ccbrpipeliner/7
1414
```
15+
1516
```bash
1617
aspen --help
1718
```

config/README.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,6 @@
11
## Config files
2-
* `config.yaml`: sets all the runtime configurations for the pipeline like output folder, resources folder, genome, tool-parameters etc.
3-
* `samples.tsv`: tab delimited sample manifest. Defaults to the samples in the .test folder of the pipeline
4-
* `multiqc_atacseq_config.yaml`: custom-multiqc config file for report generation
5-
* `create_test_config_sample_manifest_files.bash`: replicates variables `PIPELINE_HOME` and `WORKDIR` etc. to create sample manifest for testing purposes
2+
3+
- `config.yaml`: sets all the runtime configurations for the pipeline like output folder, resources folder, genome, tool-parameters etc.
4+
- `samples.tsv`: tab delimited sample manifest. Defaults to the samples in the .test folder of the pipeline
5+
- `multiqc_atacseq_config.yaml`: custom-multiqc config file for report generation
6+
- `create_test_config_sample_manifest_files.bash`: replicates variables `PIPELINE_HOME` and `WORKDIR` etc. to create sample manifest for testing purposes

config/multiqc_atacseq_config.yaml

Lines changed: 111 additions & 111 deletions
Original file line numberDiff line numberDiff line change
@@ -1,115 +1,115 @@
11
custom_data:
2-
Nreads:
3-
section_name: 'Nreads'
4-
description: 'Number of reads per replicate. Mitochondrial fraction should be < 10%.'
5-
nrf_stats:
6-
file_format: 'tsv'
7-
section_name: 'NRF PBC Stats'
8-
description: '
9-
Non-redundant Fraction (NRF): Number of distinct uniquely mapping reads (i.e. after removing duplicates) / Total number of reads.
10-
PCR Bottlenecking Coefficient 1 (PBC1): (number of genomic locations where exactly one read maps uniquely) / (number of distinct genomic locations to which some read maps uniquely).
11-
PCR Bottlenecking Coefficient 2 (PBC2): (number of genomic locations where only one read maps uniquely) / (number of genomic locations where two reads map uniquely).
12-
The preferred values are as follows: NRF>0.9, PBC1>0.9, and PBC2>3. '
13-
plot_type: 'table'
14-
pconfig:
15-
id: 'NRF Stats'
16-
title: 'NRF Stats table'
17-
fld_files:
18-
file_format: 'tsv'
19-
section_name: 'Fragment Length Distribution'
20-
description: 'Per sample FLD. Peak < 150bp = Nucleosome Free Peak, Peak between 150-300 bp = Mononucleosome Peak, Peak > 300bp = Dinucleosome Peak'
21-
plot_type: 'linegraph'
22-
pconfig:
23-
id: 'FLD'
24-
title: 'Fragment Length Distribution'
25-
ylab: 'Normalized read density X 1e3'
26-
xlab: 'Fragment Length'
27-
xmax: 1000
28-
fld_stats_peaks:
29-
file_format: 'tsv'
30-
section_name: 'FLD Stats (Peaks)'
31-
description: 'Presence or Absense of nucleosome-free, mono and di-nucleosome peaks in FLD. A nucleosome free region (NFR) must be present. A mononucleosome peak must be present.'
32-
plot_type: 'table'
33-
pconfig:
34-
id: 'FLD Stats'
35-
title: 'FLD Stats table'
36-
fld_stats_details:
37-
file_format: 'tsv'
38-
section_name: 'FLD Stats (Fractions and Ratios)'
39-
description: 'Fractions and Ratios of interest'
40-
plot_type: 'table'
41-
pconfig:
42-
id: 'FLD Stats'
43-
title: 'FLD Stats table'
44-
MACS2_Peak_Annotations:
45-
section_name: 'MACS2 Peaks'
46-
description: 'Peaks called using MACS2. For human or mouse data, Npeaks should be >150,000, though values >100,000 may be acceptable. '
47-
Genrich_Peak_Annotations:
48-
section_name: 'Genrich Peaks'
49-
description: 'Peaks called using Genrich. For human or mouse data, Npeaks should be >150,000, though values >100,000 may be acceptable. '
50-
peak_width_files:
51-
file_format: 'tsv'
52-
section_name: 'Peak width distribution'
53-
description: 'Peak width distribution of consensus peaks.'
54-
plot_type: 'linegraph'
55-
pconfig:
56-
id: 'PWD'
57-
title: 'Peak Width Distribution'
58-
ylab: 'Peak Density Percentage'
59-
xlab: 'Peak Width'
60-
xmax: 20000
61-
frip_stats:
62-
file_format: 'tsv'
63-
section_name: 'FRiP Stats'
64-
description: 'The fraction of reads in called peak regions (FRiP score) should be >0.3, though values greater than 0.2 are acceptable. Fraction of Reads in Peaks/DHS/Enhancers/Promoters are also reported. For human or mouse data, FRiPromoters is 12-20%'
65-
plot_type: 'table'
66-
pconfig:
67-
id: 'FRiP Stats'
68-
title: 'FRiP Stats table'
69-
tss_files:
70-
file_format: 'tsv'
71-
section_name: 'TSS distribution'
72-
description: 'Greenleaf Normalized TSS per sample distribution'
73-
plot_type: 'linegraph'
74-
pconfig:
75-
id: 'TSS Enrichment'
76-
title: 'TSS Enrichment Distribution'
77-
ylab: 'Greenleaf Normalized TSS Enrichment'
78-
xlab: 'Distance from TSS'
79-
tss_knicking_sites_files:
80-
file_format: 'tsv'
81-
section_name: 'TSS Score Scatter'
82-
description: 'TSS score to TSS with >20 Tn5knicking sites scatter. '
83-
plot_type: 'scatter'
84-
pconfig:
85-
id: 'TSS_scatter'
86-
title: 'TSS_Score_Scatter'
87-
ylab: 'TSS score'
88-
xlab: 'Number of TSS sites with > 20 Tn5 knick sites'
2+
Nreads:
3+
section_name: "Nreads"
4+
description: "Number of reads per replicate. Mitochondrial fraction should be < 10%."
5+
nrf_stats:
6+
file_format: "tsv"
7+
section_name: "NRF PBC Stats"
8+
description: "
9+
Non-redundant Fraction (NRF): Number of distinct uniquely mapping reads (i.e. after removing duplicates) / Total number of reads.
10+
PCR Bottlenecking Coefficient 1 (PBC1): (number of genomic locations where exactly one read maps uniquely) / (number of distinct genomic locations to which some read maps uniquely).
11+
PCR Bottlenecking Coefficient 2 (PBC2): (number of genomic locations where only one read maps uniquely) / (number of genomic locations where two reads map uniquely).
12+
The preferred values are as follows: NRF>0.9, PBC1>0.9, and PBC2>3. "
13+
plot_type: "table"
14+
pconfig:
15+
id: "NRF Stats"
16+
title: "NRF Stats table"
17+
fld_files:
18+
file_format: "tsv"
19+
section_name: "Fragment Length Distribution"
20+
description: "Per sample FLD. Peak < 150bp = Nucleosome Free Peak, Peak between 150-300 bp = Mononucleosome Peak, Peak > 300bp = Dinucleosome Peak"
21+
plot_type: "linegraph"
22+
pconfig:
23+
id: "FLD"
24+
title: "Fragment Length Distribution"
25+
ylab: "Normalized read density X 1e3"
26+
xlab: "Fragment Length"
27+
xmax: 1000
28+
fld_stats_peaks:
29+
file_format: "tsv"
30+
section_name: "FLD Stats (Peaks)"
31+
description: "Presence or Absense of nucleosome-free, mono and di-nucleosome peaks in FLD. A nucleosome free region (NFR) must be present. A mononucleosome peak must be present."
32+
plot_type: "table"
33+
pconfig:
34+
id: "FLD Stats"
35+
title: "FLD Stats table"
36+
fld_stats_details:
37+
file_format: "tsv"
38+
section_name: "FLD Stats (Fractions and Ratios)"
39+
description: "Fractions and Ratios of interest"
40+
plot_type: "table"
41+
pconfig:
42+
id: "FLD Stats"
43+
title: "FLD Stats table"
44+
MACS2_Peak_Annotations:
45+
section_name: "MACS2 Peaks"
46+
description: "Peaks called using MACS2. For human or mouse data, Npeaks should be >150,000, though values >100,000 may be acceptable. "
47+
Genrich_Peak_Annotations:
48+
section_name: "Genrich Peaks"
49+
description: "Peaks called using Genrich. For human or mouse data, Npeaks should be >150,000, though values >100,000 may be acceptable. "
50+
peak_width_files:
51+
file_format: "tsv"
52+
section_name: "Peak width distribution"
53+
description: "Peak width distribution of consensus peaks."
54+
plot_type: "linegraph"
55+
pconfig:
56+
id: "PWD"
57+
title: "Peak Width Distribution"
58+
ylab: "Peak Density Percentage"
59+
xlab: "Peak Width"
60+
xmax: 20000
61+
frip_stats:
62+
file_format: "tsv"
63+
section_name: "FRiP Stats"
64+
description: "The fraction of reads in called peak regions (FRiP score) should be >0.3, though values greater than 0.2 are acceptable. Fraction of Reads in Peaks/DHS/Enhancers/Promoters are also reported. For human or mouse data, FRiPromoters is 12-20%"
65+
plot_type: "table"
66+
pconfig:
67+
id: "FRiP Stats"
68+
title: "FRiP Stats table"
69+
tss_files:
70+
file_format: "tsv"
71+
section_name: "TSS distribution"
72+
description: "Greenleaf Normalized TSS per sample distribution"
73+
plot_type: "linegraph"
74+
pconfig:
75+
id: "TSS Enrichment"
76+
title: "TSS Enrichment Distribution"
77+
ylab: "Greenleaf Normalized TSS Enrichment"
78+
xlab: "Distance from TSS"
79+
tss_knicking_sites_files:
80+
file_format: "tsv"
81+
section_name: "TSS Score Scatter"
82+
description: "TSS score to TSS with >20 Tn5knicking sites scatter. "
83+
plot_type: "scatter"
84+
pconfig:
85+
id: "TSS_scatter"
86+
title: "TSS_Score_Scatter"
87+
ylab: "TSS score"
88+
xlab: "Number of TSS sites with > 20 Tn5 knick sites"
8989

9090
sp:
91-
nrf_stats:
92-
fn: 'NRF_stats.tsv'
93-
frip_stats:
94-
fn: 'FRiP_stats.tsv'
95-
fld_stats_peaks:
96-
fn: 'FLD_stats_peaks.tsv'
97-
fld_stats_details:
98-
fn: 'FLD_stats_fractions_ratios.tsv'
99-
tss_knicking_sites_files:
100-
fn: 'data.tss_nicking_sites.txt'
101-
peak_width_files:
102-
fn: '*.annotated.peak_width_density'
103-
fld_files:
104-
fn: '*.fld.txt'
105-
tss_files:
106-
fn: '*.tss.txt'
91+
nrf_stats:
92+
fn: "NRF_stats.tsv"
93+
frip_stats:
94+
fn: "FRiP_stats.tsv"
95+
fld_stats_peaks:
96+
fn: "FLD_stats_peaks.tsv"
97+
fld_stats_details:
98+
fn: "FLD_stats_fractions_ratios.tsv"
99+
tss_knicking_sites_files:
100+
fn: "data.tss_nicking_sites.txt"
101+
peak_width_files:
102+
fn: "*.annotated.peak_width_density"
103+
fld_files:
104+
fn: "*.fld.txt"
105+
tss_files:
106+
fn: "*.tss.txt"
107107
fn_clean_exts:
108-
- '.genrich.narrowPeak.annotation_distribution'
109-
- '.consensus.bed.annotated.peak_width_density'
110-
- '.filt.bam'
111-
- '.preseq'
112-
- '.tss.txt'
113-
- '.fld.txt'
114-
- '.bowtie2.log'
115-
- '.fastq.gz'
108+
- ".genrich.narrowPeak.annotation_distribution"
109+
- ".consensus.bed.annotated.peak_width_density"
110+
- ".filt.bam"
111+
- ".preseq"
112+
- ".tss.txt"
113+
- ".fld.txt"
114+
- ".bowtie2.log"
115+
- ".fastq.gz"

docs/css/custom.css

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ pre {
2121
/* Light blue background for notes */
2222
div.admonition.note {
2323
background-color: #e3f2fd;
24-
border-left: 5px solid #2196F3;
24+
border-left: 5px solid #2196f3;
2525
padding: 10px;
2626
margin: 10px 0;
2727
border-radius: 5px;
@@ -30,7 +30,7 @@ div.admonition.note {
3030
/* Light grey background for info */
3131
div.admonition.info {
3232
background-color: #f5f5f5;
33-
border-left: 5px solid #9E9E9E;
33+
border-left: 5px solid #9e9e9e;
3434
padding: 10px;
3535
margin: 10px 0;
3636
border-radius: 5px;
@@ -39,9 +39,8 @@ div.admonition.info {
3939
/* Orange warning box */
4040
div.admonition.warning {
4141
background-color: #fff3e0;
42-
border-left: 5px solid #FF9800;
42+
border-left: 5px solid #ff9800;
4343
padding: 10px;
4444
margin: 10px 0;
4545
border-radius: 5px;
4646
}
47-

0 commit comments

Comments
 (0)