All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Add ability to downsample total reads to match in group normalization
- Add conversion from bigwig to bedgraph and wig formats
- Ability to calculate queries at lower resolutions with bwtools
- Ability to downsample aligned data based on group minimum
- Gini coefficient calculations for NETseq data
- Pseudoalignment with Kallisto for stranded data
- Modeling with Sleuth
- Seperation of feature counting from modeling
- Stranded querys with sense and antisense capabilities
bwtools_query_stranded - Stranded spike-in normalization using variety of methods including Bonhoure et al. method and DEseq2 method
- Deduplication with Picard MarkDuplicates.
- Ability to set specific locations to zero when doing NETseq pause calling
- Give background corrected logos in NETseq
- Specify whether you want sequences pulled for NET-seq pauses or not
- Ability to plot large logos for NET-seq
- More pause calling types for NET-seq
- Ability to skip mark duplicates for bams
- Update versions of some packages including ncbi-acc-download to get around rate filtering issue and multiqc to deal with python updates
- Enable the ability to choose which QC you want to run
- Allow fasta files to complement full IUPAC nomenclature i.e. R complements Y.
- Issue with syntax of comparisons in
coverage_and_norm.smk - Issue with using
not input_sample.isnull()in higher version of snakemake - Issue with
bwtools_multiprocessingnot correctly identifying needed input files - Fixed issues with sample sheets that do not have a input_sample or only have single end samples
- Fixed bug in NETseq pause calling that would only report last chromosome
- Added a log2 strand bias calculation as a possible coverage track
- Calculation of the count of fragments for each contig
- Allow two separate cutadapt runs to occur for sequencing kits that require sequential trimming
- Added a rule to generate annotation files easily with
run_annotations - Added the ability to specify relative coordinates in
bwtools_query - Unstranded Peak calling and motif calling
- Modeling module with DESeq2 and stranded feature quant with HTseq-count
- Ability to better set parameters for ChIP QC in config file
- Refactored
NETseq_pause_calling.pyfor performance boost
- Fixed issue with
workflow/rules/coverage_and_norm.smkextending reads for non-paired end samples - Fixed issue with
multiqcreport not properly reporting pre and post trim fastqc reports - Fixed issue with bamPEFragmentSize would attempt to run on single end data
- Fixed issue with trimmomatic not properly running for single end data
- Fixed issue with spearman correlation calculations failing on updated coverage files from bwtools
- Fixed issue with NETseq pause calling failing when NaNs were present in data by coercing all NaNs to zeros
- Fixed issue where pulling sequences from bed files near genome boundaries would fail. Updated to consider all chromosomes as circular.
- Fixed an issue where NETseq logos would display information content on a non-standard scale. Fixed scaling to be 0 to 2.0 bits
- Fixed an issue where numpy deprecations impact deeptools quality control. Set a lower version of numpy in environment files for quality control
- Add ability to get combined annotation table and bed for genome annotations
- Fixed bug preventing stranded coverage under some normalization schemes
- Added ability to process coverage in a stranded manner
- Added support for NET-seq data
- Added limited support for bisulfite data
- Ability to collapse multiple bigwig files into one using summary stats
- Ability to perform operations between two groups of bigwig files
- Added support for single end libraries
- Added relative polymerase progression calculation to
bwtools_query(http://dx.doi.org/10.1016/j.molcel.2014.02.005) - Add the ability to smooth raw signals using convolution with a Gaussian or flat kernel.
- Add the ability to smooth raw signals using a Savitzky-Golay filter
- Ability to calculate the region-level spearman correlations for a set of samples in
bwtools_query - Add calculations of the traveling ratio (10.1016/j.molcel.2008.12.021)
- Add discovery of a local max within a region
- Output of
bwtools_queryis now compressed (gzip). File extension changed to.tsv.gz
- Allow mix and match between
querysub,queryscalefixedsub, andfixedscale.
- Create a compiled html table of all mutations called across all samples in
variant_calling
- Fixed a bug in renaming
sample/output/output.gdfiles torenamed_output/sample.gdinvariant_calling
- A
variant_callingmodule to call changes from the reference genome with ChIP input samples (#2) - Add option to control what fraction of a region can have NaNs in it and still report a summary value in
bwtools query