Releases: WatsonLab/KOunt
Releases · WatsonLab/KOunt
v1.2.0
This release of KOunt contains the following new features/changes:
- Removal of Barrnap and tRNAscan-SE from the pipeline, all data that would've been annotated by them is run through the rest of the pipeline to save time, flowchart amended to reflect this
- Inclusion of the new quality filtering step, evenness of protein coverage. All of the coverage and kofamscan_results rules have been updated to give users the option of using/no using this
- Replacement of the bedtools_samtools_KOunt.yaml with coverage_KOunt.yaml
- Fixing bug in kallisto KOunt calculation
- Amending bed file creation in the nohit_fastq rules
- Added commands to remove large unwanted output files
- Changed the coverage rules to convert bam files to bed files which reduces memory needed
- Updated the template config.yaml
- Added a file with the average memory and time used for the samples in our manuscript
v1.1.0
This release of KOunt contains updated conda environmental yamls to allow installation with mamba; this is recommended as it is much faster. Links to subsampled rumen metagenome reads (ERR2027889) are also included to enable quick testing of whether installation has been successful. The expected location of the input to KOunt has been changed, now it expects each sample to be in a folder with the sample name; it will now use all samples within the raw_reads directory specified in config.yaml.
The following bug fixes have also been made:
- Edited the recommended snakemake command to include the number of cores
- Removed
profiles.tar.gzonce gunzipped - Removed the
all_kos_foundfile once complete - Removed
v0.24.tar.gzonce gunzipped - Updated
rule kofamscanto have the output ofrule kegg_dbas input - Added commands to shorten the trimmed fastq read ids and add
/1or/2to the end of read ids that don't already end with that - Added
fastq_utilstorule trimto check read names are unique - user has the option to check this or not - Updated
envs/fastp_utils_KOunt.yamlto include the softwarefastq_utils - Added option to
config.yamlto check read names are unique - Amended
rule kallisto_unmappedto usemkdir -pin case a barrnap directory hadn't been made - Amended
scripts/rrna_trna_cov.shto avoid segmentation fault if no RNAs are found - Fixed errors in
rule rna_abundance_noclusteringnot finding output files - To make
rule kofamscan_results_no_clusteringindependant ofrule kegg_fastasthe following new rules were created:barrnap_noclusteringdiamond_search_noclusteringkallisto_noclusteringnohit_annotate_reads_noclusteringnohit_fastas_noclusteringnohit_fastq_noclusteringrna_abundance_noclusteringtrnascan_noclustering
- Replaced
params.misswithoutput.missinrule rna_abundanceandrule rna_abundance_no_clustering - Removed
-cfrom the kallisto commands - Amended
rule kallisto,rule kallisto_unmappedandrule kallisto_noclusteringto filter out KOs with zero abundance - Added
LC_ALL=Cbefore the sort commands inrule coverageto prevent abedtoolserror with the input not being sorted lexicographically - Amended
scripts/rrna_trna_cov.shto remove unneeded output files - Amended
scripts/unmapped_reads_ko_100.shto remove unneeded output files - Added commands to delete raw
kallistooutput and other unneeded output files - Changed output of files going into
mmseqstoMMSeq_Outputs - Removed the
nohit fileoutput from therule kofamscan_results_noref