Releases: LucoDevro/CAGEcleaner
Releases · LucoDevro/CAGEcleaner
v1.5.0
Added features:
- New dereplication mode: dereplicate by genomic neighbourhood
- Support for non-cblaster inputs from TSVs via a session construction helper tool
Improved features:
- Made hit recovery strategies synteny-aware by checking cluster layout instead of cluster contents
- Improved logging, progress bars and error catching
Tool architecture:
- Shifted to multiple class inheritance structure to define logic common to specific workflows
- Converted and integrated bash helper scripts to Python code
Miscellaneous:
- Updated examples
- Moved docs from Wiki to ReadTheDocs
What's Changed
- Synteny-aware hit recovery by @LucoDevro in #4
- Alternative input mode from TSVs by @LucoDevro in #6
- Upgrade to v1.5 by @LucoDevro in #7
Full Changelog: v1.4.5...v1.5.0
v1.4.5
v1.4.4
v1.4.3
v1.4.2
v1.4.1
v1.4.0 [yanked]
Complete remake in object-oriented fashion. Separate child objects for local and remote dereplication runs.
Altered features:
- Merging the dereplication status table and the assembly-scaffold pairs table into one output table (extended binary table)
- Improved verbosity
v1.3.1
v1.3.0
New features:
- Added support for cblaster local and HMM mode (for both Genbank and FASTA + GFF input genome databases).
- Expanded scaffold to assembly mapping to non-gzipped fasta files
- Added option to bypass scaffolds or assemblies to retain in the output in any case
Altered features:
- Added support for more precise ID exclusion and bypassing in case of duplicate IDs by using an assembly:scaffold ID formatting internally
- Minimum ANI threshold of 82% is now enforced: CAGEcleaner exits with a warning if a user passes a lower value.