GitHub - seanvig/rnaseq_count: Snakemake pipeline for RNAseq feature counting

RNAseq mapping and counting pipeline

This workflow performs fastq quality trimming, quality checks, alignment and counting using STAR.

The workflow is built using snakemake, and is organized according to recommended best practices from snakemake as described https://snakemake.readthedocs.io/en/stable/snakefiles/deployment.html. For a tutorial on how to use snakemake, see here. The key files are workflow/Snakefile and config/config.yaml included in this repo. All programs, with the exception of R, are run in conda environemnts specified in the workflow. These conda environments can be reused, or optionally spun up fresh.

The workflow assumes that you have a resources/fastq directory containing the fastq files. It also assumes that you have a reference genome and gtf file located within the resources/genome directory. Included in the workflow is an indexing step that may or may not be necessary depending on where you point to for your reference genome.

The star workflow now includes a fastq quality control step that uses trim galore for fastq quality checking and trimming.

Following alignment and counting, the data is ready for analysis using DESeq2, edgeR, limma, or any other tool of choice.

Setup Environment

Conda

Snakemake recommends using mamba over conda. Mamba is a C++ implementation of conda, and thus will generally run faster. If you have not installed either, we recommend installing Mambaforge.

If you already have conda, mamba is included in the snakemake environment setup below.

Snakemake

To build a snakemake conda environment you can use this command:

conda env create -f workflow/envs/snakemake.yml
conda config --set channel_priority strict

Workflow

This repo contains Rmarkdown documents describing how to execute the workflow.

workflow/notebooks/workflow.md describes how to run the whole workflow

Version control

Best practice is to fork this repo for each project that you want to use the pipeline on.

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
config		config
profiles/default		profiles/default
workflow		workflow
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
rnaseq_count.Rproj		rnaseq_count.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RNAseq mapping and counting pipeline

Setup Environment

Conda

Snakemake

Workflow

Version control

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RNAseq mapping and counting pipeline

Setup Environment

Conda

Snakemake

Workflow

Version control

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages