Pipeline_project_COMP383

Problem 1: Retrieving Data

The download links below come from the Data access tab of each run's page in the NCBI SRA database.

To retrieve Donor 1 (2dpi): "wget https://sra-pub-run-odp.s3.amazonaws.com/sra/SRR5660030/SRR5660030"

To retrieve Donor 1 (6dpi): "wget https://sra-pub-run-odp.s3.amazonaws.com/sra/SRR5660033/SRR5660033"

Then use fasterq-dump (part of the SRA Toolkit) to convert the downloaded SRA files to paired-end FASTQ files:

Donor 1 (2dpi) : 'fasterq-dump SRR5660030'

Donor 1 (6dpi): 'fasterq-dump SRR5660033'
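The download and conversion steps above could also be driven from Python. The following is only a sketch, not part of pipeline_proj_final.py: the names SAMPLES, build_commands, and fetch are hypothetical, and the block only prints the commands (a dry run) so nothing is downloaded unless fetch is called explicitly.

```python
import subprocess

# Base URL copied from the wget commands above.
SRA_BASE_URL = "https://sra-pub-run-odp.s3.amazonaws.com/sra"

# Accessions and labels as given in this README.
SAMPLES = {
    "SRR5660030": "Donor 1 (2dpi)",
    "SRR5660033": "Donor 1 (6dpi)",
}


def build_commands(accession):
    """Return the wget and fasterq-dump argument lists for one accession."""
    url = f"{SRA_BASE_URL}/{accession}/{accession}"
    return [["wget", url], ["fasterq-dump", accession]]


def fetch(accession):
    """Run both commands in order; raises CalledProcessError if one fails."""
    for cmd in build_commands(accession):
        subprocess.run(cmd, check=True)


# Dry run: print the commands instead of executing them.
for accession, label in SAMPLES.items():
    for cmd in build_commands(accession):
        print(f"{label}: {' '.join(cmd)}")
```

Calling fetch("SRR5660030") would run the two commands for the 2dpi sample, assuming wget and fasterq-dump are on the PATH.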

To run the Script

Clone the repo with:

'git clone https://github.com/evumana/Pipeline_project_COMP383.git'

The repo contains four test FASTQ files (paired-end reads for two samples) and a Python script.

Move all four test files and the Python script to your home directory.

Then run the command:

"python3 pipeline_proj_final.py -i sampledata30_1.fastq sampledata30_2.fastq sampledata33_1.fastq sampledata33_2.fastq"
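Before running the command, it can help to confirm that every sample has both mate files. The sketch below assumes the naming convention seen in the test data (sample_1.fastq and sample_2.fastq); pair_fastqs and MATE_PATTERN are hypothetical names, and this is not necessarily how pipeline_proj_final.py handles its -i arguments.

```python
import re
from collections import defaultdict

# Assumed naming convention: <sample>_1.fastq and <sample>_2.fastq.
MATE_PATTERN = re.compile(r"^(?P<sample>.+)_(?P<mate>[12])\.fastq$")


def pair_fastqs(paths):
    """Group FASTQ file names into {sample: (mate1, mate2)}."""
    mates = defaultdict(dict)
    for path in paths:
        match = MATE_PATTERN.match(path)
        if match is None:
            raise ValueError(f"unexpected FASTQ name: {path}")
        mates[match["sample"]][match["mate"]] = path

    pairs = {}
    for sample, found in mates.items():
        # Each sample needs exactly one _1 file and one _2 file.
        if set(found) != {"1", "2"}:
            raise ValueError(f"sample {sample} is missing a mate file")
        pairs[sample] = (found["1"], found["2"])
    return pairs


# File names taken from the command above.
files = [
    "sampledata30_1.fastq",
    "sampledata30_2.fastq",
    "sampledata33_1.fastq",
    "sampledata33_2.fastq",
]
for sample, (mate1, mate2) in pair_fastqs(files).items():
    print(sample, mate1, mate2)
```

A missing or misnamed file raises ValueError early, before any alignment or assembly step would start.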

Dependencies

-Biopython

-BLAST+

-Bowtie 2

-SPAdes

-NCBI datasets

-SRA toolkit
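A quick way to check that the dependencies are installed is to look for their executables on the PATH. This is a sketch under assumptions: the executable names below are the usual entry points for each tool (blastn for BLAST+, bowtie2, spades.py, datasets, fasterq-dump), and the script may call other programs from the same packages; missing_executables and biopython_installed are hypothetical helpers.

```python
import importlib.util
import shutil

# Assumed executable names, one per command-line dependency listed above.
EXECUTABLES = ["blastn", "bowtie2", "spades.py", "datasets", "fasterq-dump"]


def missing_executables(names, which=shutil.which):
    """Return the names that cannot be found on the PATH."""
    return [name for name in names if which(name) is None]


def biopython_installed():
    """Return True if the Biopython package (module 'Bio') is importable."""
    return importlib.util.find_spec("Bio") is not None


missing = missing_executables(EXECUTABLES)
if not biopython_installed():
    missing.append("Biopython (Python package)")
print("Missing:", ", ".join(missing) if missing else "none")
```

The which parameter exists only so the lookup can be swapped out when testing; in normal use the default shutil.which is enough.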
