vinisalazar/sparkmer-count

# Install dependencies
$ pip install pyspark psutil

# Set up docker env
$ docker-compose up -d

# Run application
$ python sparkmer-count/sparkmer-count.py -i data/synechococcus -o synechococcus.csv -k 2

usage: sparkmer-count.py [-h] -i INPUT [-o OUTPUT] [-k KMER]

Script to run kmer count with Spark.

optional arguments:
  -h, --help            show this help message and exit
  -i INPUT, --input INPUT
                        Input directory containing genome contigs file.
  -o OUTPUT, --output OUTPUT
                        Output file with kmer counts. Default is
                        kmer_count_out.txt
  -k KMER, --kmer KMER  Size of kmer. Default is 3
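
The help text above can be reproduced with a small `argparse` parser. The following is a minimal sketch reconstructed only from that help output, not the repository's actual code. The function name `build_parser` is hypothetical, and parsing `-k` as an integer is an assumption (the help text only says "Size of kmer").

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser matching the documented usage (hypothetical reconstruction)."""
    parser = argparse.ArgumentParser(
        prog="sparkmer-count.py",
        description="Script to run kmer count with Spark.",
    )
    # -i appears without brackets in the usage line, so it is treated as required.
    parser.add_argument(
        "-i", "--input", required=True,
        help="Input directory containing genome contigs file.",
    )
    parser.add_argument(
        "-o", "--output", default="kmer_count_out.txt",
        help="Output file with kmer counts. Default is kmer_count_out.txt",
    )
    # Assumption: the k-mer size is parsed as an int.
    parser.add_argument(
        "-k", "--kmer", type=int, default=3,
        help="Size of kmer. Default is 3",
    )
    return parser


if __name__ == "__main__":
    # Same arguments as the "Run application" command above.
    args = build_parser().parse_args(
        ["-i", "data/synechococcus", "-o", "synechococcus.csv", "-k", "2"]
    )
    print(args.input, args.output, args.kmer)
```

Note that `argparse` in older Python versions lists every flag under "optional arguments", including required ones such as `-i`, which is why `-i` shows up in that group in the help output.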

About

k-mer counting with Spark ✨
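
For readers new to the idea, k-mer counting means tallying every overlapping substring of length k in a set of sequences. The sketch below is plain Python, not the repository's Spark code. The names `kmers` and `count_kmers` are hypothetical, and upper-casing the input is an assumption. It is written as a "map" step (emit k-mers) and a "reduce" step (sum counts), which is roughly the shape a PySpark version would take with RDD operations such as `flatMap` and `reduceByKey`.

```python
from collections import Counter
from typing import Iterable, Iterator


def kmers(sequence: str, k: int) -> Iterator[str]:
    """Yield every overlapping substring of length k (the "map" step)."""
    # Sequences shorter than k produce an empty range and yield nothing.
    for start in range(len(sequence) - k + 1):
        yield sequence[start:start + k]


def count_kmers(sequences: Iterable[str], k: int = 3) -> Counter:
    """Sum k-mer occurrences across all sequences (the "reduce" step)."""
    if k < 1:
        raise ValueError("k must be a positive integer")
    counts: Counter = Counter()
    for sequence in sequences:
        # Assumption: k-mers are counted case-insensitively.
        counts.update(kmers(sequence.upper(), k))
    return counts


if __name__ == "__main__":
    # Two toy contigs; real input would be read from the genome contigs files.
    contigs = ["ATGCAT", "GCATG"]
    for kmer, n in sorted(count_kmers(contigs, k=2).items()):
        print(kmer, n)
```

Because each sequence is processed independently and the counts are merged by key, the work parallelizes naturally, which is what makes Spark a reasonable fit for large genome collections.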