Skip to content

ramuramaiah/Spark-Odyssey

Repository files navigation

Spark-Odyssey

A journey in to the world of Machine Learning algorithms using Apache Spark.

Prerequisites

Build and Demo process

Clone the Repo

git clone https://github.com/ramuramaiah/spark-odyssey.git

Build

./gradlew clean jar

What the demo does?

The commands to run the Spark jobs are available in the batch files For e.g. To run the collocation algorithm, the colloc.bat has the following entry

colloc.bat

%SPARK_HOME%/bin/spark-submit ^
    --class spark.odyssey.colloc.Driver ^
    --jars file:///C:/mahout/lib/mahout-math-0.13.0.jar ^
    --master local ^
    --deploy-mode client ^
    --driver-memory 4g ^
    --executor-memory 2g ^
    --executor-cores 1 ^
    --queue colloc ^
    build/libs/spark-odyssey.jar ^
    --algo g_2 ^
	-s 1 ^
    "./build/resources/main/input_events.csv" ^
    "./output"

Libraries Included

  • Spark - 2.1.0

Useful Links

Issues or Suggestions

About

A journey in to the world of Machine Learning algorithms using Apache Spark.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors