CS455 - Introduction to Distributed Systems
Authors
Laksheen Mendis
Menuka Warushavithana
Philip Kirner
Compile and package into a JAR file
Running on a Cluster
Use the scripts in the scripts directory
HighestHourByCounty.scala
Find the hour of day when the highest number of traffic violations were recorded in each county in New York City
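The aggregation described above can be sketched with plain Scala collections. This is only an illustration of the group-and-count idea, not the code in HighestHourByCounty.scala: the `Violation` case class, its field names, and the sample rows are assumptions, since the dataset schema is not shown here, and the real job presumably runs on the cluster rather than on an in-memory `Seq`.

```scala
// Hypothetical record type; the real dataset's columns are not shown in this README.
final case class Violation(county: String, hour: Int)

object HighestHourSketch {
  // For each county, count violations per hour and keep the (hour, count)
  // pair with the largest count.
  def highestHourByCounty(records: Seq[Violation]): Map[String, (Int, Int)] =
    records
      .groupBy(_.county)
      .map { case (county, rows) =>
        val countsByHour = rows.groupBy(_.hour).map { case (h, rs) => (h, rs.size) }
        // Ties are broken by the earliest hour so the result is deterministic.
        val best = countsByHour.toSeq.sortBy { case (h, c) => (-c, h) }.head
        county -> best
      }

  def main(args: Array[String]): Unit = {
    // Made-up sample rows, for illustration only.
    val sample = Seq(
      Violation("Kings", 8), Violation("Kings", 8), Violation("Kings", 17),
      Violation("Queens", 9), Violation("Queens", 13), Violation("Queens", 13)
    )
    highestHourByCounty(sample).toSeq.sortBy(_._1).foreach {
      case (county, (hour, count)) => println(s"$county: hour=$hour count=$count")
    }
  }
}
```

The same pattern applies to the month-based job if the hour field is replaced with a month field.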
HighestMonthByCounty.scala
Find the month of year when the highest number of traffic violations were recorded in each county in New York City
ViolationTypeByYear.scala
Find the highest number of recorded violations for each month in each county in New York City
VehicleMakeByCounty.scala
Find the top 5 vehicle makes associated with traffic violations in each county
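A top-N-per-group query like this one can be sketched as follows, again with plain Scala collections. The `(county, make)` pair representation and the function name `topMakesByCounty` are assumptions made for the example and may not match what VehicleMakeByCounty.scala actually does.

```scala
object TopMakesSketch {
  // records: (county, vehicleMake) pairs; this shape is illustrative only.
  // Returns, for each county, up to n (make, count) pairs ordered by
  // descending count, with ties broken alphabetically by make.
  def topMakesByCounty(
      records: Seq[(String, String)],
      n: Int = 5
  ): Map[String, Seq[(String, Int)]] =
    records
      .groupBy { case (county, _) => county }
      .map { case (county, rows) =>
        val counts = rows
          .groupBy { case (_, make) => make }
          .map { case (make, rs) => (make, rs.size) }
        county -> counts.toSeq.sortBy { case (make, c) => (-c, make) }.take(n)
      }
}
```

Sorting each county's counts is fine for a sketch; on a large dataset a bounded priority queue per county would avoid sorting every make.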
ViolationTypeByYear.scala
Find the most frequently recorded types of traffic violations in New York City for each year in the dataset.
census-data/census_county_level_processor.py
Extract the census data for counties in New York City from U.S. Census Data