Skip to content

Latest commit

 

History

History
15 lines (11 loc) · 687 Bytes

File metadata and controls

15 lines (11 loc) · 687 Bytes

BigData-architecture

pokemon:

  • Data analysis on a simple dataset representing statistics about Pokemons

wordcount:

  • Map-Reduce task that counts words in a text
  • MapReduce job that associates each word with the most frequent word that follows it
  • MapReduce jobs counting the number of unique words in a book

sp500-analysis: Data preprocessing and analysis using Spark:

  • Describe the data: What does the data contain? What are the values of the classical metrics (min, max, average, standard deviation, ...)?
  • Insights: Year in which the stock market varied the most, Company with highest VolumeExchanged/SharesOutstanding for each year...