Skip to content
Ashrith Mekala edited this page Feb 11, 2014 · 1 revision

Mocks data generated by online social gaming entertainment provider

DataSet Format: (Single table format)

column_header type description
cid uuid customer id
cname string name of the user
email string email address of the user
gender string customer's gender
age int age of the customer
address string user provided address during registration, stores only US based addresses other countries gets 'N/A'
country string country to which customer belongs to
register_date long date on which user registered with us in milliseconds
friend_count int number of friends a user has
lifetime int number of days a user has been active since registration date
citygame_played int number of times citygame has been played by user
pictionarygame_played int number of times pictionary game has been played by user
scramblegame_played int number of times scaramble game has been played by user
snipergame_played int number of times sniper game has been played by user
revenue int revenue generated by the user
paid_subscriber string whether the customer is paid customer or not, represented by yes or no

Multi Table Format

Customer table (customer specific records)

column_header type desc
cid uuid (string) customer id
cname string customer name
gender string customer gender
age int age of the customer
register_date long date on which user registered with us in milliseconds
country string country to which customer belongs to
friend count int number of friends a user has
total days int number of days a user has been active since registration date

Revenue table (entry made only if the user has generated some revenue to the site)

column_header type desc
cid uuid (string) customer id
date payment paid long date on which user made payment in milliseconds
revenue int payment amount

Fact table (contains facts on when a user played a specified game)

column_header type desc
cid uuid (string) customer id
game played string name of the game user played
played date long time at which the user initiated playing a game

If data format is in avro, the avro schemas are available in src/main/resources/avro_schemas

Clone this wiki locally