-
Notifications
You must be signed in to change notification settings - Fork 1
Data Import
akollegger edited this page Oct 16, 2012
·
7 revisions
Importing data into Neo4j is very whiteboard friendly -- generally what you draw is what you store. Luckily the Github data is already very graph-like, so modeling is very straightforward.
- Understand the available data
- Whiteboard what you'd like to import
- Generally, import "nodes" then "relationships"
- Pick a reference project
- Read through the code
- notice the strategy for scanning Github
- identify the conversion from Github data to Nodes + Relationships
- Fork and modify
- fork the project
- add more data from Github
- add extra in-graph structures
- add a webapp to proxy Graph Queries
- add a webapp with Data Visualization
- pick an Github API Library in your language of choice
- create a project repository
- scope your ambition
- import data, query with
neo4j-shell - import data, provide web app to proxy Graph Queries
- import data, provide web app with Data Visualization
- import data, query with
- import strategies
- two-phase import - import all nodes, then scan again and create relationships
- accounting - import nodes, keeping track of them and creating relationships when possible
- Github API Libraries - all the usual languages