Skip to content
akollegger edited this page Oct 16, 2012 · 7 revisions

Whiteboard-friendly approach

Importing data into Neo4j is very whiteboard friendly -- generally what you draw is what you store. Luckily the Github data is already very graph-like, so modeling is very straightforward.

  1. Understand the available data
  2. Whiteboard what you'd like to import
  3. Generally, import "nodes" then "relationships"

Modify a reference project

  1. Pick a reference project
  2. Read through the code
    • notice the strategy for scanning Github
    • identify the conversion from Github data to Nodes + Relationships
  3. Fork and modify

DIY From Scratch

  1. pick an Github API Library in your language of choice
  2. create a project repository
  3. scope your ambition
  4. import strategies
    1. two-phase import - import all nodes, then scan again and create relationships
    2. accounting - import nodes, keeping track of them and creating relationships when possible

References

Clone this wiki locally