wikiGraph

What is the shortest path between coffee and astronomy? wikiGraph allows you to find and explore paths between any two pages on Wikipedia.

Data are stored in a graph database (Neo4j) and paths are displayed with a force-directed layout as a scalable vector graphic (D3.js). The challenge was scaling the project from a small subgraph (100k pages) to the full dataset (11m pages) while keeping the query responsive (1 second average).

I completed this project in five weeks as a final project at Hackbright Academy, Fall 2014. Cleaning and parsing were done in Python, and I wrote the front-end in JavaScript.

Project links: Github