An analysis and visualization of collaboration between top GitHub repositories, focused on the relationship between programming languages used and the network structure.
Interactive visualization:
More information and the full analysis: PDF report
- This is my final project for Social Network Analysis, and an entry in the GitHub Data Challenge.
- Created using igraph, Gephi and sigma.js.
- Thanks to @frewsxcv and @ThePicard for their help.
- From Google BigQuery:
  - repo-attributes.sql creates repo-attributes.csv
  - repo-weights.sql creates repo-weights.csv
- The process.py script reads both .csv files and creates repositories.gml
- Gephi loads repositories.gml and creates:
  - repositories*.pdf to be included in the report
  - repositories.json for sigma.js
- Python code within analysis-*.texw reads repositories.gml and produces output for the report
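The CSV-to-GML step above can be illustrated with a minimal sketch. This is not the actual process.py: it uses only the Python standard library rather than igraph, and the column names (`repository`, `language`, `repository1`, `repository2`, `weight`) and the `build_gml` helper are assumptions made for illustration. The real column layout is whatever the two .sql queries produce.

```python
import csv
import io


def build_gml(attributes_csv, weights_csv):
    """Return GML text for an undirected, weighted repository graph.

    Both arguments are open text files (or file-like objects).
    Column names are hypothetical, not taken from the real queries.
    """
    # One node per repository, numbered in the order the rows appear.
    nodes = {}
    for row in csv.DictReader(attributes_csv):
        nodes[row["repository"]] = (len(nodes), row["language"])

    lines = ["graph [", "  directed 0"]
    for name, (node_id, language) in nodes.items():
        # Assumes names and languages contain no double quotes.
        lines += [
            "  node [",
            f"    id {node_id}",
            f'    label "{name}"',
            f'    language "{language}"',
            "  ]",
        ]

    # One weighted edge per row; rows naming unknown repositories are skipped.
    for row in csv.DictReader(weights_csv):
        a, b = row["repository1"], row["repository2"]
        if a not in nodes or b not in nodes:
            continue
        lines += [
            "  edge [",
            f"    source {nodes[a][0]}",
            f"    target {nodes[b][0]}",
            f"    weight {float(row['weight'])}",
            "  ]",
        ]

    lines.append("]")
    return "\n".join(lines) + "\n"


# Example with made-up data standing in for the two .csv files.
attributes = io.StringIO("repository,language\nalice/foo,Python\nbob/bar,Ruby\n")
weights = io.StringIO("repository1,repository2,weight\nalice/foo,bob/bar,3\n")
gml = build_gml(attributes, weights)
```

Writing `gml` to repositories.gml would give a file with one node per repository (carrying its language as an attribute) and one weighted edge per collaboration row, which is the kind of input Gephi can load.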