It's actually super straightforward, I think it'd be possible to look at several high profile repos, scrape the most used, and then look at their bios
-
Might be better to use GithubArchive
-
Command to Get the Data for All of January 2015:
`wget http://data.githubarchive.org/2015-01-{01..30}-{0..23}.json.gz`
-
-
http://githut.info/: Cool Website that has nice visualization of popularity of github projects
-
Visualization of Github Connections: http://graphub.yodas.com/ Github connections as a D3 graph. Find your GH Erdős number from other GH users by co-contributions
-
Better Explorer of Popular Github Projects: http://gitmostwanted.com/
-
Getting the JSON File as Table:
df <- data.frame(matrix(unlist(users), nrow=330, byrow=T),stringsAsFactors=FALSE)