
github (Recommendation Networks)


This network dataset is in the category of Recommendation Networks



Visualize rec-github's link structure and discover valuable insights using the interactive network data visualization and analytics platform. Compare with hundreds of other network data sets across many different categories and domains.

Metadata

Category: Sparse Networks
Tags: none listed
Short: GitHub user-project authorship network
Vertex type: User, project
Edge type: Membership
Format: Bipartite
Edge weights: Unweighted
Description: Membership network of the GitHub software development site. The network is bipartite with user and project node types. An edge denotes that a user is a member of a project.
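
As a rough illustration of the bipartite structure described above, the following sketch builds the two membership maps from (user, project) pairs. The function name `build_bipartite` and the toy pairs are hypothetical; the on-disk format of the downloadable file is not specified here, so no file parsing is shown.

```python
from collections import defaultdict

def build_bipartite(memberships):
    """Build user->projects and project->users maps from (user, project) pairs."""
    user_to_projects = defaultdict(set)
    project_to_users = defaultdict(set)
    for user, project in memberships:
        # Each pair is one unweighted membership edge between the two node types.
        user_to_projects[user].add(project)
        project_to_users[project].add(user)
    return user_to_projects, project_to_users

# Hypothetical toy data, not taken from the dataset.
toy_memberships = [("u1", "p1"), ("u1", "p2"), ("u2", "p1"), ("u3", "p2"), ("u3", "p3")]
users, projects = build_bipartite(toy_memberships)
```

Keeping users and projects in separate maps avoids mixing the two node types, so the degree of a user is simply `len(users[u])` and the size of a project is `len(projects[p])`.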

Please cite the following if you use the data:

@inproceedings{nr,
     title={The Network Data Repository with Interactive Graph Analytics and Visualization},
     author={Ryan A. Rossi and Nesreen K. Ahmed},
     booktitle={AAAI},
     url={https://networkrepository.com},
     year={2015}
}

If you transform or preprocess the data, please consider sharing the result by uploading it along with details of the transformation and references to any published materials that use it.

@misc{chaconGithubAuthorship,
     author = {Scott Chacon},
     title = {The 2009 GitHub Contest},
     month = {July},
     year = {2009}
}

Network Data Statistics

Nodes: 121.7K
Edges: 440.2K
Density: 5.94326e-05
Maximum degree: 3.7K
Minimum degree: 1
Average degree: 7
Assortativity: -0.0335303
Number of triangles: 986.2K
Average number of triangles: 8
Maximum number of triangles: 21.8K
Average clustering coefficient: 0.00618751
Fraction of closed triangles: 0.0157259
Maximum k-core: 48
Lower bound of maximum clique: 8
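
The density and average degree above can be approximately reproduced from the node and edge counts. The sketch below assumes the standard formulas for an undirected simple graph, density = 2E / (N(N-1)) and average degree = 2E / N; the page does not state which convention it uses, and the inputs are the rounded counts, so agreement is only to about three significant figures.

```python
def density(num_nodes, num_edges):
    """Density of an undirected simple graph: 2E / (N * (N - 1))."""
    return 2.0 * num_edges / (num_nodes * (num_nodes - 1))

def average_degree(num_nodes, num_edges):
    """Average degree of an undirected graph: 2E / N."""
    return 2.0 * num_edges / num_nodes

# Rounded counts from the statistics above (121.7K nodes, 440.2K edges).
n = 121_700
m = 440_200

d = density(n, m)         # close to the listed 5.94326e-05
k = average_degree(n, m)  # a little above 7, listed as 7
```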

Network Data Preview

Interactive visualization of rec-github's graph structure

Interactively explore the network's graph structure!

  • Use the mouse wheel to zoom in or out
  • Hover over a node to see its degree
  • Drag the network to see more details


Interactive Visualization of Node-level Properties and Statistics

Tools for Interactive Exploration of Node-level Statistics

Visualize and interactively explore rec-github and its important node-level statistics!

  • Each point represents a node (vertex) in the graph.
  • A subset of interesting nodes may be selected and their properties visualized across all node-level statistics. To select a subset, hold down the left mouse button and drag in any direction until the nodes of interest are highlighted. This lets users explore and analyze subsets of nodes and their statistics and properties to gain insight into the graph data.
  • Zoom in/out on the visualization you created at any point by using the buttons below on the left.
  • Once a subset of interesting nodes is selected, the user may analyze it further by drilling down on any of the properties in the left menu below.
  • We also have tools for interactively visualizing, comparing, and exploring the graph-level properties and statistics.

Interactive Visualization of Node-level Feature Distributions

Node-level Feature Distributions

[Interactive plots: degree, k-core, and triangle distributions, each shown with its CDF and CCDF.]

All visualizations and analytics are interactive and support real-time exploratory analysis and data mining. They include the following features:

  • Degree, k-core, triangle, and triangle-core distributions. Each plot shows, for one of the fundamental graph features, the number of nodes with a particular value (e.g., the number of nodes that participate in k triangles or have degree k).
  • CDF and CCDF. Both are included for each graph in the collection.
  • Zooming. All visualizations and plots are zoomable; scroll to zoom in or out.
  • Panning. Click anywhere on the plot and drag in any direction to pan.
  • Adjustable scale and other application-dependent parameters. The scale of every interactive visualization can be adjusted, which is particularly important for graph data with highly skewed properties, such as the degree distribution of power-law graphs.
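
The distribution, CDF, and CCDF plots listed above can be sketched for any node-level feature as follows. This is a minimal illustration, not the platform's implementation: the function name `feature_distributions` and the toy degree list are hypothetical, and the CCDF is taken here as P(X > k), which is one of two common conventions (the other is P(X >= k)).

```python
from collections import Counter

def feature_distributions(values):
    """Return (counts, cdf, ccdf) for a list of per-node feature values.

    counts[k] is the number of nodes with value k,
    cdf[k] is P(X <= k), and ccdf[k] is P(X > k).
    """
    counts = Counter(values)
    n = len(values)
    cdf, ccdf = {}, {}
    cumulative = 0
    for k in sorted(counts):
        cumulative += counts[k]
        cdf[k] = cumulative / n
        ccdf[k] = (n - cumulative) / n
    return dict(counts), cdf, ccdf

# Hypothetical toy degree sequence, not taken from the dataset.
toy_degrees = [1, 1, 2, 3, 3, 3]
counts, cdf, ccdf = feature_distributions(toy_degrees)
```

For heavily skewed features such as degree, the CCDF is usually plotted on log-log axes, which is where an adjustable scale becomes useful.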