NetSet

WikiLinks

Info
scikit-network name wikilinks
Description Partial graph of Wikipedia (2013 dump). The adjacency matrix represents the links between articles. The biadjacency matrix represents the links between articles (rows) and words (columns) contained in their summaries (lemmatization by Spacy https://spacy.io with model "en_core_web_lg").
Creation date January 2023
Download tar.gz (885.1 MB) | zip (887.1 MB)
Adjacency
Nodes 3,210,346 articles
Edges 67,196,296 links
Type Directed
Biadjacency
Nodes 3,210,346 articles + 913,054 words
Edges 138,856,393 counts
Attributes
Attribute Sample Type
Adjacency True bool_
Biadjacency 3 int64
Names Albert Einstein string
Column names tree string