id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-11-18 12:11:26
updated at
2025-11-18 12:11:28
url
https://100daysofnetworks.substack.com/p/day-3-of-100daysofnetworks
url length
67
url crc
14024
url crc32
3304142536
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
2211
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279957.1/warc/CC-MAIN-20250807120334-20250807150334-00914.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-07 12:34:34
Fetch attempts
0
Original html size
257847
Normalized and saved size
113219
title
excerpt
content
Welcome to day 3 of #100days of networks. If you would like to learn more about networks and network analysis, please buy a copy of my book! Today, we are going to talk about CENTRALITIES. Network centralities are a useful tool to quickly identify interesting nodes (people, things, etc.) in any network. Once you have built a graph, you should use centralities to get a lay of the land, to "learn the main characters", so to say. In today's exercise, we will use the Les Miserables graph from NetworkX, to keep things simple. You can use my Github code to follow along. Here is a bit about centralities: Degree Centrality: Importance based on the number of degrees (edges). Betweenness Centrality: Importance based on whether a node sits between other nodes; information flows through them. They can also be gatekeepers. They have power. Closeness Centrality: Importance based on a node's closeness to other nodes. Has to do with number of steps away. PageRank: Importance based on number of inbound and outbound ...
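The degree and closeness definitions above can be sketched in plain Python. This is a minimal illustration, not the author's code: the five-node graph is a hypothetical toy example (not the Les Miserables data), the helper names are made up for this sketch, and the closeness formula assumes a connected, undirected graph. The normalization used here, dividing by n - 1, is the usual convention and matches what NetworkX reports for a graph like this one.

```python
from collections import deque

# Hypothetical toy graph (not the Les Miserables data): undirected adjacency lists.
graph = {
    "A": ["B", "C", "D"],
    "B": ["A", "C"],
    "C": ["A", "B"],
    "D": ["A", "E"],
    "E": ["D"],
}


def degree_centrality(g):
    """Fraction of the other nodes that each node is directly connected to."""
    n = len(g)
    return {node: len(neighbors) / (n - 1) for node, neighbors in g.items()}


def shortest_path_lengths(g, source):
    """Breadth-first search: number of steps from source to each reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        current = queue.popleft()
        for neighbor in g[current]:
            if neighbor not in dist:
                dist[neighbor] = dist[current] + 1
                queue.append(neighbor)
    return dist


def closeness_centrality(g):
    """(n - 1) divided by the total number of steps to all other nodes.

    Assumes the graph is connected, so every node is reachable.
    """
    n = len(g)
    result = {}
    for node in g:
        total_steps = sum(shortest_path_lengths(g, node).values())
        result[node] = (n - 1) / total_steps
    return result


degree = degree_centrality(graph)
closeness = closeness_centrality(graph)

# List nodes from most to least connected.
for node in sorted(graph, key=degree.get, reverse=True):
    print(f"{node}: degree={degree[node]:.2f} closeness={closeness[node]:.2f}")
```

In this toy graph, node A has the most edges and is also the fewest steps away from everyone else, so it ranks first on both measures; node E hangs off the end of a chain and ranks last on both.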
author
David Knickerbocker
updated
2025-11-28 08:31:59
block type
0
extracted fields
36
extracted bits
article author
full content
detected location
0
detected language
1 (English)
category id
-
index version
1
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
11174
text words
2364
text unique words
579
text lines
149
text sentences
173
text paragraphs
42
text words per sentence
13
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
0
links other domains
9 - a.co, networkx.org
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
10 - substack.com
links ext klim
0
links ext generic
0
status
0
updated
2025-11-28 08:31:59
image author
featured image