id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-11-05 19:01:11
updated at
2025-11-05 19:01:12
url
https://mijn.host/blog/wat-is-shared-hosting/
url length
45
url crc
4736
url crc32
3639874176
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
2451
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280114.71/warc/CC-MAIN-20250809172211-20250809202211-00497.warc.gz
source type
11 (CommonCrawl)
server ip
publication date
2025-08-09 18:29:02
fetch attempts
0
original html size
113376
normalized and saved size
44126
title
What is shared hosting? Everything you need to know
excerpt
content
author
updated
2025-12-07 04:49:38
block type
0
extracted fields
137
extracted bits
featured image
title
OpenGraph suggests this is an article
detected location
0
detected language
9 (Dutch)
category id
index version
2025110801
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
8502
text words
1612
text unique words
532
text lines
128
text sentences
106
text paragraphs
31
text words per sentence
15
text matched phrases
2
text matched dictionaries
3
image author