id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
1
cleanup version
0
pending deletion
0 (-)
created at
2026-01-05 05:52:12
updated at
2026-01-05 05:52:12
url
https://www.nitjanda.is/blogs/news
url length
34
url crc
27620
url crc32
1583508452
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
352
domain parts
2
originating warc id
6603768
originating url
source type
11 (CommonCrawl)
server ip
Publication date
2025-07-19 14:54:24
Fetch attempts
0
Original html size
101673
Normalized and saved size
47278
title
News
excerpt
content
author
updated
1768033834
block type
0
extracted fields
9
extracted bits
featured image
title
detected location
0
detected language
1 (English)
category id
Other [en] (231)
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
140
text words
31
text unique words
25
text lines
10
text sentences
2
text paragraphs
0
text words per sentence
15
text matched phrases
0
text matched dictionaries
0
image author