id
type
5 (blog/news article)
status
15 (other 4xx/5xx http response (other than 403/404/429))
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-06-18 15:17:18
updated at
2025-06-18 15:17:18
url
https://bajzel.ustrazaka.pl/2020/04/witaj-swiecie/
url length
50
url crc
14441
url crc32
2123380841
location type
0 (new or legacy)
canonical status
10 (verified canonical url)
canonical page id
-
domain id
domain tld
0
domain parts
0
originating warc id
-
originating url
http://bajzel.ustrazaka.pl/2020/04/witaj-swiecie/
source type
10 (canonical url)
server ip
Publication date
2025-11-14 19:01:22
Fetch attempts
1
Original html size
0
Normalized and saved size
0
block type
3
extracted fields
0
extracted bits
–
detected location
0
detected language
0 (awaiting analysis)
category id
-
index version
0
paywall score
0
spam phrases
0