id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-06-24 01:33:57
updated at
2025-11-19 00:55:29
url
https://praca.gazetaprawna.pl/artykuly/1434422,kara-za-zbieranie-adresow-dla-zus-w-aktach-osobowych.html
url length
104
url crc
47174
url crc32
519223366
location type
4 (page_location points to new url in different domain)
canonical status
10 (verified canonical url)
canonical page id
-
location
https://www.gazetaprawna.pl/praca/artykuly/1434422,kara-za-zbieranie-adresow-dla-zus-w-aktach-osobowych.html
domain id
domain tld
0
domain parts
0
originating warc id
-
originating url
https://praca.gazetaprawna.pl/artykuly/1434422,kara-za-zbieranie-adresow-dla-zus-w-aktach-osobowych.html#new_tab
source type
10 (canonical url)
server ip
publication date
2025-11-19 00:55:28
fetch attempts
1
original html size
282965
normalized and saved size
282965
282965
title
No penalty for collecting addresses for ZUS in personnel files
excerpt
content
Advertisement ...
author
updated
1767284943
block type
809
extracted fields
233
extracted bits
featured image
title
full content
content was extracted heuristically
OpenGraph suggests this is an article
detected location
0
detected language
121 (Polish)
category id
-
index version
1
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
7426
text words
1272
text unique words
669
text lines
1
text sentences
68
text paragraphs
1
text words per sentence
18
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
4
links other domains
2
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
115
links ext leaks
0
links ext ugc
1
links ext klim
0
links ext generic
1