Main

type

0

status

20

review version

1

cleanup version

2

pending deletion

0

created at

2023-12-05 06:44:00

updated at

2024-01-23 06:40:35

Address

url

https://chelm.naszemiasto.pl/przeglad-prasy-18072022-chelm-zobacz-wczorajsza-prasowke-poznaj-najwazniejsze-informacje-z-chelma/ar/c1p1-19738689

url length

143

url crc

30240

url crc32

29062688

location type

3

canonical status

30

canonical page id

2219080202

location

https://chelm.naszemiasto.pl/przeglad-prasy-z-chelma-sprawdz-najwazniejsze-informacje-z-wczoraj-prasowka-23-01-2024/ar/c1p1-19738689

Source

domain id

74184676

domain tld

616

domain parts

3

originating warc id

-

originating url

https://garazowki.pl/post-sitemap10.xml

source type

1

Server response

server ip

-

Publication date

2024-01-23 06:40:35

Fetch attempts

0

Original html size

0

Normalized and saved size

174627

Text analysis

block type

0

extracted fields

61

extracted bits

detected location

0

detected language

121 (Polish)

category id

Weather and climate (34)

index version

2025030501

paywall score

0

spam phrases

0