Main

type

0

status

20

review version

1

cleanup version

2

pending deletion

0

created at

2023-12-05 06:44:00

updated at

2024-05-20 06:25:06

Address

url

https://swidnik.naszemiasto.pl/przeglad-prasy-12122021-swidnik-zobacz-wczorajsza-prasowke-poznaj-najwazniejsze-informacje-ze-swidnika/ar/c1p1-19738787

url length

150

url crc

60663

url crc32

2902191351

location type

3

canonical status

30

canonical page id

2219081083

location

https://swidnik.naszemiasto.pl/prasowka-ze-swidnika-20-05-2024-przeglad-prasy-zestawienie-najwazniejszych-informacji/ar/c1p1-19738787

Source

domain id

2865845

domain tld

616

domain parts

3

originating warc id

-

originating url

https://garazowki.pl/post-sitemap14.xml

source type

1

Server response

server ip

-

pubdate

2024-05-20 06:25:06

attempts

0

size orig

0

size saved

85692

Text analysis

block type

0

extracted fields

61

extracted bits

detected location

0

detected language

121 (Polish)

category id

Rośliny (120)

index version

2025030501

paywall score

0

spam phrases

0