Main

type

0 (not classified)

status

30 (imported + raw text content deleted)

review version

1

cleanup version

2

pending deletion

0 (-)

created at

2023-10-02 21:31:59

updated at

2024-11-30 21:36:29

Address

url

https://diety.nfz.gov.pl/blad-logowania

url length

39

url crc

42298

url crc32

1347069242

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

2965648

domain tld

4503

domain parts

4

originating warc id

-

originating url

https://diety.nfz.gov.pl/sitemap.xml

source type

1 (sitemap)

Server response

server ip

-

Publication date

2024-11-30 21:36:29

Fetch attempts

0

Original html size

0

Normalized and saved size

18912

Text analysis

block type

0

extracted fields

4

extracted bits

detected location

0

detected language

121 (Polish)

category id

Medycyna (36)

index version

2025030501

paywall score

0

spam phrases

0