Main

type

0 (not classified)

status

40 (problem with import, need manual fix)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-20 08:29:02

updated at

2025-10-20 08:29:03

Address

url

https://libr.sejm.gov.pl/tek01/txt/kpol/1791-r0.html

url length

52

url crc

2384

url crc32

3708881232

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

13560263

domain tld

4503

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280914.63/warc/CC-MAIN-20250812045045-20250812075045-00867.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

45.60.74.103

Publication date

2025-08-12 05:23:52

Fetch attempts

0

Original html size

2209

Normalized and saved size

2076

Text analysis

block type

1

extracted fields

0

extracted bits

detected location

0

detected language

0 (awaiting analysis)

category id

-

index version

0

paywall score

0

spam phrases

0