Main

type: 0
status: 20
review version: 0
cleanup version: 0
pending deletion: 0
created at: 2025-04-20 02:33:11
updated at: 2025-09-20 18:15:45

Address

url: https://www.cuslubicz.pl/135,dokumenty-do-pobrania
url length: 50
url crc: 5171
url crc32: 818549811
location type: 1
canonical status: 2
canonical page id: -

Source

domain id: 326772266
domain tld: 0
domain parts: 0
originating warc id: -
originating url: https://cuslubicz.pl/sitemap.xml
source type: 1

Server response

server ip: 91.224.61.10
pubdate: 2025-09-20 18:15:45
attempts: 1
size orig: 162435
size saved: 122309

Text analysis

block type: 0
extracted fields: 12
extracted bits:
detected location: 0
detected language: 121 (Polish)
category id: -
index version: 1
paywall score: 0
spam phrases: 0

Text statistics

text nonlatin: 0
text cyrillic: 0
text characters: 2859
text words: 437
text unique words: 232
text lines: 84
text sentences: 15
text paragraphs: 3
text words per sentence: 29
text matched phrases: 0
text matched dictionaries: 0