Main

type

2

status

20

review version

1

cleanup version

2

pending deletion

0

created at

2023-10-03 12:09:06

updated at

2025-10-12 22:20:45

Address

url

https://media.pracuj.pl/search

url length

30

url crc

710

url crc32

2999583430

location type

1

canonical status

10

canonical page id

104758522

Source

domain id

52365078

domain tld

616

domain parts

3

originating warc id

-

originating url

https://media.pracuj.pl/sitemap.xml

source type

1

Server response

server ip

34.111.245.103

pubdate

2025-08-13 07:33:03

attempts

0

size orig

38800

size saved

31583

Text analysis

block type

0

extracted fields

9

extracted bits

detected location

0

detected language

121 (Polish)

category id

Kariera (184)

index version

2025061201

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

5143

text words

832

text unique words

457

text lines

93

text sentences

53

text paragraphs

14

text words per sentence

15

text matched phrases

0

text matched dictionaries

0