Main

type

7 (about/contact/privacy/terms page)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-01-12 06:38:16

updated at

2026-01-12 22:35:59

Address

url

https://rekrutacja.curie.pl/registration/

url length

41

url crc

21901

url crc32

2267960717

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

1816221075

Source

domain id

42299359

domain tld

0

domain parts

0

originating warc id

-

originating url

https://rekrutacja.curie.pl/wp-sitemap-posts-page-1.xml

source type

1 (sitemap)

Server response

server ip

195.78.67.22

Publication date

2025-07-17 15:53:27

Fetch attempts

1

Original HTML size

56826

Normalized and saved size

29485

Content

title

Rejestracja Kandydata – Rekrutacja.Curie.pl

excerpt

content

author

updated

2026-01-18 09:53:05

Text analysis

block type

0

extracted fields

8

extracted bits

title

detected location

0

detected language

121 (Polish)

category id

Kariera (184)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

808

text words

119

text unique words

88

text lines

47

text sentences

3

text paragraphs

0

text words per sentence

39

text matched phrases

5

text matched dictionaries

6