Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

1

cleanup version

2

pending deletion

0 (-)

created at

2024-03-14 23:48:13

updated at

2026-01-19 01:15:39

Address

url

https://kariera.innergo.pl/innergo_przyjazne_srodowisku/

url length

56

url crc

18983

url crc32

4145498663

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

874088168

Source

domain id

14730674

domain tld

616

domain parts

3

originating warc id

-

originating url

https://kariera.innergo.pl/post-sitemap.xml

source type

1 (sitemap)

Server response

server ip

109.95.157.165

Publication date

2025-07-16 04:45:53

Fetch attempts

1

Original html size

214757

Normalized and saved size

39045

Content

title

INNERGO przyjazne środowisku - INNERGO Kariera

excerpt

content

author

Innergo

updated

1769428798

Text analysis

block type

0

extracted fields

141

extracted bits

featured image
article author
title
OpenGraph suggests this is an article

detected location

0

detected language

121 (Polish)

category id

Kariera (184)

index version

1

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

1450

text words

238

text unique words

174

text lines

44

text sentences

17

text paragraphs

3

text words per sentence

14

text matched phrases

2

text matched dictionaries

6
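
A few fields in this record can be recomputed from other values it contains. The sketch below does this under stated assumptions, since the record does not document how the crawler derives them: "url length" is taken to be the character count of the url, "domain parts" the number of dot-separated labels in the host, and "text words per sentence" the integer quotient of words by sentences. The function name `derived_fields` is hypothetical.

```python
from urllib.parse import urlsplit

# Values copied from the record above.
URL = "https://kariera.innergo.pl/innergo_przyjazne_srodowisku/"
TEXT_WORDS = 238
TEXT_SENTENCES = 17


def derived_fields(url, words, sentences):
    """Recompute derived fields (hypothetical helper, assumed definitions)."""
    host = urlsplit(url).hostname or ""
    return {
        # Assumption: length is counted in characters of the full url.
        "url length": len(url),
        # Assumption: parts are the dot-separated labels of the host name.
        "domain parts": len(host.split(".")) if host else 0,
        # Assumption: integer division, guarded against zero sentences.
        "text words per sentence": words // sentences if sentences else 0,
    }


fields = derived_fields(URL, TEXT_WORDS, TEXT_SENTENCES)
for name, value in fields.items():
    print(f"{name}: {value}")
```

Under these assumptions the recomputed values agree with the record: 56, 3, and 14.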