Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-11-07 19:08:04

updated at

2025-11-07 19:08:05

Address

url

https://sp-chwaliszew.krotoszyn.pl/aktualnosci-lista-strona-2.html

url length

66

url crc

37241

url crc32

123834745

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

201545593

domain tld

616

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280090.92/warc/CC-MAIN-20250809075926-20250809105926-00947.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

51.68.154.108

Publication date

2025-08-09 09:59:44

Fetch attempts

0

Original html size

68573

Normalized and saved size

55415

Content

title

excerpt

content

author

CONCEPT Intermedia www.sam3.pl

updated

1766927733

Text analysis

block type

0

extracted fields

4

extracted bits

article author

detected location

170

detected language

121 (Polish)

category id

Children (50)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

5535

text words

896

text unique words

456

text lines

170

text sentences

28

text paragraphs

14

text words per sentence

32

text matched phrases

14

text matched dictionaries

16