Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-12-04 16:10:19

updated at

2025-12-04 16:10:20

Address

url

http://archiwum.ock.org.pl/Oswiecimski-Uniwersytet-Dzieciecy-1576.html?strona=1

url length

79

url crc

41338

url crc32

3571425658

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

48757985

domain tld

4018

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279890.42/warc/CC-MAIN-20250806043623-20250806073623-00953.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

86.106.91.183

Publication date

2025-08-06 06:15:26

Fetch attempts

0

Original html size

26083

Normalized and saved size

26043

Content

title

Oświęcimski Uniwersytet Dziecięcy - Oświęcimskie Centrum Kultury

excerpt

content

author

updated

1766727345

Text analysis

block type

0

extracted fields

8

extracted bits

title

detected location

191

detected language

121 (Polish)

category id

Edukacja (47)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

4149

text words

832

text unique words

245

text lines

121

text sentences

60

text paragraphs

19

text words per sentence

13

text matched phrases

13

text matched dictionaries

8