Main

type

5

status

21

review version

0

cleanup version

0

pending deletion

0

created at

2025-12-24 01:41:07

updated at

2025-12-24 01:41:07

Address

url

https://www.dziennik.pl/artykuly/9517615,kard-nycz-swieckosci-panstwa-nie-przekreslaja-krzyze.html

url length

98

url crc

8363

url crc32

1500324011

location type

1

canonical status

10

canonical page id

3155241520

Source

domain id

70165729

domain tld

616

domain parts

2

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279789.28/warc/CC-MAIN-20250804114251-20250804144251-00720.warc.gz

source type

11

Server response

server ip

108.138.85.4

pubdate

2025-08-04 12:53:52

attempts

0

size orig

297307

size saved

196062

Content

page id

3155241520

title

Kard. Nycz: Świeckości państwa nie przekreślają krzyże

excerpt

content

...

author

updated

1767102063

Text analysis

block type

0

extracted fields

235

extracted bits

featured image
image author
title
full content
content was extracted heuristically
OpenGraph suggests this is an article

detected location

40

detected language

121 (Polish)

category id

Religia (116)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

1832

text words

304

text unique words

213

text lines

1

text sentences

24

text paragraphs

1

text words per sentence

12

text matched phrases

5

text matched dictionaries

1