id
type
0
status
21
review version
0
cleanup version
0
pending deletion
0
created at
2025-11-12 12:22:26
updated at
2025-11-12 12:22:27
url
https://schronisko.zyrardow.pl/22,o-schronisku
url length
46
url crc
33325
url crc32
2956886573
location type
1
canonical status
2
canonical page id
-
domain id
domain tld
616
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280029.61/warc/CC-MAIN-20250808131348-20250808161348-00707.warc.gz
source type
11
page id
title
O schronisku | Schronisko Żyrardów im. psa Kazana
excerpt
content
author
Schronisko Żyrardów im. psa Kazana
updated
1763671406
block type
0
extracted fields
12
extracted bits
article author
title
detected location
0
detected language
121 (Polish)
category id
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
4085
text words
645
text unique words
276
text lines
148
text sentences
26
text paragraphs
2
text words per sentence
24
text matched phrases
15
text matched dictionaries
7
image author
featured image