id
type
7
status
21
review version
1
cleanup version
0
pending deletion
0
created at
2025-04-06 12:38:09
updated at
2025-11-02 23:50:33
url
https://bombelcafe.pl/regulamin/
url length
32
url crc
48674
url crc32
4070293026
location type
1
canonical status
10
canonical page id
domain id
domain tld
616
domain parts
2
originating warc id
-
originating url
https://bombelcafe.pl/page-sitemap.xml
source type
1
block type
0
extracted fields
136
extracted bits
title
OpenGraph suggests this is an article
detected location
0
detected language
121 (Polish)
category id
index version
2025110801
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
16685
text words
2663
text unique words
1082
text lines
217
text sentences
167
text paragraphs
14
text words per sentence
15
text matched phrases
14
text matched dictionaries
9
image author
featured image