id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-11-14 14:58:05
updated at
2025-11-14 14:58:06
url
https://www.allsilver.pl/pl/c1000040,broszki
url length
44
url crc
33852
url crc32
2478933052
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
616
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280019.53/warc/CC-MAIN-20250808034803-20250808064803-00905.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-08 04:08:14
Fetch attempts
0
Original html size
79684
Normalized and saved size
78115
title
Biżuteria » BIŻUTERIA SREBRNA » BROSZKI
excerpt
content
BROSZKI Wyświetl na stronie: 21 42 63 wszystkie 15% + 5% Symbol: DPSBRO-0001 BROSZKA Waga: 4.38g 15% + 5% Symbol: DPBRO-0006 BROSZKA Waga: 5.6g 15% + 5% Symbol: DPBRO-0004 BROSZKA Waga: 6.2g 15% + 5% Symbol: DPBRO-0008 BROSZKA Waga: 4.09g 15% + 5% Symbol: 17315BH BROSZKA Waga: 9.86g 15% + 5% Symbol: AP25275 BROSZKA Waga: 6.1g 15% + 5% Symbol: AP25281 BROSZKA Waga: 6.93g 15% + 5% Symbol: AP25282 BROSZKA Waga: 5.92g 15% + 5% Symbol: AP25271 ...
author
Bartek
updated
1764904401
block type
0
extracted fields
108
extracted bits
article author
title
full content
content was extracted heuristically
detected location
0
detected language
121 (Polish)
category id
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
727
text words
184
text unique words
62
text lines
1
text sentences
1
text paragraphs
1
text words per sentence
184
text matched phrases
2
text matched dictionaries
2
image author
featured image