id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-12-26 17:05:02
updated at
2025-12-26 17:05:02
url
http://a-novi.pl/podstrony/galeria.php?rid=4&galeria=projekty&katalog=magazyn%2Fimage%2Fprojekty%2Fanovi-57765b56ae881
url length
118
url crc
44605
url crc32
266972733
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
616
domain parts
2
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279750.4/warc/CC-MAIN-20250803230942-20250804020942-00678.warc.gz
source type
11 (CommonCrawl)
server ip
publication date
2025-08-04 00:04:54
fetch attempts
0
original html size
3033
normalized and saved size
3033
title
Concept design for a school building with a library and sports facilities
excerpt
content
author
updated
1767433963
block type
0
extracted fields
8
extracted bits
title
detected location
0
detected language
123 (uncertain Polish)
category id
Other (16)
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
137
text words
21
text unique words
12
text lines
2
text sentences
1
text paragraphs
0
text words per sentence
21
text matched phrases
0
text matched dictionaries
0
image author
featured image