id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
1
cleanup version
0
pending deletion
0 (-)
created at
2026-01-17 12:00:04
updated at
2026-01-17 12:00:05
url
http://astrovansonly.com/history.html
url length
37
url crc
62132
url crc32
2588603060
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
2211
domain parts
2
originating warc id
6580405
originating url
source type
11 (CommonCrawl)
server ip
Publication date
2025-07-16 14:18:03
Fetch attempts
0
Original html size
11685
Normalized and saved size
11391
title
History
excerpt
content
1985 Sed ut perspiciatis, unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam eaque ipsa, quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt, explicabo. Nemo enim ipsam voluptatem, quia voluptas sit, aspernatur aut odit aut fugit, sed quia consequuntur magni dolores eos, qui ratione voluptatem sequi nesciunt, neque porro quisquam est, qui dolorem ipsum.Sed ut perspiciatis, unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam eaque ipsa, quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt, explicabo. Nemo enim ipsam voluptatem, quia voluptas sit, aspernatur aut odit aut fugit, sed quia consequuntur magni dolores eos, qui ratione voluptatem sequi nesciunt, neque porro quisquam est, qui dolorem ipsum.Sed ut perspiciatis, unde omnis iste natus error sit voluptatem accusantium doloremque laudant...
author
updated
1769140847
block type
0
extracted fields
104
extracted bits
title
full content
content was extracted heuristically
detected location
0
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
-
index version
1
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
2936
text words
517
text unique words
68
text lines
1
text sentences
14
text paragraphs
1
text words per sentence
36
text matched phrases
0
text matched dictionaries
0
image author
featured image