id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-12-03 13:08:52
updated at
2025-12-03 13:08:54
pol page id
pol status
0
pol hosts ticketing
pol hosts ecommerce
pol hosts finance
pol hosts crypto
pol hosts leak
pol hosts devel
pol hosts ugc
pol hosts klim
pol hosts builders
pol hosts self subdomains
pol hosts other subdomains
pol hosts other domains
fragile.earth
pol updated
1767011625
url
https://www.100artworks.today/TheWoundsOfWorld.html
url length
51
url crc
42363
url crc32
3251742075
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
2929
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279908.86/warc/CC-MAIN-20250806105403-20250806135403-00660.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-06 11:58:25
Fetch attempts
0
Original html size
10285
Normalized and saved size
8353
title
100 Artworks Today: The Wounds of World
excerpt
content
The Wounds of World The most inaccessible and remote places on earth are adversely affected by human activity. In 'The Wounds of World' we see a wilderness landscape riven by the scars of our neglect. Our insatiable desire for comfort and transport requires more energy than the earth's resources can sustain. Fifty thousand square miles of forest are lost each year as we strip the earth of its vegetation. We pierce the earth's skin up to four kilometers below its surface and remove over sixty billion tonnes of raw materials each year. We make far more produ...
author
Mike de sousa
updated
1767011625
block type
0
extracted fields
108
extracted bits
article author
title
full content
content was extracted heuristically
detected location
0
detected language
1 (English)
category id
Medicine [en] (226)
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
1496
text words
323
text unique words
192
text lines
1
text sentences
16
text paragraphs
1
text words per sentence
20
text matched phrases
1
text matched dictionaries
2
links self subdomains
0
links other subdomains
0
links other domains
1
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
0
links ext klim
0
links ext generic
0
image author
featured image