Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-11-16 07:26:11

updated at

2025-11-16 07:26:12

Address

url

https://www.api.gov.uk/he/historic-england-heritage-at-risk-register-2021/

url length

74

url crc

40005

url crc32

3209600069
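
As a rough illustration of how the url length and url crc32 fields above could be derived, here is a minimal sketch. It assumes that the length is the character count of the URL and that the checksum is a standard CRC-32 over the URL's UTF-8 bytes. The record does not state which CRC variant or input normalisation was used, so the computed checksum may differ from the stored value.

```python
import zlib

# URL copied from the record's "url" field.
url = "https://www.api.gov.uk/he/historic-england-heritage-at-risk-register-2021/"

# "url length" is taken here to be the character count of the URL string.
url_length = len(url)

# Assumption: "url crc32" is a standard CRC-32 over the UTF-8 bytes of the URL.
# The record does not document the variant or any normalisation step,
# so this value is illustrative and may not match the stored one.
url_crc32 = zlib.crc32(url.encode("utf-8")) & 0xFFFFFFFF

print("url length:", url_length)
print("url crc32 (assumed variant):", url_crc32)
```

The length agrees with the 74 recorded above. The checksum is printed only for comparison against the stored field.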

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2990009972

Source

domain id

294992912

domain tld

826

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279985.2/warc/CC-MAIN-20250807213109-20250808003109-00748.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

185.199.108.153

Publication date

2025-08-07 21:46:39

Fetch attempts

0

Original html size

44579

Normalized and saved size

43163

Content

title

Historic England Heritage at Risk Register 2021

excerpt

content

Historic England Heritage at Risk Register 2021. Overview. Data from the annual Heritage at Risk Register for 2021. Heritage at Risk provides an understanding of the overall state of England’s heritage assets. Every year Historic England updates the Heritage at Risk Register. The end result is a dynamic picture of the sites most at risk and most in need of safeguarding for the future. Assets may be assessed by using multiple methodologies so may appear multiple times. For example, a scheduled monument could be made up of archaeological remains and a standing structure. In this instance, the remains would be assessed using the archaeological risk assessment, and the structure using the buildings or structures assessment. Conservation Area information is not complete due to availability of Conservation Area spatial data. This data and its spatial depictions are purely indicative and are not a definitive representation. Users are advised to seek clarification and confir...

author

updated

1764777919

Text analysis

block type

0

extracted fields

105

extracted bits

featured image
title
full content
content was extracted heuristically

detected location

0

detected language

1 (English)

category id

-

index version

1

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

1126

text words

201

text unique words

116

text lines

1

text sentences

10

text paragraphs

1

text words per sentence

20
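
The words-per-sentence figure appears consistent with the word and sentence counts listed above. A minimal sketch of that arithmetic follows, assuming the value is the word count divided by the sentence count and reduced to an integer. Whether the pipeline truncates or rounds is not stated, and both give the same result for these counts.

```python
# Counts copied from the "text words" and "text sentences" fields of the record.
text_words = 201
text_sentences = 10

# Assumption: the stored value is words / sentences reduced to an integer.
# The record does not say whether truncation or rounding is used.
words_per_sentence_truncated = text_words // text_sentences
words_per_sentence_rounded = round(text_words / text_sentences)

print(words_per_sentence_truncated, words_per_sentence_rounded)
```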

text matched phrases

0

text matched dictionaries

0