Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-28 15:56:55

updated at

2025-10-28 15:56:56

Address

url

https://www.southernheatcorp.com/blog/page/2/

url length

45

url crc

26544

url crc32

2883413936

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2846676217

Source

domain id

80703669

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280245.49/warc/CC-MAIN-20250811034648-20250811064648-00943.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

144.208.84.130

Publication date

2025-08-11 04:36:51

Fetch attempts

0

Original html size

137168

Normalized and saved size

54942

Content

title

Blog - Southern Heat Corp - Page 2

excerpt

content

author

Southern Heat

updated

2025-11-12 22:44:19

Text analysis

block type

0

extracted fields

141

extracted bits

featured image
article author
title
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

Pogoda i klimat (34)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

17732

text words

3048

text unique words

1052

text lines

199

text sentences

165

text paragraphs

56

text words per sentence

18

text matched phrases

43

text matched dictionaries

2