Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-17 18:53:52

updated at

2026-01-12 03:09:25

Address

url

https://dubheadz.co.uk/2024/10/28/hello-world/

url length

46

url crc

21865

url crc32

4149171561

location type

4 (page_location points to new url in different domain)

canonical status

30 (canonical url is different, page_canonical_page_id points to it)

canonical page id

3568789550

location

https://www.dubheadz.co.uk/2024/10/28/hello-world/

Source

domain id

438586757

domain tld

826

domain parts

0

originating warc id

-

originating url

https://dubheadz.co.uk/

source type

4 (mainpage of this domain)

Server response

server ip

67.205.26.243

Publication date

2026-01-12 03:09:21

Fetch attempts

1

Original html size

104340

Normalized and saved size

21338

Content

title

Hello world! - Just another WordPress site

excerpt

content

author

updated

1769824969

Text analysis

block type

0

extracted fields

136

extracted bits

title
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

-

index version

1

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

523

text words

106

text unique words

74

text lines

26

text sentences

10

text paragraphs

0

text words per sentence

10

text matched phrases

0

text matched dictionaries

0