Main

type

5 (blog/news article)

status

30 (imported + raw text content deleted)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-04-11 14:03:45

updated at

2025-10-28 07:12:56

Address

url

https://www.annsimmons.com/blog/

url length

32

url crc

40657

url crc32

379821777

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

63401355

domain tld

0

domain parts

0

originating warc id

-

originating url

https://annsimmons.com/sitemap.xml

source type

1 (sitemap)

Server response

server ip

162.241.253.87

Publication date

2025-10-28 07:12:51

Fetch attempts

1

Original HTML size

135019

Normalized and saved size

51991

Content

title

Blog

excerpt

content

author

updated

1762427431

Text analysis

block type

0

extracted fields

137

extracted bits

featured image
title
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

Sound (196)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

5948

text words

1354

text unique words

530

text lines

169

text sentences

60

text paragraphs

10

text words per sentence

22

text matched phrases

4

text matched dictionaries

4