Main

type

0 (not classified)

status

20 (imported old-v1, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-03-23 17:56:49

updated at

2025-05-27 10:23:38

pol page id

1918893710

pol status

0

pol hosts ticketing

pol hosts ecommerce

amazon.com

pol hosts finance

pol hosts crypto

pol hosts leak

pol hosts devel

pol hosts ugc

instagram.com facebook.com twitter.com

pol hosts klim

pol hosts builders

pol hosts self subdomains

pol hosts other subdomains

fast.fonts.net

pol hosts other domains

matthewmarks.com simonandschuster.biz audible.com

pol updated

1749227732

Address

url

http://origin.www.annetruitt.org/news

url length

37

url crc

8052

url crc32

2395676532

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

95140705

domain tld

0

domain parts

0

originating warc id

-

originating url

https://annetruitt.org/sitemap.xml

source type

1 (sitemap)

Server response

server ip

-

Publication date

2025-05-27 10:23:38

Fetch attempts

0

Original html size

49603

Normalized and saved size

48816

Text analysis

block type

0

extracted fields

4

extracted bits

detected location

0

detected language

1 (English)

category id

Pozostałe (16)

index version

2025061201

paywall score

0

spam phrases

0