Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-12-04 22:49:32

updated at

2025-12-04 22:49:33

Address

url

https://www.miketysonundisputedtruth.com/2024/03/

url length

49

url crc

63931

url crc32

2393897403
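
The three derived address fields can be cross-checked against the url value. The sketch below is a minimal illustration, not the pipeline's actual code: `url_fields` is a hypothetical helper, and it assumes the checksum is the standard CRC-32 of the URL's UTF-8 bytes, which the record does not state. What the recorded values themselves do show is that the url length matches the string length and that url crc equals the low 16 bits of url crc32.

```python
import zlib

# Values copied from the record above.
URL = "https://www.miketysonundisputedtruth.com/2024/03/"
RECORDED_LENGTH = 49
RECORDED_CRC = 63931
RECORDED_CRC32 = 2393897403


def url_fields(url: str) -> dict:
    """Hypothetical helper: derive length and checksum fields from a URL.

    Assumption: the checksum is the standard CRC-32 of the UTF-8 bytes.
    The record does not say which variant or encoding is actually used.
    """
    crc32 = zlib.crc32(url.encode("utf-8")) & 0xFFFFFFFF
    return {"length": len(url), "crc32": crc32, "crc": crc32 & 0xFFFF}


fields = url_fields(URL)

# Checks that rely only on the recorded values.
print("length matches:", fields["length"] == RECORDED_LENGTH)
print("crc is low 16 bits of crc32:", RECORDED_CRC32 & 0xFFFF == RECORDED_CRC)

# This comparison depends on the CRC-32 assumption above and may not hold.
print("recomputed crc32 matches:", fields["crc32"] == RECORDED_CRC32)
```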

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

47696545

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279890.42/warc/CC-MAIN-20250806043623-20250806073623-00663.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

3.167.56.101

Publication date

2025-08-06 05:35:49

Fetch attempts

0

Original html size

88188

Normalized and saved size

57850

Content

title

March 2024 - Mike-Tyson-Undisputed-Truth

excerpt

content

author

by

updated

2026-01-09 18:08:33

Text analysis

block type

0

extracted fields

12

extracted bits

article author
title

detected location

0

detected language

1 (English)

category id

Spam (233)

index version

2025123101

paywall score

0

spam phrases

142

Text statistics

text nonlatin

0

text cyrillic

0

text characters

16367

text words

2954

text unique words

786

text lines

162

text sentences

141

text paragraphs

67

text words per sentence

20
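
The words-per-sentence value is consistent with integer division of the word count by the sentence count. The short check below uses only the numbers recorded above; that the pipeline truncates rather than rounds is an assumption inferred from those numbers.

```python
# Values copied from the text statistics above.
TEXT_WORDS = 2954
TEXT_SENTENCES = 141
RECORDED_WORDS_PER_SENTENCE = 20

# Assumption: the field is the truncated (floor) quotient, not a rounded mean.
words_per_sentence = TEXT_WORDS // TEXT_SENTENCES

print("truncated quotient matches record:",
      words_per_sentence == RECORDED_WORDS_PER_SENTENCE)
print("rounded mean would be:", round(TEXT_WORDS / TEXT_SENTENCES))
```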

text matched phrases

108

text matched dictionaries

5