Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-05-12 18:20:04

updated at

2025-11-06 08:24:00

Address

url

https://www.spectator.co.uk/writer/mark-galeotti/

url length

49

url crc

16654

url crc32

4272439566

location type

1 (url matches target location, page_location is empty)

canonical status

30 (canonical url is different, page_canonical_page_id points to it)

canonical page id

2967603971

Source

domain id

83663339

domain tld

0

domain parts

0

originating warc id

-

originating url

https://spectator.co.uk/

source type

22 (Telegram)

Server response

server ip

192.0.66.195

Publication date

2025-11-06 08:24:00

Fetch attempts

1

Original html size

156439

Normalized and saved size

106579

Content

title

Mark Galeotti, Author at The Spectator

excerpt

content

Thursday 30 Oct 2025 Tuesday 30 Sep 2025 Wednesday 17 Sep 2025 Friday 5 Sep 2025 Tuesday 2 Sep 2025 Thursday 28 Aug 2025 Monday 18 Aug 2025 ...

author

updated

1765245085

Text analysis

block type

0

extracted fields

105

extracted bits

featured image
title
full content
content was extracted heuristically

detected location

0

detected language

126 (language undetectable (empty document, too short, or engines disagree))

category id

Pozostałe (16)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

317

text words

80

text unique words

28

text lines

1

text sentences

1

text paragraphs

0

text words per sentence

80

text matched phrases

0

text matched dictionaries

0