Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-14 07:18:33

updated at

2025-10-14 07:18:34

Address

url

https://www.insightvacations.com/blog/battlefields-of-wwi-and-wwii/

url length

67

url crc

42387

url crc32

449029523

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2721172786

Source

domain id

112088591

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151281008.23/warc/CC-MAIN-20250812234112-20250813024112-00797.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

13.249.39.100

Publication date

2025-08-13 01:22:49

Fetch attempts

0

Original html size

393183

Normalized and saved size

97732

Content

title

Explore the Battlefields of WWI and WWII on This New Tour

excerpt

content

author

Leanne Williams

updated

1763480164

Text analysis

block type

0

extracted fields

141

extracted bits

featured image
article author
title
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

Podróże (51)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

8591

text words

1769

text unique words

717

text lines

116

text sentences

74

text paragraphs

25

text words per sentence

23

text matched phrases

21

text matched dictionaries

6