Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-21 07:34:58

updated at

2025-10-21 07:34:58

Address

url

https://www.imm-cologne.com/imm-cologne-exhibitors/list-of-exhibitors/favorites.php/press?route=merkliste

url length

105

url crc

10094

url crc32

4283836270
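
The url length and url crc32 fields can be recomputed from the url value. A minimal sketch follows, assuming the length is counted in UTF-8 bytes and that the checksum is the standard CRC-32 exposed by Python's `zlib`. The record does not state which checksum variant the indexer uses, so the printed CRC may differ from the stored value. The helper names `url_length` and `url_crc32` are hypothetical.

```python
import zlib

# URL copied verbatim from the "url" field of this record.
URL = (
    "https://www.imm-cologne.com/imm-cologne-exhibitors/"
    "list-of-exhibitors/favorites.php/press?route=merkliste"
)


def url_length(url: str) -> int:
    """Length of the URL in UTF-8 bytes (assumed unit of the 'url length' field)."""
    return len(url.encode("utf-8"))


def url_crc32(url: str) -> int:
    """Unsigned standard CRC-32 of the URL bytes (assumed checksum variant)."""
    return zlib.crc32(url.encode("utf-8")) & 0xFFFFFFFF


if __name__ == "__main__":
    print(url_length(URL))  # 105, matching the "url length" field
    print(url_crc32(URL))   # compare manually against the "url crc32" field
```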

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

44284769

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280906.43/warc/CC-MAIN-20250812014435-20250812044435-00937.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

3.167.88.15

Publication date

2025-08-12 02:00:47

Fetch attempts

0

Original html size

91094

Normalized and saved size

43324

Content

title

Favorites

excerpt

content

Favorites of the imm cologne 2024 Favorites - Review 2024 Exhibition tour Organize personal schedule Save selection Exhibitor (0) No favorites available! Products (0) No favorites available! Events (0) No favorites available! Speaker (0) No favorites available!

author

updated

1762279916
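
The updated value looks like a Unix timestamp in seconds. A short sketch of converting it to a readable date follows, assuming it counts seconds since 1970-01-01 UTC. The record does not state the time zone of its other date fields, and the helper name `to_utc` is hypothetical.

```python
from datetime import datetime, timezone


def to_utc(timestamp: int) -> datetime:
    """Interpret an integer as seconds since the Unix epoch, in UTC."""
    return datetime.fromtimestamp(timestamp, tz=timezone.utc)


if __name__ == "__main__":
    # Value taken from the "updated" field of this record.
    print(to_utc(1762279916).isoformat())  # 2025-11-04T18:11:56+00:00
```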

Text analysis

block type

0

extracted fields

105

extracted bits

featured image
title
full content
content was extracted heuristically
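
The extracted fields value (105) appears to be a bitmask: it has four bits set, and four flags are listed under extracted bits. The sketch below only lists which bit positions are set. Which position corresponds to which flag is not stated in this record, so no mapping is assumed. The helper name `set_bits` is hypothetical.

```python
def set_bits(mask: int) -> list[int]:
    """Return the zero-based positions of the bits set in a non-negative mask."""
    return [i for i in range(mask.bit_length()) if (mask >> i) & 1]


if __name__ == "__main__":
    # Value taken from the "extracted fields" field of this record.
    print(set_bits(105))  # [0, 3, 5, 6]
```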

detected location

0

detected language

126 (language undetectable: empty document, too short, or engines disagree)

category id

Other (16)

index version

2025103102

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

212

text words

36

text unique words

21

text lines

1

text sentences

4

text paragraphs

1

text words per sentence

9
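
The words-per-sentence figure is consistent with dividing text words by text sentences (36 / 4 = 9). A minimal sketch follows, assuming integer division. Whether the indexer floors or rounds is not stated here, and the helper name `words_per_sentence` is hypothetical.

```python
def words_per_sentence(words: int, sentences: int) -> int:
    """Average words per sentence, floored; 0 when there are no sentences."""
    if sentences <= 0:
        return 0
    return words // sentences


if __name__ == "__main__":
    # Values taken from "text words" and "text sentences" in this record.
    print(words_per_sentence(36, 4))  # 9
```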

text matched phrases

0

text matched dictionaries

0