Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-13 22:24:16

updated at

2025-10-13 22:24:18

Address

url

https://www.biontech.com/int/en/home/newsroom.html

url length

50

url crc

12714

url crc32

2354655658

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2719407684

Source

domain id

116176972

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151281020.56/warc/CC-MAIN-20250813024931-20250813054931-00088.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

23.48.203.102

Publication date

2025-08-13 04:23:07

Fetch attempts

0

Original html size

835136

Normalized and saved size

831097

Content

title

Newsroom

excerpt

content

Welcome to the BioNTech Newsroom In this section you can find a variety of media materials and useful information to download. Media Downloads Conference "Working Together to Promote Vaccine Equity for Africa", December 18, 2023 ...

author

updated

1762095708

Text analysis

block type

0

extracted fields

104

extracted bits

title
full content
content was extracted heuristically

detected location

0

detected language

1 (English)

category id

Koronawirus (17)

index version

2025103102

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

7227

text words

1285

text unique words

292

text lines

1

text sentences

13

text paragraphs

1

text words per sentence

98

text matched phrases

45

text matched dictionaries

5