Main

type

0 (not classified)

status

30 (imported + raw text content deleted)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-05 07:56:38

updated at

2025-10-05 07:56:38

Address

url

https://vaxart.com/publications/

url length

32

url crc

30956

url crc32

1381595372

location type

1 (url matches target location, page_location is empty)

canonical status

30 (canonical url is different, page_canonical_page_id points to it)

canonical page id

2805215154

Source

domain id

200424345

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151573816.93/warc/CC-MAIN-20250814035453-20250814065453-00994.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

141.193.213.10

Publication date

2025-08-14 05:08:47

Fetch attempts

0

Original html size

124359

Normalized and saved size

55452

Content

title

Publications

excerpt

content

[av_section min_height=” min_height_pc=’25’ min_height_px=’500px’ padding=’huge’ custom_margin=’0px’ custom_margin_sync=’true’ svg_div_top=” svg_div_top_color=’#333333′ svg_div_top_width=’100′ svg_div_top_height=’50’ svg_div_top_max_height=’none’ svg_div_top_opacity=” svg_div_bottom=” svg_div_bottom_color=’#333333′ svg_div_bottom_width=’100′ svg_div_bottom_height=’50’ svg_div_bottom_max_height=’none’ svg_div_bottom_opacity=” color=’alternate_color’ background=’bg_color’ custom_bg=” background_gradient_direction=’vertical’ background_gradient_color1=’#000000′ background_gradient_color2=’#ffffff’ background_gradient_color3=” src=’https://vaxart.com/wp-content/uploads/2022/03/danilo-alvesd-Y14ONzYtxb4-unsplash-1030×687.jpg’ attachment=’34’ attachment_size=’large’ attach=’scroll’ position=’center center’ repeat=’stretch’ video=” video_ratio=’16:9′ overlay_enable=’aviaTBoverlay_enable’ overlay_opacity=’0.7′ overlay_color=’#000000′ overlay_pattern=” overlay_custom_pattern=” shadow=’no-shadow...

author

updated

1763388448

Text analysis

block type

0

extracted fields

232

extracted bits

title
full content
content was extracted heuristically
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

Medycyna (36)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

5

text cyrillic

0

text characters

20722

text words

3811

text unique words

938

text lines

1

text sentences

114

text paragraphs

1

text words per sentence

33

text matched phrases

12

text matched dictionaries

9