Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

1

cleanup version

0

pending deletion

0 (-)

created at

2025-03-09 00:32:32

updated at

2026-01-01 17:11:24

Address

url

https://www.noterro.com/blog

url length

28

url crc

11933

url crc32

1957637789

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

1849526246

Source

domain id

13726883

domain tld

2211

domain parts

2

originating warc id

-

originating url

https://noterro.com/sitemap.xml

source type

1 (sitemap)

Server response

server ip

18.211.166.153

Publication date

2025-08-03 11:40:51

Fetch attempts

0

Original html size

439250

Normalized and saved size

106163

Content

title

Blog

excerpt

content

author

updated

1768766008

Text analysis

block type

0

extracted fields

9

extracted bits

featured image
title

detected location

0

detected language

1 (English)

category id

Medicine (36)

index version

1

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

8308

text words

1342

text unique words

374

text lines

391

text sentences

22

text paragraphs

0

text words per sentence

61

text matched phrases

0

text matched dictionaries

0
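
Derived fields check

Two values in this record can be recomputed from other fields: url length from url, and text words per sentence from text words and text sentences. The sketch below is only an illustration. The function names are hypothetical, and integer division is an assumption about how the crawler rounds, since the record does not say how the value is produced.

```python
def url_length(url: str) -> int:
    """Length of the URL in characters (assumed to be what 'url length' stores)."""
    return len(url)


def words_per_sentence(words: int, sentences: int) -> int:
    """Average words per sentence, using integer division (an assumption)."""
    if sentences == 0:
        return 0  # avoid division by zero for records with no sentences
    return words // sentences


# Values copied from the record above.
record_url = "https://www.noterro.com/blog"
text_words = 1342
text_sentences = 22

print(url_length(record_url))                          # 28
print(words_per_sentence(text_words, text_sentences))  # 61
```

Both results agree with the stored values (28 and 61). An average of 61 words per sentence next to 391 text lines and 0 paragraphs suggests the extracted text is mostly short list or navigation lines rather than running prose.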