Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2024-10-21 08:09:22

updated at

2026-01-01 07:34:46

Address

url

https://warszawa.77c.eu/warszawa-rehabilitacja/

url length

47

url crc

51893

url crc32

2124335797

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

1809512741

Source

domain id

292965458

domain tld

0

domain parts

0

originating warc id

-

originating url

https://warszawa.77c.eu/post-sitemap.xml

source type

1 (sitemap)

Server response

server ip

188.210.222.2

Publication date

2025-07-20 13:37:06

Fetch attempts

0

Original html size

223543

Normalized and saved size

26676

Content

title

Warszawa sprzęt rehabilitacyjny ortopedyczny (Warsaw rehabilitation and orthopedic equipment)

excerpt

content

author

admin

updated

2026-01-12 12:11:37

Text analysis

block type

0

extracted fields

140

extracted bits

article author
title
OpenGraph suggests this is an article

detected location

40

detected language

121 (Polish)

category id

Medycyna (36, Medicine)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

8364

text words

1021

text unique words

295

text lines

84

text sentences

28

text paragraphs

18

text words per sentence

36

text matched phrases

61

text matched dictionaries

6