Main

type

5 (blog/news article)

status

30 (imported + raw text content deleted)

review version

1

cleanup version

2

pending deletion

0 (-)

created at

2024-03-14 14:34:14

updated at

2025-12-07 20:20:13

Address

url

https://apartdomek.pl/2017/11/03/hello-world/

url length

45

url crc

41012

url crc32

2113249332

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

712068763

Source

domain id

67229249

domain tld

616

domain parts

2

originating warc id

-

originating url

https://apartdomek.pl/wp-sitemap-posts-post-1.xml

source type

1 (sitemap)

Server response

server ip

91.205.73.224

Publication date

2025-08-05 17:00:08

Fetch attempts

0

Original html size

99717

Normalized and saved size

14033

Content

title

Hello world!

excerpt

content

author

updated

1768516427

Text analysis

block type

0

extracted fields

136

extracted bits

title
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

Spam (233)

index version

2025123101

paywall score

0

spam phrases

3

Text statistics

text nonlatin

0

text cyrillic

0

text characters

202

text words

38

text unique words

35

text lines

12

text sentences

6

text paragraphs

0

text words per sentence

6

text matched phrases

3

text matched dictionaries

1