Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-06 22:18:16

updated at

2025-10-06 22:18:18

Address

url

https://28i.com.br/2015/10/21/hello-world/

url length

42

url crc

2093

url crc32

2292844589

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2693129538

Source

domain id

299520094

domain tld

76

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151572092.97/warc/CC-MAIN-20250813183110-20250813213110-00054.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

191.101.104.243

Publication date

2025-08-13 19:59:29

Fetch attempts

0

Original html size

29442

Normalized and saved size

8772

Content

title

Branding. Marketing. – 28i/Agency

excerpt

content

Vivamus magna justo, lacinia eget consectetur sed, convallis at tellus. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia Curae; Donec velit neque, auctor sit amet aliquam vel, ullamcorper sit amet ligula. Proin eget tortor risus. Sed porttitor lectus nibh. Curabitur arcu erat, accumsan id imperdiet et, porttitor at sem. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia Curae; Donec velit neque, auctor sit amet aliquam vel, ullamcorper sit amet ligula. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia Curae; Donec velit neque, auctor sit amet aliquam vel, ullamcorper sit amet ligula. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia Curae; Donec velit neque, auctor sit amet aliquam vel, ullamcorper sit amet ligula. Pellentesque in ipsum id orci porta dapibu...

author

updated

1762324191

Text analysis

block type

0

extracted fields

104

extracted bits

title
full content
content was extracted heuristically

detected location

0

detected language

126 (language undetectable (empty document, too short, or engines disagree))

category id

Lorem ipsum (237)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

781

text words

143

text unique words

53

text lines

1

text sentences

10

text paragraphs

1

text words per sentence

14

text matched phrases

20

text matched dictionaries

1