Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-11-10 21:06:23

updated at

2025-11-10 21:06:24

Address

url

https://www.botify.com/blog/suggested-patterns-simon

url length

52

url crc

44156

url crc32

3678907516

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2950705860

Source

domain id

40468202

domain tld

2211

domain parts

2

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280040.84/warc/CC-MAIN-20250808192612-20250808222612-00907.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

52.85.132.88

Publication date

2025-08-08 20:56:43

Fetch attempts

0

Original html size

70444

Normalized and saved size

47760

Content

title

Simon Explains Suggested Patterns | Botify

excerpt

content

April 27, 2014Annabelle BouardDirector of Education & Training Services Hello Simon! Could you please introduce yourself?Yes, of course. My name is Simon Dollé, I'm a Research Engineer at Botify. I'm the lead dev for the Suggested Patterns functionality. I work on another project as well, which is top secret ( :-) ) and should yield results around the end of the second quarter of 2014.Can you explain what the Suggested Patterns aim at?As you know, Botify is a powerful SEO analytics application that performs automated audits of websites. The application's goal is to quickly identify all SEO optimizations that could result in increased traffic and revenue for your website. These automated analyses are based on your website's crawl data: during our crawl, we collect a huge amount of information, such as depth, number of pages, title tags, performance, etc.This information represents a very large amount of data. One needs to be able to analyze this data and interpret results to adeq...

author

updated

1764184384

Text analysis

block type

0

extracted fields

105

extracted bits

featured image
title
full content
content was extracted heuristically

detected location

0

detected language

1 (English)

category id

Edukacja (47)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

4264

text words

897

text unique words

367

text lines

1

text sentences

32

text paragraphs

1

text words per sentence

28

text matched phrases

1

text matched dictionaries

1