Main

type

0

status

10

review version

0

cleanup version

0

pending deletion

0

created at

2025-07-04 12:36:21

updated at

2026-02-07 19:22:14

Address

url

https://www.cloudflare.com/press-releases/2025/cloudflare-just-changed-how-ai-crawlers-scrape-the-internet-at-large/

url length

116

url crc

45122

url crc32

2881663042

location type

1

canonical status

10

canonical page id

2321763204

Source

domain id

173561409

domain tld

0

domain parts

0

originating warc id

-

originating url

https://cloudflare.com/

source type

21

Server response

server ip

104.16.124.96

pubdate

2025-07-11 12:18:06

attempts

0

size orig

259485

size saved

236581

Text analysis

block type

0

extracted fields

105

extracted bits

featured image
title
full content
content was extracted heuristically

detected location

0

detected language

1 (English)

category id

Dziennikarze (129)

index version

1

paywall score

1

spam phrases

0

Text statistics

text nonlatin

14

text cyrillic

0

text characters

24502

text words

4516

text unique words

1284

text lines

1

text sentences

140

text paragraphs

1

text words per sentence

32

text matched phrases

6

text matched dictionaries

5