Main

type

5

status

21

review version

0

cleanup version

0

pending deletion

0

created at

2025-12-11 14:18:43

updated at

2025-12-11 14:18:43

Address

url

https://studio-legale.com/2021/11/

url length

34

url crc

25301

url crc32

40592085

location type

1

canonical status

2

canonical page id

-

Source

domain id

41387980

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279837.14/warc/CC-MAIN-20250805001900-20250805031900-00846.warc.gz

source type

11

Server response

server ip

172.67.74.252

pubdate

2025-08-05 01:02:57

attempts

0

size orig

161889

size saved

103970

Content

page id

3131763816

title

november 2021 - STUDIO-LEGALE

excerpt

content

author

updated

1767295296

Text analysis

block type

0

extracted fields

8

extracted bits

title

detected location

0

detected language

126 (language undetectable (empty document, too short, or engines disagree))

category id

Ransomware (18)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

54377

text words

9569

text unique words

2704

text lines

448

text sentences

417

text paragraphs

137

text words per sentence

22

text matched phrases

40

text matched dictionaries

8