Main

type

0 (not classified)

status

30 (imported + raw text content deleted)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-13 01:52:11

updated at

2025-10-13 01:52:13

pol page id

2712099469

pol status

0

pol hosts ticketing

pol hosts ecommerce

pol hosts finance

pol hosts crypto

pol hosts leak

pol hosts devel

pol hosts ugc

pol hosts klim

pol hosts builders

pol hosts self subdomains

pol hosts other subdomains

pol hosts other domains

zhihaoruan.xyz

pol updated

1763265396

Address

url

https://www.grasp.upenn.edu/people/zhihao-ruan/

url length

47

url crc

44433

url crc32

647736721

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2712099469

Source

domain id

40605262

domain tld

2295

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151281020.56/warc/CC-MAIN-20250813024931-20250813054931-00982.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

158.130.67.172

Publication date

2025-08-13 03:50:57

Fetch attempts

0

Original html size

49449

Normalized and saved size

30490

Content

title

Zhihao Ruan - GRASP Lab

excerpt

content

author

updated

1763265396

Text analysis

block type

0

extracted fields

137

extracted bits

featured image
title
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

Edukacja (47)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

1452

text words

213

text unique words

84

text lines

97

text sentences

2

text paragraphs

0

text words per sentence

106

text matched phrases

10

text matched dictionaries

2