Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-07-06 15:49:43

updated at

2025-12-03 02:01:01

Address

url

https://robeson100.rutgers.edu/past-events

url length

42

url crc

9931

url crc32

1692149451

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2331974389

Source

domain id

413353097

domain tld

0

domain parts

0

originating warc id

-

originating url

https://robeson100.rutgers.edu/

source type

4 (mainpage of this domain)

Server response

server ip

23.185.0.4

Publication date

2025-12-03 02:01:01

Fetch attempts

1

Original html size

93155

Normalized and saved size

83172

Content

title

Past Events | Paul Robeson at Rutgers

excerpt

content

author

updated

1765179627

Text analysis

block type

0

extracted fields

8

extracted bits

title

detected location

0

detected language

1 (English)

category id

Edukacja (47)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

22490

text words

4292

text unique words

1247

text lines

347

text sentences

145

text paragraphs

42

text words per sentence

29

text matched phrases

44

text matched dictionaries

8