Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-12-23 12:25:15

updated at

2025-12-23 12:25:15

Address

url

https://www.eeri.org/chapters/norcal/news-and-events

url length

52

url crc

25137

url crc32

2445959729

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

84774567

domain tld

2688

domain parts

2

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279808.15/warc/CC-MAIN-20250804145214-20250804175214-00309.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

184.154.166.212

Publication date

2025-08-04 16:14:54

Fetch attempts

0

Original html size

115086

Normalized and saved size

69960

Content

title

Northern California Chapter - News and Announcements

excerpt

content

author

updated

1767047057

Text analysis

block type

0

extracted fields

8

extracted bits

title

detected location

0

detected language

1 (English)

category id

Other [en] (231)

index version

2025123101

paywall score

52

spam phrases

1

Text statistics

text nonlatin

0

text cyrillic

0

text characters

14842

text words

2806

text unique words

826

text lines

355

text sentences

133

text paragraphs

38

text words per sentence

21

text matched phrases

15

text matched dictionaries

7