Main

type

7 (about/contact/privacy/terms page)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-08-18 15:48:49

updated at

2026-01-01 22:03:55

Address

url

https://about.dc.gov/page/history-and-tourism

url length

45

url crc

54843

url crc32

3993753147

location type

4 (page_location points to new url in different domain)

canonical status

30 (canonical url is different, page_canonical_page_id points to it)

canonical page id

2607418472

location

https://dc.gov/page/history-and-tourism

Source

domain id

1420092

domain tld

2410

domain parts

0

originating warc id

-

originating url

https://about.dc.gov/

source type

4 (mainpage of this domain)

Server response

server ip

104.18.36.77

Publication date

2026-01-01 22:03:55

Fetch attempts

1

Original html size

74205

Normalized and saved size

31324

Content

title

History and Tourism

excerpt

content

author

updated

1768469297

Text analysis

block type

0

extracted fields

8

extracted bits

title

detected location

0

detected language

1 (English)

category id

Law and order [en] (220)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

997

text words

161

text unique words

90

text lines

56

text sentences

3

text paragraphs

0

text words per sentence

53

text matched phrases

1

text matched dictionaries

1