id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-12-27 23:02:10
updated at
2025-12-27 23:02:10
url
https://freeexpression.law/2019/08/13/harvard-cyberlaw-clinic-files-amicus-brief-arguing-for-broader-access-to-government-databases/
url length
132
url crc
63780
url crc32
1874393380
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
2541
domain parts
2
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279722.3/warc/CC-MAIN-20250803165217-20250803195217-00368.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-03 18:08:52
Fetch attempts
0
Original html size
39391
Normalized and saved size
4780
title
Harvard Cyberlaw Clinic files amicus brief arguing for broader access to government databases
excerpt
content
author
Alena Farber, Ian Kalish
updated
1768246100
block type
0
extracted fields
141
extracted bits
featured image
article author
title
OpenGraph suggests this is an article
detected location
0
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
-
index version
1
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
107
text words
17
text unique words
17
text lines
1
text sentences
1
text paragraphs
0
text words per sentence
17
text matched phrases
0
text matched dictionaries
0
image author