id
type
0 (not classified)
status
30 (imported + raw text content deleted)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-10-07 05:37:37
updated at
2025-10-07 05:37:38
url
https://annual.wikimedia.org/2016/fact-10.html
url length
46
url crc
31861
url crc32
3404627061
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
2688
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151567216.67/warc/CC-MAIN-20250813090531-20250813120531-00697.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-13 10:12:07
Fetch attempts
0
Original html size
26666
Normalized and saved size
25599
title
239 million people used the internet for the first time in 2016
excerpt
content
It is estimated that 61% of the world does not have access to the internet. But that’s changing quickly. Over the past decade, a quarter of the world’s population has connected to the internet, many through mobile devices. In India alone, more than 100 million people have gained internet access every year since 2014. In 2016, the Wikimedia Foundation created a task force to understand the needs of new internet users. The New Readers team researched internet use in eight countries and traveled to Mexico, India, and Nigeria to interview nearly 200 people about their information-seeking habits. The findings portray internet use as dominated by mobile devices, limited connectivity, task-oriented browsing, and trust in the search bar over specific web properties. Now the New Readers team is working with colleagues across the Foundation on new solutions to help readers in places with low to limited internet connectivity. The Reading team is building mobile featur...
author
updated
1762865425
block type
0
extracted fields
105
extracted bits
featured image
title
full content
content was extracted heuristically
detected location
0
detected language
1 (English)
category id
index version
2025110801
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
1141
text words
221
text unique words
132
text lines
1
text sentences
12
text paragraphs
1
text words per sentence
18
text matched phrases
1
text matched dictionaries
1
links self subdomains
0
links other subdomains
0
links other domains
0
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
5
links ext klim
0
links ext generic
0
image author