Main

processing priority

4

site type

3 (personal blog or private political site, e.g. Blogspot, Substack, also small blogs on own domains)

review version

11

html import

20 (imported)

Events

first seen date

2023-11-13 14:49:38

expired found date

-

created at

2024-06-06 09:24:02

updated at

2025-12-29 06:54:27

Domain name statistics

length

29

crc

27583

tld

2211

nm parts

0

nm random digits

0

nm rare letters

0

Connections

is subdomain of id

69893241 (blogspot.com)

previous id

0

replaced with id

0

related id

-

dns primary id

0

dns alternative id

0

lifecycle status

0 (unclassified, or currently active)

Subdomains and pages

deleted subdomains

0

page imported products

0

page imported random

0

page imported parking

0

Error counters

count skipped due to recent timeouts on the same server IP

0

count content received but rejected due to 11-799

0

count dns errors

0

count cert errors

0

count timeouts

0

count http 429

0

count http 404

0

count http 403

0

count http 5xx

0

next operation date

-

Server

server bits

server ip

-

Mainpage statistics

mp import status

20

mp rejected date

-

mp saved date

-

mp size orig

173847

mp size raw text

38467

mp inner links count

80

mp inner links status

20 (imported)

Open Graph

title

Sapping Attention

description

Digital Humanities: Using tools from the 1990s to answer questions from the 1960s about 19th century America.

image

site name

author

updated

2025-12-17 00:33:42

raw text

Sapping Attention Sapping Attention Digital Humanities: Using tools from the 1990s to answer questions from the 1960s about 19th century America. Thursday, February 3, 2022 What's in the Hathi Trust? (This is a post I've had unpublished since writing it in 2016. Just hitting publish without reviewing right now because it's something I find myself periodically looking at the charts for). As we get ready to launch the full Hathi Trust+Bookworm to allow tracking words across 13 million books, I've been working on fixing up the metadata from the original MARC records. This is useful information to have for anyone using Hathi to find books; it's hard to know the general outlines of a collection like this. So what follows are some general outlines about what books are included in the Hathi Trust. This is closely related, by the way, to what books are included in Google Books; more on that below. One hugely important question is where the books come from. Different libraries ha...

Text analysis

redirect type

0 (-)

block type

0 (no issues)

detected language

1 (English)

category id

Edukacja (47)

index version

2025110801

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

30425

text words

6282

text unique words

1738

text lines

404

text sentences

269

text paragraphs

80

text words per sentence

23

text matched phrases

8

text matched dictionaries

6

RSS

rss status

32 (unknown)

rss found date

2024-01-01 01:51:40

rss size orig

410785

rss items

25

rss spam phrases

0

rss detected language

1 (English)

inbefore feed id

-

inbefore status

0 (new)

Sitemap

sitemap status

30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)

sitemap review version

1

sitemap urls count

168

sitemap urls adult

0

sitemap filtered products

0

sitemap filtered videos

0

sitemap found date

2024-01-08 13:55:53

sitemap process date

2024-08-15 13:17:54

sitemap first import date

-

sitemap last import date

-