id
name
processing priority
4
site type
0 (generic, awaiting analysis)
review version
11
html import
20 (imported)
first seen date
2023-09-26 11:49:25
expired found date
-
created at
2024-06-11 09:31:10
updated at
2025-04-18 16:50:44
length
10
crc
63927
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
-
previous id
0
replaced with id
0
related id
-
dns primary id
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
-
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
19615
mp size raw text
8220
mp inner links count
0
mp inner links status
1 (no links)
title
description
image
site name
author
updated
2025-12-25 02:02:43
raw text
xx/xx Patreon posts 06/24 A discussion of discussions on AI bias 05/24 What the FTC got wrong in the Google antitrust investigation 03/24 How web bloat impacts users with slow devices 02/24 Diseconomies of scale in fraud, spam, support, and moderation 02/24 Why it's impossible to agree on what's allowed 01/24 Notes on Cruise's pedestrian accident 01/24 Why do people post on [bad platform] instead of [good platform]? 12/23 How bad are search results? Let's compare Google, Bing, Marginalia, Kagi, Mwmbl, and ChatGPT 09/22 Futurist prediction methods and accuracy 04/22 In defense of simple architectures 03/22 Why is it so hard to buy things that work well? 02/22 Misidentifying talent 02/22 A decade of major cache incidents at Twitter 02/22 Cocktail party ideas 12/21 The container throttling problem 12/21 Some thoughts on writing 12/21 Some latency measurement pitfalls 11/21 Major errors on this blog (and their corrections) 11/21 Ind...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
AI [en] (229)
index version
2025123101
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
6127
text words
1388
text unique words
605
text lines
369
text sentences
44
text paragraphs
0
text words per sentence
31
text matched phrases
1
text matched dictionaries
6
rss path
rss status
32 (unknown)
rss found date
2023-12-27 11:44:10
rss size orig
6696858
rss items
128
rss spam phrases
27
rss detected language
1 (English)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)
sitemap review version
1
sitemap urls count
142
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2023-12-27 10:11:14
sitemap process date
2024-10-20 18:23:30
sitemap first import date
-
sitemap last import date
-