id
processing priority
4
site type
3 (personal blog or private political site, e.g. Blogspot, Substack, also small blogs on own domains)
review version
11
html import
20 (imported)
first seen date
2024-08-26 09:21:00
expired found date
-
created at
2024-08-26 09:21:00
updated at
2024-08-28 13:33:18
length
21
crc
28697
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
31727457 (substack.com)
previous id
0
replaced with id
0
related id
-
dns primary id
0
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
392314
mp size raw text
4110
mp inner links count
19
mp inner links status
10 (links queued, awaiting import)
title
description
image
site name
author
Davis Blalock
updated
2026-02-27 17:18:19
raw text
Davis Summarizes Papers | Davis Blalock | Substack 2024-8-25: Scaling curves for All of the Things Good news: we got a bunch of important findings this week. 1 hr ago • Davis Blalock 2024-8-4 arXiv roundup: LLama 3.1, training a 100T biological neural net In case you’re wondering what I’ve been up to instead of posting for the past couple months, I was kicking off a training run for a 100T parameter… Aug 5 • Davis Blalock 2024-4-28 arXiv roundup: data and scaling, backlog highlights part 3 Besides getting to cover unusually interesting wor...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
index version
1
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
3054
text words
574
text unique words
253
text lines
169
text sentences
10
text paragraphs
4
text words per sentence
57
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
0
links other domains
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
21 - substack.com, artificialintelligencemadesimple.substack.com, thetechbuzz.substack.com, thegradientpub.substack.com
links ext klim
0
links ext generic
0
dol status
0
dol updated
2026-02-27 17:18:19
rss path
rss status
1 (priority 1 already searched, no matches found)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)
sitemap review version
1
sitemap urls count
100
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2024-08-28 06:41:39
sitemap process date
2024-08-28 06:41:40
sitemap first import date
-
sitemap last import date
-