id
processing priority
4
site type
3 (personal blog or private political site, e.g. Blogspot, Substack, also small blogs on own domains)
review version
11
html import
20 (imported)
first seen date
2024-03-08 16:28:53
expired found date
-
created at
2024-06-10 04:42:41
updated at
2026-01-07 10:50:52
length
25
crc
51897
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
13642151 (wordpress.com)
previous id
0
replaced with id
0
related id
-
dns primary id
0
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
231624
mp size raw text
67830
mp inner links count
23
mp inner links status
20 (imported)
title
PaulingBlog
description
Exploring the life and work of Linus Pauling, history's only recipient of two unshared Nobel Prizes.
site name
PaulingBlog
author
updated
2025-12-22 10:35:43
raw text
PaulingBlog | Exploring the life and work of Linus Pauling, history's only recipient of two unshared Nobel Prizes. Home About Us Creating The Pauling Catalogue PaulingBlog Entries RSS | Comments RSS Thanks for Reading! 1,335,621 views Pages About Us Creating The Pauling Catalogue Categories Ava Helen Pauling (29) Colleagues of Pauling (141) Documentary History Websites (307) DNA (19) Hemoglobin & Sickle Cell Anemia (43) Nature of the Chemical Bond (82) Peace Activism (138) Scientific War Work (15) Structure of Proteins (30) Facets of Linus Pauling (103) Featured Documents (15) General Chemistry (7) Guggenheim Foundation (17) Just for fun (22) Lawsuits (7) Linus Pauling Institute (32) Orthomolecular Medicine (80) Patents (13) Pauling and Oregon (66) Pauling as Administrator (20) Pauling-related Events (40) Peter Pauling (10) Primary Source Websites (31) Linus Pauling Day-by-Day (6) Linus ...
redirect type
31 (document.location)
block type
0 (no issues)
detected language
1 (English)
category id
index version
2025110801
spam phrases
2
text nonlatin
0
text cyrillic
0
text characters
53287
text words
10463
text unique words
2528
text lines
809
text sentences
370
text paragraphs
107
text words per sentence
28
text matched phrases
24
text matched dictionaries
12
links self subdomains
0
links other subdomains
8 - watch.opb.org, blog.americanchemistry.com, cultureofchemistry.fieldofscience.com, blogs.sciencemag.org, wavefunction.fieldofscience.com, blogs.royalsociety.org, physicsbuzz.physicscentral.com
links other domains
22 - worldscientific.com, lifetalk.net, compoundchem.com, masterorganicchemistry.com, organic-chemistry.org, polymersolutions.com, planetary.org, mosaicscience.com, nautil.us, scienceblogs.com, softmachines.org, dailygalaxy.com, lastwordonnothing.com, evolvingthoughts.net, skullsinthestars.com, timetoeatthedogs.com, iopblog.org, preposterousuniverse.com, quantumfrontiers.com, physicsdetective.com
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
10
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
63 - s0.wp.com, wp.me, s1.wp.com, wordpress.com, paulingblog.files.wordpress.com, addtoany.com, osurarebooks.tumblr.com, thebrewstorian.tumblr.com, cultureofchemistry.blogspot.com, chempics.wordpress.com, greenchemuoft.wordpress.com, backreaction.blogspot.com, philipball.blogspot.com, medium.com, etherwave.wordpress.com, nanoscale.blogspot.com, physicsandphysicists.blogspot.com, physicsfromtheedge.blogspot.com, flickr.com, en.wordpress.com
links ext klim
0
links ext generic
0
dol status
0
dol updated
2025-12-22 10:35:43
rss status
32 (unknown)
rss found date
2024-03-11 09:51:39
rss size orig
111816
rss items
10
rss spam phrases
2
rss detected language
1 (English)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)
sitemap review version
1
sitemap urls count
838
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2024-03-11 08:46:44
sitemap process date
2024-10-31 10:12:38
sitemap first import date
-
sitemap last import date
-