id
related bits
0
processing priority
4
site type
3 (personal blog or private political site, e.g. Blogspot, Substack, also small blogs on own domains)
review version
11
html import
20 (imported)
first seen date
2024-09-15 02:20:10
expired found date
-
created at
2024-09-15 02:20:10
updated at
2026-01-01 13:13:48
length
22
crc
42059
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
13642151 (wordpress.com)
previous id
0
replaced with id
0
related id
-
dns primary id
0
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
138535
mp size raw text
34299
mp inner links count
12
mp inner links status
10 (links queued, awaiting import)
title
Just a simple Hadoop DBA
description
Adventures with Data and Massively Parallel Databases
image
site name
Just a simple Hadoop DBA
author
updated
2026-03-02 18:37:25
raw text
Just a simple Hadoop DBA | Adventures with Data and Massively Parallel Databases Just a simple Hadoop DBA Adventures with Data and Massively Parallel Databases About Presentations On Error Messages Posted: April 9, 2014 | Author: prodlife | Filed under: Uncategorized | Tags: developer , errors , tips | 2 Comments Here’s a pet peeve of mine: Customers who don’t read the error messages. The usual symptom is a belief that there is just on error: “Doesn’t work”, and that all forms of “doesn’t work” are the same. So if you tried something, got an error, your changed something and you are still getting an error, nothing changed. I hope everyone who reads this blog understand why this behavior makes any troubleshooting nearly impossible. So I won’t bother to explain why I find this so annoying and so self defeating. Instead, I’ll explain what can we, as developers, can do to improve the situation a bit. (OMG, did I just refer to myself as a developer? I...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
index version
1
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
25739
text words
5417
text unique words
1395
text lines
505
text sentences
289
text paragraphs
83
text words per sentence
18
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
8 - docs.oracle.com, commons.apache.org, oozie.apache.org, forums.oracle.com, technology.amis.nl, blogs.oracle.com, blog.tanelpoder.com
links other domains
11 - oracle.com, ardentperf.com, oracledoug.com, highscalability.com, oraclemusings.com, paulgraham.com, pythian.com, randsinrepose.com, structureddata.org
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
44 - s0.wp.com, wp.me, s1.wp.com, wordpress.com, docs.google.com, en.wikipedia.org, twitter.com, slideshare.net, rwijk.blogspot.com, dbasrus.blogspot.com, kevinclosson.wordpress.com, jonathanlewis.wordpress.com
links ext klim
0
links ext generic
0
dol status
0
dol updated
2026-03-02 18:37:25
rss status
32 (unknown)
rss found date
2024-11-12 15:30:59
rss size orig
52795
rss items
10
rss spam phrases
0
rss detected language
1 (English)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
40 (completed successful import of reports.txt file to table in_pages)
sitemap review version
2
sitemap urls count
257
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2024-10-13 05:37:22
sitemap process date
2024-10-13 05:37:23
sitemap first import date
-
sitemap last import date
2026-01-01 13:13:48