id
related bits
0
processing priority
4
site type
3 (personal blog or private political site, e.g. Blogspot, Substack, also small blogs on own domains)
review version
11
html import
20 (imported)
first seen date
2024-10-22 20:55:58
expired found date
-
created at
2024-10-22 20:55:57
updated at
2026-01-22 13:21:31
length
27
crc
16121
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
13642151 (wordpress.com)
previous id
0
replaced with id
0
related id
-
dns primary id
0
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
-
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
346557
mp size raw text
119219
mp inner links count
68
mp inner links status
20 (imported)
title
Archives Watch
description
Documenting Refugee Studies and Human Rights in the Archive
image
-
site name
Archives Watch
author
-
updated
2026-01-20 18:32:21
raw text
Archives Watch | Documenting Refugee Studies and Human Rights in the Archive Home About Contact Us Links Archives Watch Documenting Refugee Studies and Human Rights in the Archive Feeds: Posts Comments 2015 in review January 4, 2016 by Paul V Dudman The WordPress.com stats helper monkeys prepared a 2015 annual report for this blog. Here’s an excerpt: A San Francisco cable car holds 60 people. This blog was viewed about 630 times in 2015. If it were a cable car, it would take about 11 trips to carry that many people. Click here to see the complete report. Posted in Uncategorized | Leave a Comment » Archives in the News: Updates from the UEL Archives (weekly) May 10, 2015 by Paul V Dudman Henry Treece – history in the making – Untold lives blog via Untold lives blog http://britishlibrary.typepad.co.uk/untoldlives/ tags: IFTTT Feedly archives Additions to our collections | Archives & Special Collections via Archives...
redirect type
31 (document.location)
block type
0 (no issues)
detected language
1 (English)
category id
Other [en] (231)
index version
2025123101
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
65535
text words
17456
text unique words
3349
text lines
2047
text sentences
570
text paragraphs
226
text words per sentence
30
text matched phrases
17
text matched dictionaries
10
links self subdomains
0
links other subdomains
51 - britishlibrary.typepad.co.uk, uel.ac.uk, britac.ac.uk, blogs.ncvo.org.uk, jiscmail.ac.uk, brunel.ac.uk, ies.sas.ac.uk, ucl.ac.uk, events.sas.ac.uk, pittrivers-sound.blogspot.co.uk, vam.ac.uk, ncl.ac.uk, ioe.ac.uk, jiscdigitalmedia.ac.uk, icarchives.webbler.co.uk, stwebmail.uel.ac.uk, www2.gre.ac.uk, reading.ac.uk, northampton.ac.uk, reg.guardian.managemyaccount.co.uk, blogs.bl.uk, feeds.delicious.com
links other domains
247 - diigo.com, lgf.org.uk, voluntarysectorarchives.org.uk, ncvo.org.uk, cilip.org.uk, uta.fi, websense.com, mailcontrol.com, communityarchives.org.uk, h-net.org, uni-goettingen.de, clevelandjewishnews.com, stljewishlight.com, newhamrecorder.co.uk, algemeiner.com, jns.org, vahs.org.uk, africainwords.com, electronicintifada.net, israelnationalnews.com, cio.co.nz, bit.ly, twmuseums.org.uk, thinkingdigital.co.uk, bl.uk, coneyhq.org, dominicwilcox.com, baltanlaboratories.org, or-bits.com, crumbweb.org, archives.org.uk, religiousarchivesgroup.org.uk, worldbulletin.net, forward.com, editorialmanager.com, springer.com, girona.cat, facetpublishing.co.uk, prestocentre.org, royalvoluntaryservice.org.uk, redcross.org.uk, tandfonline.com, guardian.co.uk, leicestershirehistory.co.uk, berkshirerecordoffice.org.uk, experiencewoodhorn.com, maryevans.com, allafrica.com, witness.org, ift.tt
links spam adult
2
links spam random
0
links spam expired
0
links ext activities
15
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
12
links ext leaks
0
links ext ugc
79 - s0.wp.com, wp.me, s1.wp.com, wordpress.com, bangladesharchives.wordpress.com, hrlibs.blogspot.com, digital-archiving.blogspot.com, uelarchivesportal.wordpress.com, facetpublishing.wordpress.com, librarianshipwreck.wordpress.com, denbighshirearchives.wordpress.com, catandindexgroup.wordpress.com, brunelspecialcollections.wordpress.com, markjowen.wordpress.com, commons.wikipedia.org, en.wikipedia.org, failureinthearchives.wordpress.com, linkedin.com, twitter.com, marthasadie.wordpress.com, ukhrg.wordpress.com, refugeearchives.wordpress.com
links ext klim
0
links ext generic
14
dol status
0
dol updated
2026-01-20 18:32:21
rss status
32 (unknown)
rss found date
2024-10-22 20:55:58
rss size orig
27981
rss items
10
rss spam phrases
0
rss detected language
1 (English)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
-
sitemap status
30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)
sitemap review version
1
sitemap urls count
456
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2024-10-22 20:55:59
sitemap process date
2024-10-22 20:56:00
sitemap first import date
-
sitemap last import date
-