Main

processing priority

4

site type

3 (personal blog or private political site, e.g. Blogspot, Substack, also small blogs on own domains)

review version

11

html import

20 (imported)

Events

first seen date

2024-10-15 01:33:50

expired found date

-

created at

2024-10-15 01:33:50

updated at

2026-01-27 15:43:53

Domain name statistics

length

21

crc

49038

tld

2211

nm parts

0

nm random digits

0

nm rare letters

0

Connections

is subdomain of id

13642151 (wordpress.com)

previous id

0

replaced with id

0

related id

-

dns primary id

0

dns alternative id

0

lifecycle status

0 (unclassified, or currently active)

Subdomains and pages

deleted subdomains

0

page imported products

0

page imported random

0

page imported parking

0

Error counters

count skipped due to recent timeouts on the same server IP

0

count content received but rejected due to 11-799

0

count dns errors

0

count cert errors

0

count timeouts

0

count http 429

0

count http 404

0

count http 403

0

count http 5xx

0

next operation date

-

Server

server bits

server ip

-

Mainpage statistics

mp import status

20

mp rejected date

-

mp saved date

-

mp size orig

352078

mp size raw text

110826

mp inner links count

85

mp inner links status

20 (imported)

Open Graph

title

Software Development at Royal Danish Library

description

A peekhole into the life of the software development department at the Royal Danish Library

image

site name

Software Development at Royal Danish Library

author

updated

2026-01-26 01:40:14

raw text

Software Development at Royal Danish Library | A peekhole into the life of the software development department at the Royal Danish Library Software Development at Royal Danish Library A peekhole into the life of the software development department at the Royal Danish Library Skip to content Home About Net Archive Search ← Older posts Beware the cursorMark, my son! Posted on October 24, 2023 by Toke Eskildsen ’Twas brillig, and the slithy toves Did gyre and gimble in the wabe: All mimsy were the borogoves, And the mome raths outgrabe. The setting: A slightly hacked Solr 7 Cloud providing search for the Danish Netarchive, populated using webarchive-discovery and accessed using SolrWayback . About 130TB of index handled by 150 shards. 48 billion documents; not too mimsy. The task: Export data from the netarchive. Ranging from simple “ Get all unique domains ” over “ Get all text written in Danish ” to “ Get a WARC with all unique content for the domain ...

Text analysis

redirect type

0 (-)

block type

0 (no issues)

detected language

1 (English)

category id

SEC and Crypto [en] (228)

index version

2025123101

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

39

text characters

65535

text words

17974

text unique words

3187

text lines

1832

text sentences

857

text paragraphs

255

text words per sentence

20

text matched phrases

2

text matched dictionaries

7

RSS

rss status

32 (unknown)

rss found date

2024-10-15 01:33:51

rss size orig

242785

rss items

10

rss spam phrases

0

rss detected language

1 (English)

inbefore feed id

-

inbefore status

0 (new)

Sitemap

sitemap status

40 (completed successful import of reports.txt file to table in_pages)

sitemap review version

2

sitemap urls count

165

sitemap urls adult

0

sitemap filtered products

0

sitemap filtered videos

0

sitemap found date

2024-10-15 01:33:51

sitemap process date

2024-10-15 01:33:52

sitemap first import date

-

sitemap last import date

2025-12-25 22:09:45