id
name
related bits
0
processing priority
3
site type
0 (generic, awaiting analysis)
review version
11
html import
20 (imported)
first seen date
2023-12-17 19:01:10
expired found date
-
created at
2024-06-06 00:11:46
updated at
2025-12-28 16:00:23
length
8
crc
49187
tld
276
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
-
previous id
0
replaced with id
0
related id
-
dns primary id
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
13056
mp size raw text
1181
mp inner links count
1
mp inner links status
20 (imported)
title
description
Information is nothing without retrieval
image
site name
author
© 2023 Webis
updated
2025-12-16 09:10:07
raw text
Webis People For Students Lecturenotes Research Publications Data Events Facilities Webis.de People For Students Lecturenotes Research Publications Data Events Facilities Information is nothing without retrieval The Webis Group addresses challenges of the information society by conducting basic research, developing technology, and implementing and evaluating prototypes for future information systems. Our research contributes to web mining and retrieval, machine learning, computational linguistics, and symbolic AI. Learn More Search Services Args Argument search ChatNoir Web search IR Anthology Scholarly search on IR Netspeak Writing assistance Picapica Plagiarism detection TIRA Experiment execution Groningen Home People Teaching Research Hannover Home People Teaching Research Jena Home People Teaching Research ...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
index version
2025110801
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
817
text words
122
text unique words
76
text lines
66
text sentences
3
text paragraphs
1
text words per sentence
40
text matched phrases
2
text matched dictionaries
2
links self subdomains
0
links other subdomains
16 - assets.webis.de, ir.webis.de, hannover.webis.de, ai.uni-hannover.de, jena.webis.de, leipzig.webis.de, weimar.webis.de
links other domains
14 - args.me, chatnoir.eu, netspeak.org, picapica.org, tira.io, rug.nl, temir.org
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
3 - twitter.com, youtube.com
links ext klim
0
links ext generic
0
dol status
0
dol updated
2025-12-16 09:10:07
rss path
rss status
3 (priority 3 already searched, no matches found)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)
sitemap review version
1
sitemap urls count
607
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2023-12-25 17:17:31
sitemap process date
2024-10-02 08:23:41
sitemap first import date
-
sitemap last import date
-