id
name
processing priority
3
site type
0 (generic, awaiting analysis)
review version
11
html import
0 (new)
first seen date
2025-08-10 04:43:10
expired found date
-
created at
2024-07-21 14:32:59
updated at
2026-01-14 20:22:33
length
12
crc
20067
tld
380
nm parts
0
nm random digits
0
nm rare letters
0
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
3
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
2025-12-23 13:57:38
server bits
APACHE
DEBIAN
server ip
mp import status
20
mp rejected date
-
mp saved date
2026-01-14 20:22:33
mp size orig
41751
mp size raw text
3610
mp inner links count
20
mp inner links status
10 (links queued, awaiting import)
title
description
image
site name
author
updated
2026-03-01 21:22:40
raw text
CLARIN-IT | the Italian Common Language Resources and Technology Infrastructure CLARIN-IT Facebook CLARIN-IT Twitter About Governance Consortium Centres Logo Join Access Events Initiatives News Home the Italian Common Language Resources and Technology Infrastructure "CLARIN ERIC is one of the 20 European Research Infrastructure Consortia in which Italy takes part thanks to the investment by the Italian Ministry of University and Research (MUR) through the Ordinary Fund of Public Research Bodies (FOE), initially assigned on an extraordinary basis and, in recent years, stably structured through the financing item "projects of international significance". Italy takes part in CLARIN ERIC through the CLARIN-IT National Consortium." [ Ministerial Decree no. 1082 of 10/09/2021 - National Plan for Research Infrastructures (PNIR) 2021-2027 ] English Italian Italy Full Member of CLARIN ERIC from 1 st October 2015 CLARIN-IT structure and functioning The...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
-
index version
1
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
2846
text words
514
text unique words
277
text lines
79
text sentences
10
text paragraphs
7
text words per sentence
51
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
6 - eventi.unibo.it, digitaltools.labcd.unipi.it, aiucd2025.dlls.univr.it, cloud.garr.it
links other domains
14 - clarin.eu, eepurl.com, devsaran.com
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
4 - facebook.com, twitter.com
links ext klim
0
links ext generic
1
dol status
0
dol updated
2026-03-01 21:22:40
rss path
rss status
0 (new)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
0 (new)
sitemap review version
2
sitemap urls count
0
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
-
sitemap process date
-
sitemap first import date
-
sitemap last import date
-