id
related bits
0
processing priority
3
site type
5 (wiki-type site, growing by topic rather than chronologically)
review version
11
html import
20 (imported)
first seen date
2024-03-02 13:10:53
expired found date
-
created at
2024-06-07 04:05:52
updated at
2025-12-30 07:58:02
length
23
crc
22935
tld
86
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
87719371 (github.io)
previous id
0
replaced with id
0
related id
-
dns primary id
0
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
15386
mp size raw text
8184
mp inner links count
4
mp inner links status
20 (imported)
title
description
image
site name
author
updated
2025-12-18 05:42:50
raw text
Home André F. T. Martins Home Jobs Publications Software Courses SARDINE Lab Home Contact information: andre.t.martins AT tecnico DOT ulisboa DOT pt Instituto de Telecomunicacões Torre Norte - Sala 9.07 Av. Rovisco Pais, 1 1049-001 Lisboa - Portugal Phone: +351 218418454 I am an Associate Professor at Instituto Superior Técnico , Senior Researcher at the Instituto de Telecomunicações , and VP of AI Research at Unbabel in Lisbon , Portugal . I also do scientific consulting for Priberam Labs . I work on natural language processing and machine learning. Until 2012, I was a PhD student in the joint CMU-Portugal program in Language Technologies, at Carnegie Mellon University and Instituto Superior Técnico. My advisors were Mário Figueiredo , Noah Smith , Pedro Aguiar and Eric Xing . Post-docs Chryssa Zerva (Post-doc at IT, 2021-) Vlad Niculae (Post-doc at IT, 2018-2020, now Assistant Professor at Unive...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
index version
2025110801
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
6146
text words
1270
text unique words
444
text lines
250
text sentences
71
text paragraphs
20
text words per sentence
17
text matched phrases
7
text matched dictionaries
4
links self subdomains
0
links other subdomains
24 - tecnico.ulisboa.pt, lx.it.pt, labs.priberam.com, isr.ist.utl.pt, nilc.icmc.usp.br, informatik.tu-darmstadt.de, users.monash.edu.au, cl.uni-heidelberg.de, nt.tuwien.ac.at, sepln2022.grupolys.org, lxmls.it.pt, talnrecital2021.inria.fr, lumlis.tecnico.ulisboa.pt, athnlp.iit.demokritos.gr, proceedings.mlr.press, alt.qcri.org, www-05.ibm.com
links other domains
30 - unbabel.com, it.pt, vene.ro, phontron.com, ai.google, icai.ai, triton-conference.org, ellis.eu, ivan-titov.org, mblondel.org, eurnlp.org, mlrs.ai, culturgest.pt, statmt.org, aclweb.org, transacl.org, jmlr.org, icml.cc, getpelican.com, python.org, smashingmagazine.com
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
10
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
18 - en.wikipedia.org, linkedin.com, youtube.com, docs.google.com
links ext klim
0
links ext generic
2
dol status
0
dol updated
2025-12-18 05:42:50
rss path
rss status
3 (priority 3 already searched, no matches found)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
1 (priority 1 already searched, no matches found)
sitemap review version
1
sitemap urls count
0
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
-
sitemap process date
2024-07-01 15:24:15
sitemap first import date
-
sitemap last import date
-