id
name
processing priority
4
site type
0 (generic, awaiting analysis)
review version
11
html import
20 (imported)
first seen date
2024-02-08 19:51:57
expired found date
-
created at
2024-05-31 19:31:42
updated at
2025-12-26 00:20:15
length
11
crc
31985
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
-
previous id
0
replaced with id
0
related id
-
dns primary id
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
147339
mp size raw text
11940
mp inner links count
0
mp inner links status
20 (imported)
title
description
Geza Kovacs, Senior Research Scientist at Google
image
site name
author
updated
2025-12-13 06:15:15
raw text
Geza Kovacs, Senior Research Scientist at Google Geza Kovacs CV / Resume Close Menu Geza Kovacs Research Open-Source Teaching Contact Publications Geza Kovacs Research Open-Source Teaching Contact Publications CV / Resume Geza Kovacs Geza Kovacs I'm Geza Kovacs, a Senior Research Scientist at Google, working on applications of LLMs. I was previously at Lilt , working on improving translators' productivity using interactive machine translation. I did my PhD in Computer Science at Stanford, where I was advised by Michael Bernstein in the Human-Computer Interaction group, and undergrad and masters at MIT. See my resume for details. Research Automatic Correction of Human Translations We develop Transformer-based models for performing automatic correction of translation errors made by human translators, and show that in a human-in-the-loop setting our system helps translators produce higher quality translations. Jessy Lin, Geza Kovacs ...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
index version
2025110801
spam phrases
0
text nonlatin
12
text cyrillic
0
text characters
9334
text words
1694
text unique words
674
text lines
302
text sentences
122
text paragraphs
25
text words per sentence
13
text matched phrases
5
text matched dictionaries
4
links self subdomains
0
links other subdomains
14 - dl.acm.org, addons.mozilla.org
links other domains
23 - lilt.com, aclanthology.org, aclweb.org, daemo.org, npmjs.com, pypi.org, venmo.com, livescript.net, ubuntuforums.org
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
58
links ext ecommerce
links ext finance
1 - paypal.me
links ext crypto
0
links ext booking
0
links ext news
1
links ext leaks
0
links ext ugc
54 - youtube.com, en.wikipedia.org, linkedin.com, facebook.com
links ext klim
0
links ext generic
16
dol status
0
dol updated
2025-12-13 06:15:15
rss path
rss status
3 (priority 3 already searched, no matches found)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)
sitemap review version
1
sitemap urls count
2
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2024-02-21 08:59:53
sitemap process date
2024-11-23 03:13:21
sitemap first import date
-
sitemap last import date
-