id
processing priority
3
site type
5 (wiki-type site, growing by topic rather than chronologically)
review version
11
html import
20 (imported)
first seen date
2024-09-28 05:51:28
expired found date
-
created at
2024-09-28 05:51:28
updated at
2026-01-14 11:08:54
length
20
crc
62359
tld
86
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
87719371 (github.io)
previous id
0
replaced with id
0
related id
-
dns primary id
0
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
18493
mp size raw text
3018
mp inner links count
0
mp inner links status
1 (no links)
title
Hongjin Su
description
Hongjin Su, Hongjin Su, hongjinsu
image
site name
Hongjin Su
author
Hongjin Su 苏弘锦
updated
2026-02-22 07:49:45
raw text
Hongjin Su 苏弘锦 Hong Kong Hi! I am a second-year PhD student in the Natural Language Processing group at the University of Hong Kong (HKUNLP). I am fortunate to be advised by Dr. Tao Yu (core), Dr. Lingpeng Kong and Prof. Ben Kao. My primary interests are Data Science and Natural Language Processing. Previously, I graduated from the Chinese University of Hong Kong, Computer Science, in 2022. Publications BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval Hongjin Su*, Howard Yen*, Mengzhou Xia*, Weijia Shi, Niklas Muennighoff, Han-yu Wang, Haisu Liu, Quan Shi, Zachary S. Siegel, Michael Tang, Ruoxi Sun, Jinsung Yoon, Sercan O. Arik, Danqi Chen, Tao Yu Preprint [paper] [code] [data] [website] ARKS: Active Retrie...
redirect type
0 (-)
block type
0 (no issues)
detected language
1 (English)
category id
AI [en] (229)
index version
2025123101
spam phrases
0
text nonlatin
3
text cyrillic
0
text characters
2069
text words
418
text unique words
243
text lines
93
text sentences
15
text paragraphs
6
text words per sentence
27
text matched phrases
3
text matched dictionaries
3
links self subdomains
0
links other subdomains
links other domains
12 - semanticscholar.org, link-to-whatever-social-network.com, contextual.ai, xlang.ai, pypi.org, aclanthology.org, slideslive.com, interspeech2020.org, jekyllrb.com, mademistakes.com
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
14
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
21 - twitter.com
links ext klim
0
links ext generic
1
dol status
0
dol updated
2026-02-22 07:49:45
rss path
rss status
1 (priority 1 already searched, no matches found)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
40 (completed successful import of reports.txt file to table in_pages)
sitemap review version
2
sitemap urls count
6
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2024-09-28 05:51:28
sitemap process date
2024-09-28 05:51:28
sitemap first import date
-
sitemap last import date
2026-01-14 11:08:54