Main

processing priority

3

site type

5 (wiki-type site, growing by topic rather than chronologically)

review version

11

html import

20 (imported)

Events

first seen date

2024-02-11 23:14:59

expired found date

-

created at

2024-06-07 04:11:44

updated at

2025-12-30 08:06:53

Domain name statistics

length

18

crc

58242

tld

86

nm parts

0

nm random digits

0

nm rare letters

0

Connections

is subdomain of id

87719371 (github.io)

previous id

0

replaced with id

0

related id

-

dns primary id

0

dns alternative id

0

lifecycle status

0 (unclassified, or currently active)

Subdomains and pages

deleted subdomains

0

page imported products

0

page imported random

0

page imported parking

0

Error counters

count skipped due to recent timeouts on the same server IP

0

count content received but rejected due to 11-799

0

count dns errors

0

count cert errors

0

count timeouts

0

count http 429

0

count http 404

0

count http 403

0

count http 5xx

0

next operation date

-

Server

server bits

server ip

-

Mainpage statistics

mp import status

20

mp rejected date

-

mp saved date

-

mp size orig

46924

mp size raw text

1486

mp inner links count

5

mp inner links status

20 (imported)

Open Graph

title

description

image

site name

author

updated

2025-12-18 05:53:08

raw text

ⅅialⅅoc 🎉🎉🎉 Check out our 2nd DialDoc Workshop co-located with ACL 2022! Check out our new data and task MultiDoc2Dial. Check out our Shared Task at 1st ⅅialⅅoc Workshop at ACL-IJCNLP 2021. Leaderboard for Shared Task at DialDoc2021 is still on! Overview For goal-oriented document-grounded dialogs, it often involves complex contexts for identifying the most relevant information, which requires better understanding of the inter-relations between conversations and documents. Meanwhile, many online user-oriented documents use both semi-structured and unstructured contents for guiding users to access information of different contexts. Thus, we create a new goal-oriented document-grounded dialogue dataset that captures more diverse scenarios derived from various document contents from multiple domains such as ssa.gov and studentaid.gov. For data collection, we propose a novel pipelin...

Text analysis

redirect type

0 (-)

block type

0 (no issues)

detected language

1 (English)

category id

Education (47)

index version

2025110801

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

1094

text words

192

text unique words

125

text lines

41

text sentences

12

text paragraphs

2

text words per sentence

16

text matched phrases

1

text matched dictionaries

1

RSS

rss path

rss status

3 (priority 3 already searched, no matches found)

rss found date

-

rss size orig

0

rss items

0

rss spam phrases

0

rss detected language

0 (awaiting analysis)

inbefore feed id

-

inbefore status

0 (new)

Sitemap

sitemap path

sitemap status

1 (priority 1 already searched, no matches found)

sitemap review version

1

sitemap urls count

0

sitemap urls adult

0

sitemap filtered products

0

sitemap filtered videos

0

sitemap found date

-

sitemap process date

2024-07-01 15:27:43

sitemap first import date

-

sitemap last import date

-