Main

related bits

0

processing priority

3

site type

5 (wiki-type site, growing by topic rather than chronologically)

review version

11

html import

20 (imported)

Events

first seen date

2024-09-16 15:45:15

expired found date

-

created at

2024-09-16 15:45:15

updated at

2026-03-03 23:11:30

Domain name statistics

length

19

crc

39049

tld

86

nm parts

0

nm random digits

0

nm rare letters

0

Connections

is subdomain of id

87719371 (github.io)

previous id

0

replaced with id

0

related id

-

dns primary id

0

dns alternative id

0

lifecycle status

0 (unclassified, or currently active)

Subdomains and pages

deleted subdomains

0

page imported products

0

page imported random

0

page imported parking

0

Error counters

count skipped due to recent timeouts on the same server IP

0

count content received but rejected due to 11-799

0

count dns errors

0

count cert errors

0

count timeouts

0

count http 429

0

count http 404

0

count http 403

0

count http 5xx

0

next operation date

-

Server

server bits

server ip

-

Mainpage statistics

mp import status

20

mp rejected date

-

mp saved date

-

mp size orig

8462

mp size raw text

1108

mp inner links count

2

mp inner links status

20 (imported)

Open Graph

title

About Noah’s ARK

description

image

site name

Noah's ARK

author

updated

2026-02-23 12:42:26

raw text

About Noah’s ARK - Noah’s ARK Noah's ARK People Publications Acknowledgments Follow Seattle Github About Noah’s ARK Noah’s ARK is an informal collection of researchers, led by Prof. Noah Smith within the Natural Language Processing group and the Allen School at the University of Washington and formerly at Carnegie Mellon University’s Language Technologies Institute and Machine Learning Department. We have published numerous papers about NLP; data and code is linked within each paper. Project pages, datasets, and code from 2006–2015 can be found on our page at CMU. If you are an undergraduate or masters student in Seattle and would like to work on research with Noah’s ARK, please complete this demonstration of research interest. Please follow the instructions carefully. In general, we encourage you to apply if you are a student at UW or any other school in or near Seattle, and you think you have a role to play in the ARK. Sitemap Follow:...

Text analysis

redirect type

0 (-)

block type

0 (no issues)

detected language

1 (English)

category id

Education (47)

index version

1

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

843

text words

171

text unique words

107

text lines

40

text sentences

9

text paragraphs

2

text words per sentence

19

text matched phrases

0

text matched dictionaries

0
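
The words-per-sentence figure above is consistent with dividing text words by text sentences (171 / 9 = 19). A minimal sketch of that arithmetic follows; the function name and the rounding behaviour are assumptions for illustration, not the crawler's confirmed implementation.

```python
def words_per_sentence(words: int, sentences: int) -> int:
    """Average words per sentence, rounded to the nearest integer.

    Hypothetical helper: the record only shows the three values,
    not how the crawler actually derives or rounds the average.
    """
    if sentences <= 0:
        # Avoid division by zero for pages with no detected sentences.
        return 0
    return round(words / sentences)


# Values taken from the record: 171 words, 9 sentences.
print(words_per_sentence(171, 9))  # 19
```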

RSS

rss path

rss status

1 (priority 1 already searched, no matches found)

rss found date

-

rss size orig

0

rss items

0

rss spam phrases

0

rss detected language

0 (awaiting analysis)

inbefore feed id

-

inbefore status

0 (new)

Sitemap

sitemap status

40 (successfully imported reports.txt into the in_pages table)

sitemap review version

2

sitemap urls count

17

sitemap urls adult

0

sitemap filtered products

0

sitemap filtered videos

0

sitemap found date

2024-09-19 19:29:26

sitemap process date

2024-09-19 19:29:27

sitemap first import date

-

sitemap last import date

2026-03-03 23:11:30