Main

processing priority

3

site type

0 (generic, awaiting analysis)

review version

11

html import

20 (imported)

Events

first seen date

2024-08-28 18:01:56

expired found date

-

created at

2024-08-28 18:01:56

updated at

2026-01-29 20:32:46

Domain name statistics

length

21

crc

48390

tld

86

nm parts

0

nm random digits

0

nm rare letters

0

Connections

is subdomain of id

63990134 (quanteda.io)

previous id

0

replaced with id

0

related id

-

dns primary id

0

dns alternative id

0

lifecycle status

0 (unclassified, or currently active)

Subdomains and pages

deleted subdomains

0

page imported products

0

page imported random

0

page imported parking

0

Error counters

count skipped due to recent timeouts on the same server IP

0

count content received but rejected due to 11-799

0

count dns errors

0

count cert errors

0

count timeouts

0

count http 429

0

count http 404

0

count http 403

0

count http 5xx

0

next operation date

-

Server

server bits

server ip

-

Mainpage statistics

mp import status

20

mp rejected date

-

mp saved date

-

mp size orig

44933

mp size raw text

5150

mp inner links count

1

mp inner links status

20 (imported)

Open Graph

title

description

Introduction to quantitative text analysis using quanteda

image

site name

author

Kohei Watanabe and Stefan Müller

updated

2026-01-28 16:23:53

raw text

quanteda tutorials :: Tutorials for quanteda 1. Introduction Install packages R commands 2. Data Import Pre-formatted files Multiple text files Different encodings 3. Basic Operations Workflow Corpus Construct a corpus Document-level variables Subset corpus Change units of texts Extract tags from texts Tokens Construct a tokens object Keyword-in-contexts Select tokens Compound tokens Look up dictionary Generate n-grams Document-feature matrix Construct a DFM Select features Look up dictionary Group documents Feature co-occurence matrix Construct a FCM 4. Statistical Analysis Simple frequency analysis Lexical diversity Document/feature similarity Relative frequency analysi...

Text analysis

redirect type

0 (-)

block type

0 (no issues)

detected language

1 (English)

category id

Other [en] (231)

index version

2025123101

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

3668

text words

645

text unique words

344

text lines

167

text sentences

36

text paragraphs

9

text words per sentence

17

text matched phrases

0

text matched dictionaries

0

RSS

rss path

rss status

1 (priority 1 already searched, no matches found)

rss found date

-

rss size orig

0

rss items

0

rss spam phrases

0

rss detected language

0 (awaiting analysis)

inbefore feed id

-

inbefore status

0 (new)

Sitemap

sitemap status

30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)

sitemap review version

1

sitemap urls count

58

sitemap urls adult

0

sitemap filtered products

0

sitemap filtered videos

0

sitemap found date

2024-08-29 14:18:18

sitemap process date

2024-08-29 14:18:18

sitemap first import date

-

sitemap last import date

-