Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-06-02 07:06:57

updated at

2025-11-02 00:50:57

Address

url

https://posit.co/blog/tips-for-getting-started-with-the-nfl-big-data-bowl/

url length

74

url crc

42962

url crc32

3921389522

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

-

Source

domain id

80278935

domain tld

0

domain parts

0

originating warc id

-

originating url

https://www.rstudio.com/blog/tips-for-getting-started-with-the-nfl-big-data-bowl

source type

10 (canonical url)

Server response

server ip

23.185.0.4

Publication date

2025-11-02 00:50:57

Fetch attempts

1

Original html size

317177

Normalized and saved size

191234

Content

title

Tips for Getting Started With the NFL Big Data Bowl From the 2022 Winners - Posit

excerpt

content

author

https://pos.it/facebook

updated

2025-11-15 09:53:57

Text analysis

block type

0

extracted fields

141

extracted bits

featured image
article author
title
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

Zastosowania AI (149)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

6600

text words

1155

text unique words

353

text lines

254

text sentences

38

text paragraphs

15

text words per sentence

30

text matched phrases

14

text matched dictionaries

4