Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-11-16 03:40:40

updated at

2025-11-16 03:40:40

Address

url

https://www.curitiba.pr.gov.br/noticias/criancas-aprendem-modalidades-esportivas-olimpicas-em-aulas-gratuitas-da-prefeitura/55073

url length

129

url crc

46533

url crc32

3337336261

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

196995380

domain tld

76

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279985.2/warc/CC-MAIN-20250807213109-20250808003109-00958.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

23.12.145.52

Publication date

2025-08-07 22:44:47

Fetch attempts

0

Original html size

415064

Normalized and saved size

406418

Content

title

Crianças aprendem modalidades esportivas olímpicas em aulas gratuitas da Prefeitura

excerpt

content

author

Prefeitura de Curitiba

updated

2026-01-09 10:15:30

Text analysis

block type

0

extracted fields

13

extracted bits

featured image
article author
title

detected location

0

detected language

10 (Portuguese)

category id

Lekkoatletyka (72)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

8672

text words

1748

text unique words

573

text lines

141

text sentences

117

text paragraphs

25

text words per sentence

14

text matched phrases

16

text matched dictionaries

4