Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-12-06 05:04:57

updated at

2025-12-06 05:04:58

Address

url

https://bochnia.pttk.pl/blog/news_archive/2024-11

url length

49

url crc

19020

url crc32

175196748

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

4542971

domain tld

616

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279884.50/warc/CC-MAIN-20250805222253-20250806012253-00999.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

185.157.81.168

Publication date

2025-08-05 23:40:55

Fetch attempts

0

Original html size

50709

Normalized and saved size

41732

Content

title

PTTK Bochnia Branch (Oddział PTTK w Bochni)

excerpt

content


Andrzejki in the Bieszczady Mountains

Category: none
Author: PTTK Bochnia
Date: 2024-11-30

30 November to 1 December 2024

We invite you to the Tourist Andrzejki (St. Andrew's Eve celebration).
Once again we will go hiking in the Bieszczady Mountains, and in the evening, with music and refreshments, we will celebrate Andrzejki. Traditionally, everyone...

author

updated

1767181859

Text analysis

block type

0

extracted fields

233

extracted bits

featured image
title
full content
content was extracted heuristically
OpenGraph suggests this is an article

detected location

146

detected language

121 (Polish)

category id

Travel (51)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

1737

text words

276

text unique words

165

text lines

1

text sentences

10

text paragraphs

1

text words per sentence

27

text matched phrases

7

text matched dictionaries

3