id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-10-23 22:11:05
updated at
2025-10-23 22:11:05
pol page id
pol status
0
pol hosts ticketing
pol hosts ecommerce
pol hosts finance
pol hosts crypto
pol hosts leak
pol hosts devel
pol hosts ugc
pol hosts klim
pol hosts builders
pol hosts self subdomains
pol hosts other subdomains
bilder.sky.de fast.skydeutschland.demdex.net assets.adobedtm.com cdn.privacy-mgmt.com
pol hosts other domains
sky.at
pol updated
1762280036
url
https://www.sky.de/serien/pastewka
url length
34
url crc
60073
url crc32
484108969
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
276
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280903.0/warc/CC-MAIN-20250811192912-20250811222912-00669.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-11 21:27:23
Fetch attempts
0
Original html size
68324
Normalized and saved size
49367
title
Pastewka
excerpt
content

 
 
 
 
 
 
 
 Du bist bereits Kunde?
 Den richtigen Service und persönliche Angebote erhältst du im Login-Bereich.
 Bitte logge dich hier ein. 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 Alles zu Serien
 Pastewka
 
 
 
 
 Über die Serie
 
 Bildergalerie
 
 Sendetermine
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 Bastian Pastewka spielt Bastian Pastewka: Mit seiner Serie gibt der Comedian Einblicke in sein turbulentes Leben, in dem es nicht an peinlichen Situationen mangelt.
 
 Bastian Pastewka spielt Bastian Pastewka: Mit seiner Serie gibt der Comedian Einblicke in sein turbulen...
author
sky.de
updated
1762280036
block type
0
extracted fields
109
extracted bits
featured image
article author
title
full content
content was extracted heuristically
detected location
0
detected language
2 (German)
category id
Pozostałe (16)
index version
2025103102
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
1826
text words
302
text unique words
178
text lines
1
text sentences
21
text paragraphs
1
text words per sentence
14
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
27
links other domains
2
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
0
links ext klim
0
links ext generic
0
image author