id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-10-23 22:18:22
updated at
2025-10-23 22:18:23
url
https://biisit.info/2021/kappale/52/138911/28
url length
45
url crc
50952
url crc32
3216623368
location type
1 (url matches target location, page_location is empty)
canonical status
30 (canonical url is different, page_canonical_page_id points to it)
canonical page id
domain id
domain tld
2476
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280903.0/warc/CC-MAIN-20250811192912-20250811222912-00614.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-11 21:25:51
Fetch attempts
0
Original html size
68825
Normalized and saved size
65973
title
Viikate - Varjorastaat
excerpt
content
31.12.2021 20:31 29.12.2021 23:39 29.12.2021 16:48 29.12.2021 13:13 28.12.2021 02:34 27.12.2021 11:54 27.12.2021 03:37 26.12.2021 19:33 26.12.2021 01:18 25.12.2021 21:29 25.12.2021 12:11 25.12.2021 02:17 24.12.2021 07:27 22.12.2021 22:18 22.12.2021 12:53 22.12.2021 04:18 21.12.2021 14:21 21.12.2021 11:15 21.12.2021 01:18 20.12.2021 20:13 20.12.2021 11:15 20.12.2021 00:36 19.12.2021 16:13 19.12.2021 07:29 18.12.2021 22:17 18.12.2021 11:33 18.12.2021 01:16 17.12.2021 14:55 16.12.2021 00:17 15.12.2021 19:11 15.12.2021 04:34 14.12.2021 18:13 14.12.2021 12:14 14.12.2021 03:43 13.12.2021 21:12 13.12.2021 15:21 13.12.2021 11:14 13.12.2021 02:34 12.12.2021 17:31 12.12.2021 09:09 11.12.2021 21:30 11.12.2021 14:34 10.12.2021 20:52 09.12.2021 22:38 09.12.2021 12:35 09.12.2021 03:16 08.12.2021 20:13 08.12.2021 00:14 07.12.2021 13:15 07.12.2021 11:15 06.12.2021 17:37 06.12.2021 04:33 05.12.2021 17:53 04.12.2021 20:46 04.12.2021 15:31 04.12.2021 10:31 04.12.2021 00:31 03.12.2021 22:15 02.12.2021 0...
author
updated
1762911478
block type
0
extracted fields
105
extracted bits
featured image
title
full content
content was extracted heuristically
detected location
0
detected language
125 (detectable language, but not yet supported)
category id
Pozostałe (16)
index version
2025110801
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
15636
text words
6515
text unique words
56
text lines
1
text sentences
1
text paragraphs
1
text words per sentence
255
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
0
links other domains
0
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
1
links ext klim
0
links ext generic
1
image author
featured image