Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-23 22:18:22

updated at

2025-10-23 22:18:23

Address

url

https://biisit.info/2021/kappale/52/138911/28

url length

45

url crc

50952

url crc32

3216623368

location type

1 (url matches target location, page_location is empty)

canonical status

30 (canonical url is different, page_canonical_page_id points to it)

canonical page id

2860891214

Source

domain id

25154237

domain tld

2476

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280903.0/warc/CC-MAIN-20250811192912-20250811222912-00614.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

104.21.46.184

publication date

2025-08-11 21:25:51

fetch attempts

0

original html size

68825

normalized and saved size

65973

Content

title

Viikate - Varjorastaat

excerpt

content

31.12.2021 20:31 29.12.2021 23:39 29.12.2021 16:48 29.12.2021 13:13 28.12.2021 02:34 27.12.2021 11:54 27.12.2021 03:37 26.12.2021 19:33 26.12.2021 01:18 25.12.2021 21:29 25.12.2021 12:11 25.12.2021 02:17 24.12.2021 07:27 22.12.2021 22:18 22.12.2021 12:53 22.12.2021 04:18 21.12.2021 14:21 21.12.2021 11:15 21.12.2021 01:18 20.12.2021 20:13 20.12.2021 11:15 20.12.2021 00:36 19.12.2021 16:13 19.12.2021 07:29 18.12.2021 22:17 18.12.2021 11:33 18.12.2021 01:16 17.12.2021 14:55 16.12.2021 00:17 15.12.2021 19:11 15.12.2021 04:34 14.12.2021 18:13 14.12.2021 12:14 14.12.2021 03:43 13.12.2021 21:12 13.12.2021 15:21 13.12.2021 11:14 13.12.2021 02:34 12.12.2021 17:31 12.12.2021 09:09 11.12.2021 21:30 11.12.2021 14:34 10.12.2021 20:52 09.12.2021 22:38 09.12.2021 12:35 09.12.2021 03:16 08.12.2021 20:13 08.12.2021 00:14 07.12.2021 13:15 07.12.2021 11:15 06.12.2021 17:37 06.12.2021 04:33 05.12.2021 17:53 04.12.2021 20:46 04.12.2021 15:31 04.12.2021 10:31 04.12.2021 00:31 03.12.2021 22:15 02.12.2021 0...

author

updated

1762911478

Text analysis

block type

0

extracted fields

105

extracted bits

featured image
title
full content
content was extracted heuristically

detected location

0

detected language

125 (detectable language, but not yet supported)

category id

Other (16)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

15636

text words

6515

text unique words

56

text lines

1

text sentences

1

text paragraphs

1

text words per sentence

255

text matched phrases

0

text matched dictionaries

0