id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-10-23 16:01:29
updated at
2025-10-23 16:01:30
url
https://biisit.info/2023/kappale/99/887
url length
39
url crc
15705
url crc32
2416196953
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
2476
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280903.0/warc/CC-MAIN-20250811192912-20250811222912-00844.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-11 20:08:13
Fetch attempts
0
Original html size
32465
Normalized and saved size
29613
title
Guns N' Roses - You could be mine
excerpt
content
Soitettu yhteensä 186 kertaa vuoden 2023 aikana 31.12.2023 12:51, Radio Rock 30.12.2023 05:02, Radio Rock 29.12.2023 07:51, Radio Rock 27.12.2023 19:58, Radio Rock 26.12.2023 23:40, Radio Rock 26.12.2023 09:42, Radio Rock 25.12.2023 22:33, Radio Rock 24.12.2023 19:34, Ysäri 21.12.2023 17:30, Radio Rock 19.12.2023 15:18, Radio Rock 18.12.2023 10:31, Radio Rock 18.12.2023 03:46, Radio Rock 16.12.2023 12:48, Radio Rock 15.12.2023 05:12, Radio Rock 13.12.2023 08:09, Radio Rock 13.12.2023 02:17, Radio Rock 10.12.2023 22:53, Radio Rock 10.12.2023 00:41, Radio Rock 04.12.2023 04:09, Radio Rock 03.12.2023 12:32, Ysäri 02.12.2023 01:25, Radio Rock 30.11.2023 10:57, Radio City 28.11.2023 23:19, Radio Rock 28.11.2023 17:56, Radio Voima 28.11.2023 12:30, Radio Rock 25.11.2023 05:39, Radio Rock 24.11.2023 06:34, Radio Rock 18.11.2023 11:46, Radio Rock 12.11.2023 18:55, Radio Rock 11.11.2023 15:46, Radio Rock 11.11.2023 12:34, Ysäri 11.11.2023 09:49, Radio Rock 11.11.2023 03:16, Radio ...
author
updated
1762345948
block type
0
extracted fields
105
extracted bits
featured image
title
full content
content was extracted heuristically
detected location
0
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
index version
2025110801
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
3922
text words
1291
text unique words
78
text lines
1
text sentences
1
text paragraphs
1
text words per sentence
255
text matched phrases
1
text matched dictionaries
1
links self subdomains
0
links other subdomains
0
links other domains
0
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
1
links ext klim
0
links ext generic
1
image author
featured image