id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-11-25 15:01:20
updated at
2025-11-25 15:01:21
url
https://ftp.ludd.ltu.se/mirrors/debian/dists/
url length
45
url crc
14565
url crc32
2879404261
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
752
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279938.13/warc/CC-MAIN-20250807054540-20250807084540-00790.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-07 06:17:53
Fetch attempts
0
Original html size
5391
Normalized and saved size
4397
title
Index of /mirrors/debian/dists/
excerpt
content
NameLast modifiedSize../unstable/2015-04-25 14:12:51-trixie-updates/2023-06-11 16:09:17-trixie-proposed-updates/2023-06-11 16:09:17-trixie-backports/2023-06-11 16:09:17-trixie/2025-08-07 00:05:41-testing-updates/2023-06-10 10:32:48-testing-proposed-updates/2023-06-10 10:32:48-testing-backports/2023-06-10 13:03:14-testing/2023-06-10 10:32:48-stable-updates/2023-06-10 10:32:43-stable-proposed-updates/2023-06-10 10:32:43-stable-backports-sloppy/2023-06-10 13:03:14-stable-backports/2023-06-10 10:32:43-stable/2021-08-14 09:31:45-sid/2023-06-11 16:09:17-README2025-07-13 00:18:42751 Brc-buggy/2008-08-04 22:46:30-proposed-updates/2021-08-14 09:31:45-oldstable-updates/2023-06-10 10:32:33-oldstable-proposed-updates/2023-06-10 10:32:33-oldstable/2019-07-06 09:54:50-experimental/2023-06-11 16:09:17-Debian12.11/2025-05-17 10:29:25-Debian11.11/2024-08-31 12:25:52-bullseye-updates/2023-06-11 16:09:17-bullseye-proposed-updates/2024-08-31 13:29:51-bullseye/2024-08-31 12:40:03-bookworm-updates/2023-06-1...
author
updated
1764774945
block type
0
extracted fields
104
extracted bits
title
full content
content was extracted heuristically
detected location
0
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
Pozostałe (16)
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
914
text words
133
text unique words
66
text lines
1
text sentences
1
text paragraphs
1
text words per sentence
133
text matched phrases
0
text matched dictionaries
0
image author
featured image