id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-10-14 20:49:19
updated at
2025-10-18 11:56:39
url
https://aidisruption.ai/p/6-open-source-tools-to-build-your?r=2ajqea&triedRedirect=true
url length
87
url crc
12925
url crc32
3360502397
location type
1 (url matches target location, page_location is empty)
canonical status
30 (canonical url is different, page_canonical_page_id points to it)
canonical page id
domain id
domain tld
660
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151281008.23/warc/CC-MAIN-20250812234112-20250813024112-00161.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-13 00:58:13
Fetch attempts
0
Original html size
304743
Normalized and saved size
56629
title
excerpt
content
In this article, I'll teach you how to set up an autonomous and controllable large language model (LLM) foundation. This way, even if your work environment doesn't allow the use of the OpenAI API, you can still proceed. This post is for paid subscribers.
author
Meng Li
updated
1763270604
block type
0
extracted fields
36
extracted bits
article author
full content
detected location
0
detected language
1 (English)
category id
index version
2025110801
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
650
text words
138
text unique words
81
text lines
45
text sentences
4
text paragraphs
1
text words per sentence
34
text matched phrases
1
text matched dictionaries
2
links self subdomains
0
links other subdomains
0
links other domains
0
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
10
links ext klim
0
links ext generic
0
image author
featured image