id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-10-06 12:40:23
updated at
2026-01-16 11:49:38
url
https://102school.com/regulatory-documents
url length
42
url crc
27943
url crc32
2062773543
location type
3 (page_location points to new url in the same domain)
canonical status
2 (missing canonical tag in html)
canonical page id
-
location
https://102school.com/ru/regulatory-documents
domain id
domain tld
2211
domain parts
0
originating warc id
-
originating url
https://102school.com/
source type
4 (mainpage of this domain)
server ip
publication date
2026-01-16 11:49:36
fetch attempts
1
original html size
1082347
normalized and saved size
15476
title
Школа №102 г. Бишкек
excerpt
content
author
updated
1769954333
block type
0
extracted fields
8
extracted bits
title
detected location
0
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
-
index version
1
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
12
text characters
15
text words
4
text unique words
4
text lines
1
text sentences
2
text paragraphs
0
text words per sentence
2
text matched phrases
0
text matched dictionaries
0
image author
featured image