id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-12-31 03:41:24
updated at
2025-12-31 03:41:24
url
https://amecah.com.mx/index.php?view=article&id=787%3Acomite-syr&catid=13%3Aartcomites
url length
86
url crc
29753
url crc32
3276895289
location type
1 (url matches target location, page_location is empty)
canonical status
2 (missing canonical tag in html)
canonical page id
-
domain id
domain tld
484
domain parts
3
originating warc id
13193310
originating url
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-02 22:43:53
Fetch attempts
0
Original html size
267151
Normalized and saved size
52151
title
Comite SyR
excerpt
content
author
Super User SM
updated
2026-01-02 10:29:01
block type
0
extracted fields
12
extracted bits
article author
title
detected location
0
detected language
8 (Spanish)
category id
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
3908
text words
712
text unique words
301
text lines
71
text sentences
20
text paragraphs
11
text words per sentence
35
text matched phrases
1
text matched dictionaries
1
image author
featured image