Main
id:
type: 0
status: 13
review version: 1
cleanup version: 2
pending deletion: 0
created at: 2024-03-09 21:49:01
updated at: 2024-03-09 21:49:01
Address
url: https://domain.com/1.html
url length: 25
url crc: 23558
url crc32: 451501062
location type: 0
canonical status: 0
canonical page id: -
Source
domain id:
domain tld: 2211
domain parts: 2
originating warc id: -
originating url: https://top-way.cfd/sitemap.xml
source type: 1
Server response
server ip: -
pubdate: 2025-07-15 12:31:43
attempts: 1
size orig: 0
size saved: 0
Text analysis
block type: 10
extracted fields: 0
extracted bits: -
detected location: 0
detected language: 0 (awaiting analysis)
category id: -
index version: 0
paywall score: 0
spam phrases: 0