Main
id
type
5
status
0
review version
1
cleanup version
0
pending deletion
0
created at
2026-01-16 12:18:16
updated at
2026-01-16 12:18:16
Address
url
https://chadrick-kwag.net/posts/paper-summary-bros-a-pre-trained-language-model-focusing-on-text-and-layout-for-better-key-information-extraction-from-documents/
url length
161
url crc
61173
url crc32
248377077
location type
0
canonical status
0
canonical page id
-
Source
domain id
domain tld
2644
domain parts
2
originating warc id
-
originating url
https://chadrick-kwag.net/sitemap.xml
source type
1
Server response
server ip
-
pubdate
2021-11-10 00:00:00
attempts
0
size orig
0
size saved
0
Text analysis
block type
0
extracted fields
0
extracted bits
–
detected location
0
detected language
0 (awaiting analysis)
category id
-
index version
0
paywall score
0
spam phrases
0