id
type
5
status
21
review version
0
cleanup version
0
pending deletion
0
created at
2025-12-02 20:10:19
updated at
2025-12-10 19:26:22
url
https://www.imyanmarhouse.com/news/read/1151789
url length
47
url crc
54467
url crc32
1735906499
location type
1
canonical status
10
canonical page id
domain id
domain tld
2211
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279917.30/warc/CC-MAIN-20250806140254-20250806170254-00516.warc.gz
source type
11
page id
title
áá»á±á¬ááºáá½ááºáááºá áá±á¬ááºáá±á¬ááºáá¾á áá±á«áºáᬠáááºá¸ ááá ááẠáá°á á á¶á¡áááºáá¼á®á¸ááᯠáá±á¬ááº
excerpt
content
áá¼ááºá á¡áááºáá¼á¶áá¼á± ááááºá¸ | Posted by áá½á¾á±á ááºáááºá¸ Share on Facebook áá±á¬ááºáá±á¬ááºáá¯ááºáá¾ááºáá¼ááºáá½ááºá· áá»á±á¬ááºáá½ááºááẠáá¬ááá¼á¬áá¾áẠááá¯ááºááá¯ááºáá²á·áá±á¬ “áá±áá¼ááºáá¼á¬ á¥áá»á¬áẔ á¡áááºáá¾á á á¶á¡áááºáá¼á®á¸ááᯠáá»á±á¬ááºáá½ááºáááºá áá±á¸áá»á¾á±á¬á·áá±á¸áá¬áá¼á±á¬ááºá¸ Hong Kong Commercial Daily ááááºá¸áá¬áá áá±á¬áºáá¼áááºá áá»á±á¬ááºáá½ááºáááºááẠáá±á¬ááºáá±á¬ááºá áá°áá¯á¶áá¶áááºáá½ááºáá¼á áºáá±á¬ The Peak áá¾á á¡ááá¯áá«á á¶á¡áááºáá¼á®á¸ááᯠáááá áá¼ááºá·áá¾á áºá áá±á¬ááºáá±á¬ááºáá±á«áºáᬠááá áááºá¸áá¼ááºá· áááºá...
author
updated
1766848677
block type
0
extracted fields
233
extracted bits
featured image
title
full content
content was extracted heuristically
OpenGraph suggests this is an article
detected location
0
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
-
index version
1
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
4799
text words
3689
text unique words
99
text lines
1
text sentences
1
text paragraphs
1
text words per sentence
255
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
0
links other domains
21
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
2
links ext ugc
17
links ext klim
0
links ext generic
0
image author