Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-07-10 03:15:28

updated at

2025-12-06 20:56:06

Address

url

https://trukj.com/news.html

url length

27

url crc

24756

url crc32

3614204084

location type

4 (page_location points to new url in different domain)

canonical status

30 (canonical url is different, page_canonical_page_id points to it)

canonical page id

3128167266

location

https://www.trukj.com/news.html

Source

domain id

218353608

domain tld

0

domain parts

0

originating warc id

-

originating url

https://trukj.com/

source type

4 (mainpage of this domain)

Server response

server ip

156.244.123.226

Publication date

2025-12-06 20:56:05

Fetch attempts

1

Original html size

36032

Normalized and saved size

31951

Content

title

云顶集团-www.4008.com|中国·官网首发

excerpt

content

author

TG:myyjjpp

updated

1767274233

Text analysis

block type

0

extracted fields

13

extracted bits

featured image
article author
title

detected location

0

detected language

126 (language undetectable (empty document, too short, or engines disagree))

category id

Pozostałe (16)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

2048

text words

1004

text unique words

250

text lines

78

text sentences

10

text paragraphs

13

text words per sentence

100

text matched phrases

0

text matched dictionaries

0