id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-10-17 19:20:25
updated at
2026-01-04 07:40:14
url
https://alloutrugbyleague.co.uk/news/st-helens-josh-papalii-blow-1257429
url length
72
url crc
29817
url crc32
2790880377
location type
4 (page_location points to new url in different domain)
canonical status
30 (canonical url is different, page_canonical_page_id points to it)
canonical page id
location
https://www.alloutrugbyleague.co.uk/news/st-helens-josh-papalii-blow-1257429
domain id
domain tld
826
domain parts
0
originating warc id
-
originating url
https://alloutrugbyleague.co.uk/
source type
4 (mainpage of this domain)
server ip
Publication date
2026-01-04 07:40:14
Fetch attempts
1
Original html size
447091
Normalized and saved size
33122
title
St Helens dealt Josh Papali’i blow as NRL club move after Origin heroics
excerpt
content
St Helens dealt Josh Papali’i blow as NRL club move after State of Origin heroicsThe reported St Helens target has received fresh interest in the NRL.Josh McAllister10:03, 10 Jul 2025View ImageJosh Papali'i made a shock State of Origin comeback and helped Queensland to a series win on Wednesday.Canberra Raiders icon Josh Papali’i could yet remain in the NRL with a fresh twist in the veteran forward’s future just 24 hours after his State of Origin return.The 33-year-old made a shock comeback to the Queensland side under coach Billy Slater, despite having announced his retirement from representative football just last year. He made his 24th Origin appearance on Wednesday, helping Queensland clinch the series with a 24-12 victory over New South Wales.Papili’i is set to leave Canberra Raiders at the end of the season, having been told there is no extended contract offer on the table. Reports emerged last week stating that the prop has agreed a two-year deal with St Helens.However, his st...
author
Josh McAllister
updated
1768058767
block type
0
extracted fields
237
extracted bits
featured image
article author
title
full content
content was extracted heuristically
OpenGraph suggests this is an article
detected location
0
detected language
1 (English)
category id
224
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
1853
text words
382
text unique words
221
text lines
1
text sentences
8
text paragraphs
1
text words per sentence
47
text matched phrases
1
text matched dictionaries
2
links self subdomains
0
links other subdomains
3
links other domains
6
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
4
links ext klim
0
links ext generic
0
image author