Main

type

5 (blog/news article)

status

10 (page successfully fetched)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-07-06 23:15:54

updated at

2026-02-17 17:38:26

Address

url

https://asiam.ie/news/statement-on-the-publication-of-the-epsen-act-review

url length

74

url crc

3102

url crc32

3473738782

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

25009906

domain tld

0

domain parts

0

originating warc id

-

originating url

https://asiam.ie/

source type

4 (mainpage of this domain)

Server response

server ip

34.202.203.47

Publication date

2025-07-09 19:41:45

Fetch attempts

1

Original html size

90848

Normalized and saved size

61965

Text analysis

block type

0

extracted fields

105

extracted bits

featured image
title
full content
content was extracted heuristically

detected location

0

detected language

1 (English)

category id

-

index version

1

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

2150

text words

399

text unique words

191

text lines

1

text sentences

12

text paragraphs

1

text words per sentence

33

text matched phrases

0

text matched dictionaries

0