id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-11-18 02:51:57
updated at
2025-11-18 02:51:58
url
https://www.arrowheadgrp.com/blog/protecting-clients-and-carriers-from-insurance-fraud/
url length
87
url crc
44931
url crc32
1977921411
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
2211
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279968.16/warc/CC-MAIN-20250807151203-20250807181203-00656.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-07 16:32:27
Fetch attempts
0
Original html size
108478
Normalized and saved size
43563
title
Protecting clients and carriers from insurance fraud - Arrowhead
excerpt
content
author
Arrowhead Communications
updated
1768124538
block type
0
extracted fields
141
extracted bits
featured image
article author
title
OpenGraph suggests this is an article
detected location
0
detected language
1 (English)
category id
Other [en] (231)
index version
2025123101
paywall score
0
spam phrases
1
text nonlatin
0
text cyrillic
0
text characters
5364
text words
990
text unique words
519
text lines
83
text sentences
48
text paragraphs
16
text words per sentence
20
text matched phrases
1
text matched dictionaries
4
links self subdomains
0
links other subdomains
0
links other domains
5
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
1
links ext leaks
0
links ext ugc
1
links ext klim
0
links ext generic
0
image author