id
type
0 (not classified)
status
22 (imported)
review version
1
cleanup version
2
pending deletion
0 (-)
created at
2023-12-03 16:28:44
updated at
2024-10-19 21:13:17
pol page id
pol status
0
pol hosts ticketing
pol hosts ecommerce
pol hosts finance
pol hosts crypto
pol hosts leak
pol hosts devel
pol hosts ugc
pol hosts klim
pol hosts builders
pol hosts self subdomains
pol hosts other subdomains
pol hosts other domains
pol updated
1771726595
url
https://abcokna.pl/firma-roku-branzy-budowlanej
url length
47
url crc
42944
url crc32
837134272
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
616
domain parts
2
originating warc id
-
originating url
https://abcokna.pl/sitemap.xml
source type
1 (sitemap)
server ip
-
publication date
2024-03-13 07:21:13
fetch attempts
0
original html size
0
normalized and saved size
25753
title
abc OKNA » Construction Industry Company of the Year
excerpt
content
Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam, eaque ipsa quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt explicabo. Nemo enim ipsam voluptatem quia voluptas sit aspernatur aut odit aut fugit, sed quia consequuntur magni dolores eos qui ratione voluptatem sequi nesciunt. Neque porro quisquam est, qui dolorem ipsum quia dolor sit amet, consectetur, adipisci velit, sed quia non numquam eius modi tempora incidunt ut labore et dolore magnam aliquam quaerat voluptatem. Ut enim ad minima veniam, quis nostrum exercitationem ullam corporis suscipit laboriosam, nisi ut aliquid ex ea commodi consequatur? Quis autem vel eum iure reprehenderit qui in ea voluptate velit esse quam nihil molestiae consequatur, vel illum qui dolorem eum fugiat quo voluptas nulla pariatur. Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam, eaque ipsa qu...
author
updated
1771726595
block type
0
extracted fields
233
extracted bits
featured image
title
full content
content was extracted heuristically
OpenGraph suggests this is an article
detected location
0
detected language
0 (awaiting analysis)
category id
Lorem ipsum (237)
index version
1
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
2166
text words
387
text unique words
102
text lines
1
text sentences
15
text paragraphs
1
text words per sentence
25
text matched phrases
0
text matched dictionaries
0
image author