id
type
5 (blog/news article)
status
21 (imported old-v2, waiting for another import)
review version
1
cleanup version
2
pending deletion
0 (-)
created at
2024-03-16 14:39:11
updated at
2026-01-10 06:25:50
pol page id
pol status
0
pol hosts ticketing
pol hosts ecommerce
pol hosts finance
pol hosts crypto
pol hosts leak
pol hosts devel
pol hosts ugc
pol hosts klim
pol hosts builders
pol hosts self subdomains
pol hosts other subdomains
code.tidio.co sp-ao.shortpixel.ai
pol hosts other domains
polyfill.io
pol updated
1769034361
url
https://www.cambridge-school.pl/blog/
url length
37
url crc
27498
url crc32
2955111274
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
616
domain parts
2
originating warc id
-
originating url
https://www.cambridge-school.pl/post-sitemap.xml
source type
1 (sitemap)
server ip
Publication date
2025-07-18 07:30:11
Fetch attempts
0
Original html size
43871
Normalized and saved size
38310
title
Blog - Cambridge School - Szkoła językowa
excerpt
content
Highgate Cementary is a special place with examples of Victorian gothic style tombs and Egyptian influences, being inspiration for many atists and film makers. Beautiful sculptures and hidden dark places is for sure for those who like pinch of beauty and suspense. The Winton Gallery at Science Museum London has one of the most amazing and exciting street art scenes in the world. It’s changing all the time.The most popular places are: Shoreditch and Brick Lane are streets You must see A 300-metre tunnel underneath Waterloo Station, completely covered in street art. You can see people spraying on the walls from morning to evenings. Why London is an interesting place to visit (not only) on holiday? There are 170 different museums in London, among them three out of ten the most popular ones in the world. For art lo...
author
https://pixanet.pl
updated
1769034361
block type
0
extracted fields
108
extracted bits
article author
title
full content
content was extracted heuristically
detected location
126
detected language
121 (Polish)
category id
index version
2025123101
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
3248
text words
627
text unique words
415
text lines
1
text sentences
34
text paragraphs
1
text words per sentence
18
text matched phrases
2
text matched dictionaries
7
links self subdomains
0
links other subdomains
2
links other domains
1
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
0
links ext klim
0
links ext generic
0