Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-06-24 01:33:50

updated at

2025-11-02 16:39:44

Address

url

https://www.bbc.com/news/business-40441434

url length

42

url crc

41679

url crc32

1774232271

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

-

Source

domain id

74498681

domain tld

0

domain parts

0

originating warc id

-

originating url

http://www.bbc.com/news/business-40441434#new_tab

source type

10 (canonical url)

Server response

server ip

151.101.0.81

publication date

2025-11-02 16:39:44

fetch attempts

1

original html size

209505

normalized and saved size

60242

Content

title

Could new data laws end up bankrupting your company?

excerpt

-

content

Could new data laws end up bankrupting your company?
6 July 2017
Matthew Wall, Technology of Business editor
Many companies are in full "panic" mode, says KPMG's Mark Thompson (image: Getty Images)
The European Union's General Data Protection Regulation (GDPR) comes into force in May 2018, radically changing the way organisations have to look after our personal data. Failure to comply could lead to huge fines, yet many businesses are far from ready. Here's why you should care.
What is GDPR exactly?
A new EU regulation governing how organisations should handle and protect our personal data. Many of the stipulations are already covered by the UK's Data Protection Act; but simply put, organisations need to keep records of all personal data, be able to prove that consent was given, show where the data's going, what it's being used for, and how it's being protected. Accountability is the new watchword.
If personal data gets stolen after a cyber-attack, companies have to report the breach wit...

author

Matthew Wall

updated

1762302978

Text analysis

block type

0

extracted fields

111

extracted bits

featured image
image author
article author
title
full content
content was extracted heuristically

detected location

0

detected language

1 (English)

category id

Leaks (26)

index version

2025103102

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

5702

text words

1154

text unique words

508

text lines

1

text sentences

18

text paragraphs

1

text words per sentence

64

text matched phrases

2

text matched dictionaries

5