Data Discovery and Classification Are Complicated, But Critical to Your Data Protection Program

Data — it’s your most critical asset. According to Domo, 2.5 quintillion bytes of data are created daily. It’s no wonder that finding and identifying data is one of the most complex and challenging processes organizations face along their data protection journeys. With the average total cost of a data breach hitting $3.92 million in 2019, companies must discover and classify their data as a foundational component of their data security and data privacy strategies.

What Is Data Discovery and Classification?

Data discovery and data classification go hand in hand. Data discovery is the process of scanning your environment to determine where data (both structured and unstructured) resides — e.g., in database and file servers that could potentially contain sensitive and/or regulated data.

Data classification, which follows the data discovery process, is more complicated. It’s the process of identifying the types of data within the discovered data sources using a predefined set of patterns, keywords or rules and assigning classification labels to that data. For example, if you work at a health insurance company, you would use medical identifier patterns to search for sensitive healthcare information.

Why Is Data Discovery and Classification Important?

Put simply, if you don’t know what data you have and where it lives, you can’t protect it effectively, which means your data is vulnerable. In addition, data classifications inform how you should treat and protect your data, including the policies you need to place around it, and guide the prioritization of your data protection and risk mitigation activities. Finally, it helps identify data that is governed by regulations and enables you to implement the controls required to achieve compliance.

Common Barriers to Effective Data Protection

Given the myriad strategic, tactical, business and technical reasons for performing data discovery and data classification, why isn’t every company doing it? Well, it’s complicated.

Operationally, discovering and classifying structured and unstructured data in a unified way across the cloud and on-premises locations is a complex process due to the scale, types of data, and underlying architectures and platforms. It’s also challenging to establish and maintain a coherent approach across the different environments and assign labels consistently across all the data. Without that consistency, the effectiveness of these processes is limited at best.

Moreover, data is constantly changing and moving, which means it needs to be tracked and reclassified regularly and continuously. Your business changes and evolves over time, which can complicate your data discovery and data classification efforts when introducing legacy (or, conversely, new) technologies. Lastly, with so many new regulations coming into effect — especially data privacy regulations — it’s hard to keep up with, centralize and manage all the compliance requirements for data protection.

In part two of this series, we’ll explore some tips and best practices to help companies strategically plan and implement a flexible approach to data discovery and classification.

Read the Forrester Report: Rethinking Data Discovery & Classification

Joanne Godfrey

How secure are green data centers? Consider these 5 trends

4 min read - As organizations increasingly measure environmental impact towards their sustainability goals, many are focusing on their data centers.KPMG found that the majority of the top 100 companies measure and report on their sustainability efforts. Because data centers consume a large amount of energy, Gartner predicts that by 2027, three in four organizations will have implemented a data center sustainability program, which often includes implementing a green data center.“Responsibilities for sustainability are increasingly being passed down from CIOs to infrastructure and operations…

Why maintaining data cleanliness is essential to cybersecurity

3 min read - Data, in all its shapes and forms, is one of the most critical assets a business possesses. Not only does it provide organizations with critical information regarding their systems and processes, but it also fuels growth and enables better decision-making on all levels.However, like any other piece of company equipment, data can degrade over time and become less valuable if organizations aren’t careful. What’s even more dangerous is that neglecting data hygiene can expose organizations to a number of security…

Router reality check: 86% of default passwords have never been changed

4 min read - Misconfigurations remain a popular compromise point — and routers are leading the way.According to recent survey data, 86% of respondents have never changed their router admin password, and 52% have never adjusted any factory settings. This puts attackers in the perfect position to compromise enterprise networks. Why put the time and effort into creating phishing emails and stealing staff data when supposedly secure devices can be accessed using "admin" and "password" as credentials?It's time for a router reality check.Rising router risksRouters…

Security Intelligence

{{title}}

{{title}}

{{title}}

Topics

What Is Data Discovery and Classification?

Why Is Data Discovery and Classification Important?

Common Barriers to Effective Data Protection

More from Data Protection

How secure are green data centers? Consider these 5 trends

Why maintaining data cleanliness is essential to cybersecurity

Router reality check: 86% of default passwords have never been changed

Topic updates