It’s estimated that at least 30% of an organization's unstructured data is redundant, obsolete, or trivial (ROT). This is data that an organization continues to retain even though it has no business or legal value. What an appropriate acronym! But why are so many of you still storing your “ROTten” data?
This situation is getting more critical because unstructured data is exploding, representing 80% to 90% of all new enterprise data and growing 3 times faster than structured data. More and more of this data resides outside the on-premises data center, where it can grow quickly without you noticing. With huge pressure on IT to optimize spending, both on premises and in the cloud—especially given the current economic times—removing or at least minimizing ROT seems like it would be a no-brainer.
But it isn’t as easy as cleaning out all those tools and supplies you’ve hoarded over the years in your garage. At least there, you can physically see what you have and organize it so you can get rid of the stuff you no longer need (or that’s a liability to keep—like that rusty saw you haven’t used in 5 years).
The problem with your enterprise data is that most of it is dark data—data that most organizations don’t even know they have. It’s generated by people, machines, and processes across your organization and is stored but not used. Not only does storing this data cost a lot, but it clogs your data repositories, making it harder for you to find the useful data that’s needed to drive business decisions. Then there’s the environmental and sustainability impact of retaining data that has no value—you need more shelves and therefore more power, more cooling, and more data center space either on premises or in the cloud.
If you have no visibility into this data, how can you categorize it as redundant, obsolete, or trivial so that you can clean it up? Being able to classify your data doesn’t just help you optimize your existing on-premises and cloud storage resources—it helps prevent you from migrating unnecessary data, and incurring added costs, when you take on new cloud migration projects.
As unstructured and dark data proliferates, we see more organizations adopting tools that give them insight into their data so they can control it more easily to minimize ROT.
Our answer to this is NetApp® BlueXP® classification, a data governance and classification service that’s powered by NetApp Cloud Data Sense. It’s delivered through the BlueXP control plane, and provides insight and reporting for both on-premises and cloud storage across a wide array of data types and technology vendors.
BlueXP classification automatically scans your data sources (NetApp and third-party) and uses artificial intelligence, machine learning, and natural language processing to analyze, map, and classify both structured and unstructured data across your entire data estate. It automatically identifies stale, duplicate, and nonbusiness data (among other types of data, like Personally identifiable information [PII] that must be controlled for compliance purposes) regardless of where the data is stored. And makes that data visible through a single pane of glass so you can quickly address it.
BlueXP classification helps you reduce ROT by answering questions like:
You can put this insight into action right from the BlueXP classification dashboard, where you can move, copy, delete, assign, tag, and label data appropriately, or set policies to automate certain processes.
Of course, controlling ROT is only one of several challenges you face as your data proliferates. Keeping your data secure and compliant is essential, and failure to do so can have even bigger financial consequences.
In addition to identifying savings opportunities, BlueXP classification excels at helping you discover and protect personal and sensitive data. This helps you meet your cybersecurity and regulatory compliance requirements with capabilities like:
There’s no better time than the start of a new year to tackle that cleanup work that’s so badly needed. So go ahead—make it your resolution to eliminate your ROT in 2023! And while you’re at it, please get rid of that rusty saw in your garage, too.
Try it out for yourself. Get started with a free trial of BlueXP classification for 30 days (up to 1TB).
Darnell Fatigati is a senior product marketing manager for NetApp’s Cloud Data Services with over 20 years of experience in technology, IT services, and telecom. Her focus is helping customers understand how to gain visibility and insight into their data so they can optimize their IT infrastructure, efficiently migrate to the cloud, maintain regulatory compliance, and enhance their cybersecurity posture.