The term 'sustainable data' sounds concise, but what does it actually mean? For convenience, a range of topics is grouped under this term, even though they might not have much in common. These topics vary from energy efficiency to the careful handling of privacy-sensitive data. However, each of these subjects is important to prevent the process of digitalization from having an adverse effect.
- Storage and processing.
A migration to cloud processes seems like a sustainable step from an environmental perspective. Digitalizing processes and transferring data processing to the cloud causes the energy consumption of these processes to become less visible. However, data centers are required for all that data storage and processing, and they are known for their enormous energy and water consumption. Many data operators, however, strive to work as environmentally conscious as possible. They use renewable energy and energy-efficient hardware. Due to their scale, they are more efficient than traditional on-premises data centers at companies or the "server room" at the office. But not all data centers follow the same path when setting up sustainable processes. Therefore, when selecting a cloud provider, it is important to ask critical questions about their environmental goals and future plans in this area.
- Hardware.
Data is processed on servers, which significantly determines the energy consumption in the process. Cooling in the server room is also an energy-intensive process. Hardware manufacturers are making increasingly smaller and faster components that, relatively speaking, consume less and less energy. These should replace the old equipment. But what happens to the old equipment? The degree to which equipment can be recycled varies considerably between manufacturers. This is also an important topic to discuss with suppliers.
- Green energy.
Even with the best saving measures in place, energy is still required. Ideally, this energy should come from renewable sources if the aim is to prioritize sustainable data. The use of solar panels and wind energy helps reduce the ecological footprint.
- Data streams.
The way data processes are structured can have significant implications for the amount of energy required. Especially retrieving and writing data is relatively energy-intensive. When programming software, it can therefore matter how often data needs to be loaded into the working memory. The way processes are interconnected can also impact the overall energy usage. Therefore, ask a software supplier or cloud provider during the selection process how they focus on setting up sustainable data streams.
- Data deduplication.
The more data stored, the greater the energy demand. Therefore, it is not wise to store more data than necessary. Many data files contain duplicate or even multiple instances of data. There is software that analyzes data for duplicates before storing it. The copies are removed and replaced with a reference. When retrieving data from storage, the reverse process occurs so that applications still find the copies where they expect them. There are significant differences in the efficiency and accuracy of deduplication software. Therefore, it is a point of attention during the selection process.
- Data cleaning.
Deleting data that is no longer needed is also a way to manage data sustainably. Many companies, as well as individuals, tend to keep data "just in case it might be useful someday." The General Data Protection Regulation (GDPR) already imposes legal requirements on the maximum retention period for personal data. However, it is advisable to use software that attaches a retention period to all data and automatically deletes it after this period expires.
- Data quality.
Data lakes filled with data that don't fit together are essentially worthless. Therefore, the reuse of data depends on careful monitoring of the quality of the data being stored. In many digital transformation projects, rationalizing and converting data is the biggest sore point.
- Governance.
Supervision of the use and management of data has as much impact on sustainability as the technical requirements for sustainable data. How is accountability for the use and management of data ensured? Is there adequate attention to privacy? And how is ethical handling of data guaranteed? Governance is not a one-time exercise but a subject that requires continuous attention because societal conditions, legislation, and technology are subject to change.
_____
Data Expo
At the upcoming Data Expo, sustainability will once again receive significant attention. At the Jaarbeurs, on Wednesday September 11th, and Thursday September 12th, you can gain essential knowledge, explore technological possibilities, and establish valuable contacts to take the next step. A visit to Data Expo will help you find answers to all your data-related questions. Visitor registration is free and can be done here.
Also read: 6 must-haves with Data Governance
_____