Description
Organizations are struggling to reduce and right-size their information footprint, using data governance techniques like data cleansing and de-duplication. Why is this effort necessary? Briefly explain and support your answer from your readings, using APA-style citations.
Requirements:
1. Make your initial post with at least one scholarly reference.
2. Use information from your readings and other sources. Use proper citations and references in your post (scholarly references should match the content).
3. Two response posts are also needed.
4. No plagiarism.
Here are the posts for which responses are needed.
Post 1:
Reducing and right-sizing the information footprint
Reducing the data footprint increases the effectiveness of IT resources. For instance, data cleansing and de-duplication techniques help servers, storage, and networks operate effectively (Smallwood, 2014). The process also improves the performance of servers and networks, among other resources. Reducing and right-sizing the information footprint therefore makes it easier to use the organization’s equipment.
The process also improves transparency. For instance, compression improves transparency across applications and maximizes technology investments such as storage tiers. De-duplication affects transparency because it introduces a new layer of technology that may or may not co-exist with the current management and storage tools (Schulz, 2007). The efforts also improve flexibility. De-duplication can cause delays during bulk data restoration, but once the process is complete, the organization gains flexibility.
The efforts also ensure speedy data protection and reduce the amount of storage needed to retrieve data rapidly. Techniques such as single-instance storage (SIS) enable easy and timely ingestion and elimination of duplicate data, which saves storage capacity that can then be used for backed-up data (Schulz, 2007). This assumes a high degree of commonality and repetition among the data files being backed up.
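The single-instance storage idea described above can be illustrated with a minimal sketch. It assumes that duplicates are detected by hashing whole-file contents with SHA-256; the class `SingleInstanceStore`, its methods, and the file names are hypothetical and only serve the illustration, not any particular product described by Schulz (2007).

```python
import hashlib


class SingleInstanceStore:
    """Toy single-instance store: identical file contents are kept only once."""

    def __init__(self):
        self.blobs = {}    # SHA-256 digest -> file contents (stored once)
        self.catalog = {}  # file name -> digest of its contents

    def ingest(self, name, data):
        """Record a file; return True only if its contents were not stored before."""
        digest = hashlib.sha256(data).hexdigest()
        is_new = digest not in self.blobs
        if is_new:
            self.blobs[digest] = data
        self.catalog[name] = digest
        return is_new

    def restore(self, name):
        """Return the contents that were ingested under this file name."""
        return self.blobs[self.catalog[name]]

    def stored_bytes(self):
        """Bytes physically held after duplicates are eliminated."""
        return sum(len(blob) for blob in self.blobs.values())


store = SingleInstanceStore()
store.ingest("monday/report.doc", b"quarterly report")
store.ingest("tuesday/report.doc", b"quarterly report")  # same contents, new name
store.ingest("tuesday/notes.txt", b"meeting notes")

# Logical size counts every backed-up file; stored size counts unique contents only.
logical_bytes = sum(len(store.restore(name)) for name in store.catalog)
print(f"logical bytes: {logical_bytes}, stored bytes: {store.stored_bytes()}")
```

The saving depends entirely on how much content repeats across backups, which matches the assumption of high commonality stated in the post.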
Post 2:
Given the current rate of data generation resulting from digitalization and other technological advancements, organizations that handle this data inevitably hold it in abundance, and its volume has reached unprecedented levels. The data or information footprint is the amount of data an organization stores, both online and offline. It covers cloud storage, removable disks, and all other forms of data storage. Companies are looking for ways to reduce their data footprint by employing techniques, tools, and best practices that manage data growth and work together with data governance, so that important information is not lost and data is not misused. Data footprint reduction can benefit an organization in many respects, such as performance, availability, capacity, and economic or energy efficiency.
Reducing the data footprint can help manage storage more effectively across multiple applications and various tiers of storage. It also enhances an application's service delivery and provides timely data protection in line with business objectives. Organizations also invest in data footprint reduction because it lowers storage costs or defers upgrades to server, storage, and network capacity, along with the associated software license and maintenance fees. Reducing the information footprint extends the effectiveness of existing storage and maintenance capabilities by removing or cleansing data that is duplicated or no longer useful.
As for network-related advantages, reducing the data footprint has a positive effect on LAN, WAN, and SAN bandwidth, which can then be used more efficiently for tasks such as data replication or remote backup.
Explanation & Answer
Importance of Data Cleansing and Data De-Duplication
Name
Institution Affiliation
Course
Instructor Name
In the field of Big Data analytics, organizations collect and store a great deal of information. The stored data may be inaccurate, outdated, or exist as multiple copies of the same information. Data cleansing is a vital process in which an organization revisits its database and updates or removes irrelevant, outdated, duplicated, incorrect, or improperly formatted data to reduce and right-size its information footprint. Therefore, data cleansing and de-duplication are necessary data management processes aimed at ensuring the maximum...