What Is Data Cleansing?

Data cleansing uses data from multiple online transaction processing (OLTP) systems to generate part of a data warehouse process. Errors such as spelling, conflicting spelling rules between two systems, and conflicting data (such as having two numbers for the same part). The purpose of data cleanup is to prevent errors or problematic data from entering the calculation process. Generally, it is completed with the help of a computer, including clearing of the valid range of data, clearing of logical consistency of data, and random inspection of data quality.

The process must resolve incorrect usage from multiple
In addition to the singular values of the data input, there is a more complicated one, and what needs to be done is the logical consistency cleanup.
1. First, you can download some storage analyzers. Run this program on your infrastructure to find all files that have not been accessed or modified in 90 days. Make a list and try to associate it with Active Directory.
2. Find the largest file and submit it to the appropriate manager. "You see, these files take up a lot of space, and many of them have not been accessed for more than 90 days. Are these files still useful?"
3 Give users unlimited and unlimited access to tape. Tell them that the data on it is secure; and that it is accessible via the World Wide Web, which can take 20 seconds to 2 minutes. But we don't want to put it on the main storage anymore, because we spend too much on the main storage. You may do it with just a little bit of coordination; you may not even feel it.
4. Implement file isolation scheme. Basically, indicate how you deploy data when you first create it, and apply policies to it. Understanding the data in detail is the best way, even if it's based only on the person's department. If he works in the accounting department and you think that all accounting systems are mission-critical, this means that there is some level of service and resource commitment. When it is saved, the file is where it should be. It is completely transparent to the user and does not require any cooperation.
5. How to achieve it? You will apply different strategies for the importance of the data in each resource pool. Do you get an error message? of course! However, once you receive error messages and have not been accessed for 90 days, they will be migrated offline. We have to start thinking about how to deal with this part of the data. [3]

IN OTHER LANGUAGES

Was this article helpful? Thanks for the feedback Thanks for the feedback

How can we help? How can we help?