September 11, 2006 at 11:38 am
Comments posted to this topic are about the content Data Quality
September 18, 2006 at 2:24 am
Hello,
you gave the example of legacy systems not being well documented.
Sometimes it can be the other way arround unfortunately. A change of management, a change of IT-companies, and a wreckless import (time you know) of originally more or less well-documented data into a poorly documented "new" system. My heart broke and now two years later there are still all kinds of little tiny problems. But hey, the system runs ! (but the data s**ks)
Greetings,
L
September 18, 2006 at 3:55 am
Thank you for writing this article. It represents another approach to what I have suggested in my article printed here last year about Fragmentation Measurement
I believe that the quality of data is to be measured firstly concerning the gap between the physical device and the way the users are using it.
The second level has to deal with the redundancy on data (probably check replication).
The third level which I believe this article is about, is the beginning of a BI, because you are looking for missing data and taking decisions according to the contents of the data base.
This subject is very important, and should be continued.
Koby Biller
September 18, 2006 at 9:07 am
I work for a Third Party Administrator. One of my primary job duties is converting data from other sources. When our company buys another company or we get a new client, I convert their data into our system. The new client’s we receive can have anywhere from 2,000 to 40,000 claims, 5,000 to 600,000 payments, and 5,000 to 600,000 notes.
I receive data in every format you can imagine. I have found the following steps useful when converting new data.
Here is a sampling of what my stored procedures look for.
Since I have used this method of performing conversions, I am able to clean up the data in a pre-production phase, before it gets into production. Also, by using identify columns and primary key columns as explained in step 2 and 3, I am able to trace my data back to the original source.
September 18, 2007 at 10:08 pm
Consisting all most entirely of truisms, No quantification, expected norms or constructive methodology for cleansing discussed.
Very unsatisfying read.
Viewing 5 posts - 1 through 4 (of 4 total)
You must be logged in to reply to this topic. Login to reply