By: ryoungman
by Roy Youngman on May 24, 2008 - 08:09 AM Source: http://www.ryoungman.net/?p=11#comment-33
Not surprisingly, while I was discovering and solving bad data problems, many others were too. The process now even has a name and a Wikipedia article: Data Profiling.
There is a short list of commercial products offering data profiling capabilities. As you might expect, some are associated with ETL tools and others with DBMS products. Oracle provides some capability with its Warehouse Builder product. IBM has a product called WebSphere Information Analyzer. Informatica has a product called Data Explorer. There don’t seem to be any open source alternatives that I can find. If anyone knows of any, please comment. There is also a company called DataFlux that seems pretty focused on data profiling. They have an active blog that offers a lot of good advice on the subject as well.


