Open source data integration vendor Talend has unveiled a tool aimed at scrubbing dirty data from corporate information repositories.
Talend Data Quality, which will be available free under a GPL license, ferrets out such errors as duplicate names and address, and improperly configured data including phone numbers. At its most basic, the software can ensure a person's phone number is correct and has the required number of digits; check that zip codes match the cities contained in an address entry; and consolidate entries that have names, nicknames or abbreviations that apply to the same person.
"Data quality goes way beyond name and address issues," says Yves De Montcheuil, vice president of worldwide marketing for Talend. "Those are the most prevalent. But if you have a product catalog, you need to ensure product descriptions are correct and that the price makes sense."
He says clean data is key when integrating information across systems because mis-information can propagate fast not only internally but to partners.
Talend is coupling its Data Quality tool with the Talend Open Profiler software it released in June. The Profiler can look inside a database and pinpoint problems. The tools can be used together or separately. In addition, both tools work in harmony with Talend's Integration Suite.
The company also has been forging relationships with major vendors, including a partnership with Microsoft earlier this year.
Data Quality is a graphical tool that lets users drag and drop components onto a process map. The components describe such tasks as reformatting an address, checking an address against the U.S. Postal Service database, pulling data from specific repositories, adding longitude and latitude to customer records to provide navigation help to delivery drivers, or pulling data from a credit bureau. An SDK lets users create their own components.
Once the process is complete, Talend Data Quality generates an executable code in Java or Perl that can be installed in multiple places on the network and close to data sources.
Talend plans to deliver Data Quality at the end of September and will offer tech support and other services via a subscription that starts at US$15,000 per year.
Read up on the latest ideas and technologies from companies that sell hardware, software and services. Using EMC Celerra IP Storage with Vmware Infrastructure 3 over iSCSI and NFS
Email Archiving Implementation: Five Costly Mistakes to Avoid
EMC Data Profiling for File System and Exchange Server Environments
Choices in Storage Architecture for Oracle Environments
Microsoft 2008 Mission Critical IT
Mimosa™ NearPoint™ for Microsoft® Exchange Server: Email Archiving 101
Realizing the Value of Unified Communications
Network Aware Service Management
Zones provide focussed content from Computerworld and leading technology partners.Discover how SOA can create smarter outcomes for your business.
Attend and learn:
- How SOA is helping leading companies to become more agile
- Where you should be applying SOA processes in your company
- The top SOA implementation mistakes to avoid
Click here for more information.
- +
Computerworld Live Podcast #97: The Future of Enterprise Networking 25/07/2008 09:45:36
This week CW Live chats with Mark Thompson, global sales and marketing manager for HP ProCurve, on the future of the enterprise networking. Mark discusses the trends we can expect to see in the near future and how the right infrastructure can ensure your enterprise network is secure. - +
Computerworld Live Podcast #96: Security at the Edge 11/06/2008 09:22:22
CW Live speaks with Amol Mitra, HP ProCurve Director of Marketing for Asia Pacific and Japan. Today's topic: how enterprises are starting to shift away from simply controlling security via server logins, firewalls and moving to more adaptive security frameworks. - +
Data Management Edition #10: Multi-Petascale Systems 02/05/2008 09:12:33
This week we look at sustainability and the development of multicore technologies to build multi-petascale systems. - +
IT Security Edition #11: How to poison the Storm botnet 01/05/2008 08:51:55
This week CW Live presents a case study on how to poison the notorious Storm botnet . Plus we take a look at Cisco's plans for Ironport. - +
IT Security Edition #10: Cyber-battles fought and won 24/04/2008 11:09:47
Vendors bow to end user pressure to improve product security, and we take a look at the latest concepts shaping the cyber-battlefield of the future.
Vignette Announces 2008 Excellence Awards 2008-11-21 10:50:00+11
PGP and Ponemon Institute Unveil Inaugural Australian Data Breach Study 2008 2008-11-20 17:34:00+11
Symantec Cloud Services Transform Data Centre Operations Through Proactive Management 2008-11-20 12:06:00+11
Verizon Business Offers Tips to Building a Successful Unified Communications and Collaboration Plan 2008-11-20 12:04:00+11
AARNet Brings 4K Digital Cinema to Australia: First 4K HD Video Signal delivered into Australia by AARNet 2008-11-20 12:02:00+11
Know thy self: Reduce costs, secure data and ensure compliance with identity management
Midsize businesses cannot operate effectively without the ability to control access to their networks and business systems. A strong identity management platform can play the role of gatekeeper and guardian of business intelligence and information. Read on to discover how you can create a strong identity management plan to protect your business.









