Data Pipeline's embedded data transformation engine makes it easy for Java applications to convert, manipulate, and transform data with only a few lines of code.
The engine has readers and writers for common data formats like XML, CSV, and Excel. It also has operators to filter, validate, lookup, and more.
Plugs into your product or application. No servers, installation, setup, configuration, or network hops.
Write transformations using Java
Use the language and tools you already know.
Low memory and disk overhead.
Large data sets
Handle gigabytes of data with ease.
Process data as it comes in, no delays.
Enhance the toolkit to fit your unique needs; plug‐in your own logic or modify existing behaviour.
Easy to use
Get started quickly.
Get the help you need.
Open, visible source
No guessing, know exactly what the toolkit is doing.
Simple to understand, use, and extend.
Leverage built‐in endpoints and operations.
Flexible data representation
Choose how best to structure your data.
Comma Separated Values (CSV)
Supports user-defined delimited values.
Supports Excel formats 97, 2003, 2007, and 2010.
Streaming XML reader using XPath queries. Template-based XML writer using built-in expression language.
Fixed-Width/Fixed Length Records (FLR)
Web Server Logs
Built-in serialization format.
Rule‐based record filtering.
Rule‐based data validation.
Built-in or user-defined transformation.
Lookups / Joins
Sort large datasets
Remove duplicates (Dedup)
Detailed exception reporting
Attach temporary, transient data to any field or record