Data Aggregation

Data Pipeline provides readers that lets you perform SQL-like "group by" operations on any dataset.

The GroupByReader class performs these aggregate operations on-the-fly — without a database — while the data is flowing.

Aggregate operations can be applied to data coming from flat files (CSV, XML, JSON), streaming data, Java Beans, APIs, or any format Data Pipeline supports, including your own custom readers.