All posts by Dele Taylor

About Dele Taylor

We make Data Pipeline, a lightweight ETL framework for Java. Use it to filter, transform, and aggregate data on-the-fly in your web, mobile, and desktop apps. Learn more about it at northconcepts.com.

Data Pipeline 4.4 Now Available

Posted On 31 Jul 2018
By Dele Taylor
In Data Pipeline, News

Today we’re pleased to announce the release of Data Pipeline version 4.4. This update includes integration with Amazon S3, new features to better handle real-time data and aggregation, and new XML and JSON readers to speed up your development.

Continue reading →

Online data prep and code generator for Data Pipeline

Posted On 20 Oct 2017
By Dele Taylor
In Code Generation, Data Pipeline, Data Preparation, ETL, News

We’re building a new tool to help you work faster with Data Pipeline.

This new tool is a web app that lets you interactively transform, filter, and prepare data on-the-fly. It also lets you generate Data Pipeline code based on the actions you perform.

Continue reading →

How to Convert Tabular Data to Trees Using Aggregation

Posted On 11 Oct 2017
By Dele Taylor
In Data Pipeline, ETL

We recently received an email from a Java developer asking how to convert records in a table (like you get in a relational database, CSV, or Excel file) to a composite tree structure. Normally, we’d point to one of Data Pipeline’s XML or JSON data writers, but for good reasons those options didn’t apply here. The developer emailing us needed the hierarchical structures in object form for use in his API calls.

Since we didn’t have a general purpose, table-tree mapper, we built one. We looked at several options, but ultimately decided to add a new operator to the GroupByReader. This not only answered the immediate mapping question, but also allowed him to use the new operator with sliding window aggregation if the need ever arose.

The rest of this blog will walk you through the implementation in case you ever need to add your own custom aggregate operator to Data Pipeline.

Continue reading →

Scala and Data Pipeline – Phone Bill Calculation Example

Earlier this year a friend sent me a video showing how he implemented a phone bill calculation challenge using Scala. I took a stab at it using Java + Data Pipeline and below is what I came up with.

How about you? How would you code this using your favourite language or framework?

Continue reading →

How to Export Emails from Gmail to Excel with Data Pipeline

Posted On 10 Aug 2017
By Dele Taylor
In Data Pipeline, Excel

Export emails from Gmail and G Suite to Excel

Updated: July 2021

If you have ever tried to export emails to Excel for analysis, you know it is not exactly straightforward. Maybe you need to find the top companies contacting you and your sales team. Maybe you need to perform text or sentiment analysis on the contents of your messages. Or maybe you’re creating visualizations to better understand who’s emailing you. This easy guide will show you how you can use Data Pipeline to search and read emails from Gmail or G Suite, process them any way you like, and store them in Excel.

Continue reading →

Spring Batch vs Data Pipeline – ETL Job Example

Posted On 4 Oct 2016
By Dele Taylor
In Batch, Data Pipeline, Java, Spring Framework

Data Pipeline vs Spring Batch

Updated: July 2021

Most examples of creating a Spring Batch ETL Job require an enormous amount of code for such a routine task. In this blog, I will show you how to accomplish the same task of summarizing a million stock trades to find the open, close, high, and low prices for each symbol using our Data Pipeline framework.

Continue reading →

How to speed up JDBC inserts?

Posted On 17 May 2016
By Dele Taylor
In Database, Java

Updated: May 2023

When trying to assess how knowledgeable a developer is in general and in JDBC in particular, here’s a question I like to ask: how would you speed up inserts when using JDBC?

Here are some options to consider if you ever need to insert data quickly into an SQL database.
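One widely used option, shown here only as a minimal sketch and not necessarily the approach the full post leads with, is to send rows in batches inside a single transaction using the standard `PreparedStatement.addBatch()` and `executeBatch()` calls. The `trades` table, its columns, the `Trade` class, and the `insertInBatches` helper are all hypothetical names introduced for illustration.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.List;

public class BatchInsertSketch {

    // Hypothetical table and columns, used only for illustration.
    static final String INSERT_SQL = "INSERT INTO trades (symbol, price) VALUES (?, ?)";

    // Hypothetical row type; not part of JDBC or Data Pipeline.
    static final class Trade {
        final String symbol;
        final double price;

        Trade(String symbol, double price) {
            this.symbol = symbol;
            this.price = price;
        }
    }

    /** Inserts all rows in batches of batchSize and returns the number of batches sent. */
    static int insertInBatches(Connection connection, List<Trade> trades, int batchSize)
            throws SQLException {
        boolean previousAutoCommit = connection.getAutoCommit();
        connection.setAutoCommit(false); // one commit at the end instead of one per row
        int batchesSent = 0;
        try (PreparedStatement statement = connection.prepareStatement(INSERT_SQL)) {
            int pending = 0;
            for (Trade trade : trades) {
                statement.setString(1, trade.symbol);
                statement.setDouble(2, trade.price);
                statement.addBatch(); // queue the row locally
                if (++pending == batchSize) {
                    statement.executeBatch(); // send the queued rows together
                    batchesSent++;
                    pending = 0;
                }
            }
            if (pending > 0) {
                statement.executeBatch(); // flush the final partial batch
                batchesSent++;
            }
            connection.commit();
        } catch (SQLException e) {
            connection.rollback(); // discard the partial work on failure
            throw e;
        } finally {
            connection.setAutoCommit(previousAutoCommit);
        }
        return batchesSent;
    }
}
```

How much this helps depends on the driver and database, so the batch size is worth measuring rather than assuming.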

Continue reading →

How To Aggregate Twitter Searches Without A Database

Posted On 11 Aug 2015
By Dele Taylor
In Data Pipeline, Java, Twitter

One feature of Data Pipeline is its ability to aggregate data without a database. This feature allows you to apply SQL “group by” operations to JSON, CSV, XML, Java beans, and other formats on-the-fly — in real-time. This quick tutorial will show you how to use the GroupByReader class to aggregate Twitter search results.
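To make the idea of a database-free "group by" concrete, here is a rough sketch in plain Java using the standard `Collectors.groupingBy` collector. It deliberately does not use the GroupByReader API, which the full tutorial covers; the `Tweet` class, its fields, and the `retweetsByUser` helper are hypothetical stand-ins for search results.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class GroupBySketch {

    // Hypothetical shape of one search result; not a Data Pipeline or Twitter type.
    static final class Tweet {
        final String user;
        final int retweets;

        Tweet(String user, int retweets) {
            this.user = user;
            this.retweets = retweets;
        }

        String user() {
            return user;
        }

        int retweets() {
            return retweets;
        }
    }

    /** In-memory analogue of: SELECT user, SUM(retweets) FROM tweets GROUP BY user */
    static Map<String, Integer> retweetsByUser(List<Tweet> tweets) {
        return tweets.stream()
                .collect(Collectors.groupingBy(
                        Tweet::user,                           // the "group by" key
                        TreeMap::new,                          // keeps keys sorted for stable output
                        Collectors.summingInt(Tweet::retweets) // the aggregate per group
                ));
    }
}
```

This version holds every result in memory before aggregating, which is fine for a small search but differs from the on-the-fly, streaming behaviour described above.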

Continue reading →

How to read data in parallel using AsyncMultiReader

Posted On 26 Jun 2015
By Dele Taylor
In Data Pipeline, Exceptions, Multithreading

Data Pipeline now includes a new AsyncMultiReader endpoint that lets you read from multiple DataReaders in parallel. Here’s how it works.

Continue reading →

How to create multiple sheets in a single Excel file

Posted On 30 May 2015
By Dele Taylor
In Data Pipeline, Excel, Java, News

Data Pipeline lets you read, write, and convert Excel files using a very simple API. This post will show you how to create Excel files containing more than one worksheet or tab.

Continue reading →
