All posts by The DataPipeline Team

About The DataPipeline Team

We make Data Pipeline, a lightweight ETL framework for Java. Use it to filter, transform, and aggregate data on the fly in your web, mobile, and desktop apps. Learn more about it at northconcepts.com.

What’s New in DataPipeline 7.0?

Welcome to the DataPipeline 7.0 release. Since our last update, the DataPipeline team has been hard at work adding more declarative components, new integrations, new transformations, and generally making the framework easier to use. Our goal is to make simple use-cases easy and complex ones less difficult to implement.

Continue reading →

What’s New in DataPipeline 6.0?

We’re pleased to announce the release of DataPipeline version 6.0. This release includes our new DataPipeline Foundations addon that brings decisioning, source-target data mapping, and other cool features to your software.

Continue reading →

11 Java Data Integration Libraries (2023)

Posted On 2 Oct 2019
By The DataPipeline Team
In Data Preparation, Database, Java, News

Updated: May 2023

With data being produced by many sources in a variety of formats, businesses need a sane way to gain useful insight. Data integration is the process of transforming data from one or more sources into a form that can be loaded into a target system or used for analysis and business intelligence.

Data integration libraries relieve developers of some of the programming burden by abstracting data processing and transformation tasks, letting them focus on work that is directly related to the application logic.

Continue reading →

25 Machine Learning and Artificial Intelligence Conferences

52 Machine Learning and Artificial Intelligence Conferences in 2017 and 2018

Machine learning, and artificial intelligence in general, are two of today’s hottest skills. AI and ML conferences provide a place for you to improve your skills, discuss trends, and exchange ideas with other data scientists, developers, and entrepreneurs. Whether you’re new to the world of machine learning, trying to stay up to date, or just looking to network, there’s a conference happening for you. This article lists over 50 conferences taking place around the world for you to consider attending.

Continue reading →

18 ETL Tools for Java Developers (Updated 2023)

Posted On 31 Aug 2017
By The DataPipeline Team
In Batch, Data Pipeline, Data Preparation, ETL, Java

Updated: May 2023

ETL is a process for performing data extraction, transformation and loading. The process extracts data from a variety of sources and formats, transforms it into a standard structure, and loads it into a database, file, web service, or other system for analysis, visualization, machine learning, etc.
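To make the three stages concrete, here is a minimal sketch in plain Java. It is not Data Pipeline code and does not use any ETL library; the class name `EtlSketch`, the two-column `name,amount` input, and the `NAME=amount` output format are all assumptions made up for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of extract -> transform -> load using only the JDK.
public class EtlSketch {

    // Extract: split raw CSV text into rows of fields.
    // Naive split on commas; quoted fields are not handled.
    static List<String[]> extract(String csv) {
        List<String[]> rows = new ArrayList<>();
        for (String line : csv.split("\\R")) {
            if (!line.isBlank()) {
                rows.add(line.split(",", -1));
            }
        }
        return rows;
    }

    // Transform: trim fields, drop incomplete rows, and normalize
    // each remaining row into a "NAME=amount" record.
    static List<String> transform(List<String[]> rows) {
        List<String> records = new ArrayList<>();
        for (String[] row : rows) {
            if (row.length < 2) {
                continue;
            }
            String name = row[0].trim().toUpperCase();
            String amount = row[1].trim();
            if (name.isEmpty() || amount.isEmpty()) {
                continue;
            }
            records.add(name + "=" + Integer.parseInt(amount));
        }
        return records;
    }

    // Load: hand the records to the target. A joined string stands in
    // for a database, file, or web service here.
    static String load(List<String> records) {
        return String.join("\n", records);
    }

    static String run(String csv) {
        return load(transform(extract(csv)));
    }

    public static void main(String[] args) {
        // The "bob" row has no amount, so the transform step filters it out.
        System.out.println(run("alice, 10\nbob,\n carol ,7\n"));
    }
}
```

Keeping each stage as its own method mirrors how ETL tools usually let you swap sources, transformations, and targets independently.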

ETL tools come in a wide variety of shapes. Some run on your desktop or on-premises servers, while others run as SaaS in the cloud. Some are code-based, built on standard programming languages that many developers already know. Others are built on a custom DSL (domain-specific language) in an attempt to be more intentional and require less code. Others still are completely graphical, offering programming interfaces only for complex transformations.

What follows is a list of ETL tools for developers already familiar with Java and the JVM (Java Virtual Machine) to clean, validate, filter, and prepare your data for use.

Continue reading →

25 Conferences Data Scientists Should Attend in 2022 and 2023

Posted On 19 Jul 2016
By The DataPipeline Team
In Data Science

Updated: June 2022

Being a data scientist means dedication to continuous learning. One great way to keep learning, improve your network, and get exposed to different views is to attend conferences.

Since 2020, organizers have been opting for virtual conferences over in-person ones. That trend continued through 2021 and 2022, although some conferences have been scheduled in person since the second half of 2021.

Data science conferences are among the best ways to learn, develop new skills, meet peers, discuss ideas, and discover how others are applying AI, analytics, and machine learning in their work.

Here are several conferences for data scientists you should consider attending.

Continue reading →

Data Pipeline v4.1 Adds MongoDB Support

We’re excited to introduce Data Pipeline version 4.1, the second update on our 2016 roadmap.

This release features MongoDB integration, expression language additions, and improved transformations and joins. We’ve also thrown in a ton of examples for all the new 4.1 and 4.0 features. Enjoy.

Continue reading →

Data Pipeline 3.1.4 Now Available

Data Pipeline v3.1.4 is now available for download. This release includes support for MySQL upserts, lower JSON and XML memory usage, bug fixes, and more.

Continue reading →

Data Pipeline 3.1 Now Available

Data Pipeline 3.1 is now available for download. This is a milestone release that adds native support for hierarchical data (nested records and multidimensional arrays).

Continue reading →

How to convert XML to Excel (2023)

Posted On 22 Jun 2015
By The DataPipeline Team
In Data Pipeline, Excel, Java, News, XML

Data Pipeline makes it easy to read, transform, and write XML and Excel files. This post shows you how to load data from an on-disk XML file, apply transformations on the fly, and save the result to an Excel file.

Continue reading →
