Running Jobs
Data Pipeline transfers records from readers to writers using jobs. The chain of readers and writers in a job is called a pipeline. You might find the terms job and pipeline used interchangeably.
As with the Java I/O classes, you can program the transfer yourself (open streams, read and write records in a loop, close streams); however, the Job class offers many benefits:
- It can be monitored and managed using JMX (Java Management Extensions)
- It can be run synchronously or asynchronously
- It can be cancelled, paused, and resumed
- It's thread-safe
- Its state can safely be logged at any time or on completion
- It publishes lifecycle events to the system event bus
- It has built-in error handling
- It supports callbacks
- It tracks stats about itself
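
For comparison, the do-it-yourself approach mentioned above (open streams, read and write records in a loop, close streams) might look roughly like the sketch below. It uses only the standard java.io classes, not the Data Pipeline API. The class name ManualTransfer, the transfer method, and the sample records are hypothetical and exist only for illustration.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.Writer;

// Hypothetical illustration class; not part of Data Pipeline.
public class ManualTransfer {

    // Copies line-based records from source to target and
    // returns the number of records copied.
    static int transfer(Reader source, Writer target) throws IOException {
        int count = 0;
        // try-with-resources closes both streams, even if an exception is thrown
        try (BufferedReader reader = new BufferedReader(source);
             BufferedWriter writer = new BufferedWriter(target)) {
            String record;
            // Read-write loop: one line is treated as one record
            while ((record = reader.readLine()) != null) {
                writer.write(record);
                writer.newLine();
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) throws IOException {
        StringWriter out = new StringWriter();
        int copied = transfer(new StringReader("alice,1\nbob,2\ncarol,3"), out);
        System.out.println(copied + " records transferred");
    }
}
```

Nothing in this loop provides monitoring, cancellation, pausing, or lifecycle events; each would have to be written by hand, which is the gap the Job class is described as filling.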