IBM® InfoSphere® DataStage® is a leading ETL platform that integrates data across multiple enterprise systems. It leverages a high performance parallel framework, available on-premises or in the cloud. The scalable platform provides extended metadata management and enterprise connectivity. It integrates heterogeneous data, including big data at rest (Hadoop-based) or big data in motion (stream-based), on both distributed and mainframe platforms. It supports IBM Db2® Z and Db2 for z/OS®, applies workload and business rules, and integrates real-time data in an easy to deploy platform.

DataStage ETL Job Build and Run

6 minute demo

Show the basic concepts of building and running ETL jobs in DataStage and see how users can track the flow of data through lineage analysis

Enterprise Data Warehouse Offloading

7 minute demo

Companies are realizing large cost savings by offloading data from traditional data warehouses into Hadoop Cluster - using DataStage to lift and transform data while running inside Hadoop Clusters to reduce license and hardware requirements.

Using DataStage to synchronize on premise and Cloud data repositories

9 minute demo

DataStage ETL tool is used with IBM Data Replication to detect changes in transaction data stores and transform them for Cloud applications in Realtime

Tour IBM InfoSphere DataStage: Offload Data Warehousing to Hadoop by using DataStage

Use IBM® InfoSphere® DataStage® to load Hadoop and use YARN to manage DataStage workloads in a Hadoop cluster

  • Learn how to run DataStage traditional ETL jobs
  • Configure DataStage to run inside Hadoop Clusters
  • Examine execution logs to esnure configuration worked correctly

15-30 minute introduction