DMX-h Use Case Accelerator: HDFS Extract

Summary

Syncsort provides a set of use case accelerators to help users understand how to access HDFS directly from DMExpress.

The HDFS Extract use case accelerator (UCA) demonstrates how to extract TPC-H supplier data from HDFS and write it to the local file system.

Resolution

HDFS Extract UCA Description

Source

The Source is defined using an HDFS connection to access the specified source file in HDFS.

Target

The Target is defined as a UNIX text file on the local file system.

Because the source file resides in HDFS, you can run this use case accelerator on any Linux system that has an HDFS client configured to connect to a Hadoop cluster.
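
For comparison, the same source-to-target movement can be sketched outside DMExpress with the standard Hadoop command-line client. The Python sketch below is an illustration only, not part of the UCA: it assumes the hdfs command is on the PATH of the Linux system and already configured for the cluster. The function names and the paths in the usage comment are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List


def build_extract_command(hdfs_path: str, local_path: str) -> List[str]:
    """Return the `hdfs dfs -get` command that copies one HDFS file to local disk."""
    return ["hdfs", "dfs", "-get", hdfs_path, local_path]


def extract(hdfs_path: str, local_path: str) -> Path:
    """Copy hdfs_path to local_path using the locally configured HDFS client."""
    # The extract can only run where an HDFS client is installed and configured.
    if shutil.which("hdfs") is None:
        raise RuntimeError("no 'hdfs' client found on PATH")
    target = Path(local_path)
    # Make sure the local target directory exists before the copy starts.
    target.parent.mkdir(parents=True, exist_ok=True)
    # check=True raises CalledProcessError if the client reports a failure.
    subprocess.run(build_extract_command(hdfs_path, str(target)), check=True)
    return target


# Usage (hypothetical paths, for illustration only):
#   extract("/user/example/tpch/supplier.tbl", "/tmp/uca_hdfs_extract/supplier.txt")
```

By default the Hadoop client refuses to overwrite an existing local file, so a rerun of this sketch needs a fresh target path or a cleanup step first.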

Attachments

The following attachments are available for running this UCA:

Additional Information

See the Guide to DMX-h ETL Use Case Accelerators for an overview of how the set of use case accelerators is organized and how to run them.

For general guidance on developing and running DMX-h ETL solutions, see Developing DMX-h ETL Jobs and Running DMX-h ETL Jobs in the DMExpress Help.
