Open topic with navigation
The HDFS Extract use case accelerator demonstrates how to extract TPC-H supplier data from HDFS and write it to the local file system.
The Source is defined using an HDFS connection to access the specified source file in HDFS.
The Target is defined to be a UNIX text file that will reside on the local file system.
Since the source file resides in HDFS, you can run this use case accelerator on any Linux system that has an HDFS client configured to connect to a Hadoop cluster.
The following attachments are available for running this UCA:
See the Guide to DMX-h ETL Use Case Accelerators for an overview of how the set of use case accelerators are organized and how to run them.
For general guidance on developing and running DMX-h ETL solutions, see Developing DMX-h ETL Jobs and Running DMX-h ETL Jobs in the DMExpress Help.
Copyright © 2016 Syncsort All rights reserved.