Open topic with navigation
Syncsort has provided a set of use case accelerators (UCAs) to help users understand how to implement DMX-h ETL solutions in a Hadoop MapReduce framework.
The File Join Large use case accelerator demonstrates how to perform a join of two large files stored in HDFS. This example performs an inner join, but could easily be modified to perform a left-outer, right-outer, or full-outer join.
The following attachments are available for understanding and running this UCA:
See the Guide to DMX-h ETL Use Case Accelerators for an overview of how the set of use case accelerators are organized and how to run them.
For general guidance on developing and running DMX-h ETL solutions, see Developing DMX-h ETL Jobs and Running DMX-h ETL Jobs in the DMExpress Help.
Copyright © 2016 Syncsort All rights reserved.