Open topic with navigation
DMExpress provides Hadoop Distributed File System (HDFS) connectivity for both sources and targets, and should be used in the following cases:
It should not be used for saving DMExpress job, task, or metadata files. Those files should be saved to the local file system.
DMExpress supports HDFS connectivity as a type of remote file connection.
Once an HDFS connection is defined, it can be used to read/write data files from/to a Hadoop cluster.
Specify the HDFS connection when defining a source in a DMExpress task to read data files stored in a Hadoop cluster.
Specify the HDFS connection when defining a target in a DMExpress task to load data files into a Hadoop cluster.
Do not use a DMExpress HDFS connection to store DMExpress job, task, or metadata files. These files should be stored in the local file system only.
Copyright © 2016 Syncsort All rights reserved.