How many instances of the Funnel stage run in parallel?
In the exhibit, a Funnel stage has two input links. Input 1 (Seq_File) comes from a Sequential File
stage with “Readers per Node” set to “2”. Input 2 (Dataset) comes from a dataset created with 3
partitions. Within the Funnel stage, the funnel type is set to “Sequence”. The parallel configuration
file contains three nodes.
How many instances of the Funnel stage run in parallel?
Which two steps will accomplish this?
A job contains a Sort stage that sorts a large volume of data across a cluster of servers. Due to
new requirements, the customer has requested that this sorting be done on a subset of servers
identified in the configuration file to minimize impact on database nodes. The
$APT_CONFIG_FILE shows no sort disk pool for any node. Which two steps will accomplish this?
(Choose two.)
Which technique is the most appropriate?
You have a dataset hash-partitioned and sorted on CustID. You need to create a Sequential File
sorted on CustID from this dataset. Which technique is the most appropriate?
How can a fixed-width single sequential file be read in parallel?
You are running a DataStage job using a 2-node configuration file. How can a fixed-width single
sequential file be read in parallel? (Choose two.)
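As background to the question above, the property that makes a fixed-width file splittable is that every record has the same byte length, so the starting offset of any record can be computed without scanning for delimiters. The Python sketch below is only an illustration of that arithmetic, not DataStage code and not a description of how DataStage implements it. The function name `reader_ranges` and its parameters are hypothetical.

```python
def reader_ranges(file_size, record_length, readers):
    """Split a fixed-width file into contiguous byte ranges, one per reader.

    Returns a list of (start_byte_offset, record_count) tuples.
    All names here are hypothetical and used for illustration only.
    """
    if record_length <= 0 or readers <= 0:
        raise ValueError("record_length and readers must be positive")
    if file_size % record_length != 0:
        raise ValueError("file size is not a whole number of records")

    total_records = file_size // record_length
    # Give each reader the same number of records; spread any remainder
    # over the first few readers.
    base, extra = divmod(total_records, readers)

    ranges = []
    first_record = 0
    for i in range(readers):
        count = base + (1 if i < extra else 0)
        ranges.append((first_record * record_length, count))
        first_record += count
    return ranges


# Example: a 700-byte file of 100-byte records split across 2 readers.
example = reader_ranges(700, 100, 2)
```

Each reader could then seek to its start offset and read its own record count independently of the others, which is not possible when record boundaries must be found by scanning for delimiters.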
What is the lowest CPU cost partitioning method?
What is the lowest CPU cost partitioning method?
Which two statements accomplish this requirement?
The purchase history record contains CustID, ProductID, ProductType and TotalAmount. You
need to retain the record with the greatest TotalAmount per CustID and ProductType using a
Remove Duplicates stage. Which two statements accomplish this requirement? (Choose two.)
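The requirement above can be illustrated outside DataStage. The Python sketch below is not DataStage code. It uses hypothetical sample rows and shows only the general idea of ordering rows by the grouping keys with TotalAmount descending, then keeping the first row of each (CustID, ProductType) group. The function name `keep_greatest` is an assumption made for the example.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical sample rows (not taken from the source).
rows = [
    {"CustID": 1, "ProductID": "P1", "ProductType": "A", "TotalAmount": 50.0},
    {"CustID": 1, "ProductID": "P2", "ProductType": "A", "TotalAmount": 75.0},
    {"CustID": 1, "ProductID": "P3", "ProductType": "B", "TotalAmount": 20.0},
    {"CustID": 2, "ProductID": "P1", "ProductType": "A", "TotalAmount": 10.0},
]


def keep_greatest(records):
    """Keep one row per (CustID, ProductType): the one with the largest TotalAmount."""
    group_key = itemgetter("CustID", "ProductType")
    # Sort by the grouping keys, then by TotalAmount descending, so the
    # row to retain is the first row of every group.
    ordered = sorted(
        records,
        key=lambda r: (r["CustID"], r["ProductType"], -r["TotalAmount"]),
    )
    # groupby needs its input already sorted on the grouping key.
    return [next(group) for _, group in groupby(ordered, key=group_key)]


result = keep_greatest(rows)
```

The sort order matters: keeping the first row of a group only yields the greatest TotalAmount if the rows within each group are already ordered by TotalAmount descending.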
What three requirements must be met in order to use this stage?
You are required to use a Make Vector stage in your job. What three requirements must be met in
order to use this stage? (Choose three.)
What two tasks must be done to meet these requirements?
A customer requires reading a fixed-length sequential file in parallel where input rows can be
identified. What two tasks must be done to meet these requirements? (Choose two.)
Which three source stages require a schema file when using RCP?
Which three source stages require a schema file when using RCP? (Choose three.)
Which approach will retrieve and process all five files in the minimal amount of time?
A client has multiple servers connected with a high bandwidth switch and has installed DataStage
on one of them. There are five files (with the same record layout) that must be retrieved from three
other servers using FTP. Which approach will retrieve and process all five files in the minimal
amount of time?