Which two methods could be used to dramatically improve performance of this job?

seenagape

12 years ago

A DataStage job uses an Inner Join to combine data from two source parallel datasets that were
written to disk in sort order based on the join key columns. Which two methods could be used to
dramatically improve performance of this job? (Choose two.)

A.
Disable job monitoring.

B.
Set the environment variable $APT_SORT_INSERTION_CHECK_ONLY.

C.
Unset the Preserve Partitioning flag on the output of each parallel dataset.

D.
Explicitly specify hash partitioning and sorting on each input to the Join stage.

E.
Add a parallel sort stage before each Join input,specifying the “Don’t Sort,Previously Grouped”
sort key mode for each key.

Explanation: