You are building an Azure Machine Learning workflow by using Azure Machine Learning Studio.You create an Azure notebook that supports the Microsoft Cognitive Toolkit.
You need to ensure that the stochastic gradient descent (SGD) configuration maximizes the samples per
second and supports parallel modeling that is managed by a parameter server.
Which SGD algorithm should you use?

A.
DataParallelASGD
B.
DataParallelSGD
C.
ModelAveragingSGD
D.
BlockMomentumSGD
If it is to be managed by parameter server, then it is DataParallelASGD
6
0
DataParallelASGD.
https://docs.microsoft.com/en-us/cognitive-toolkit/multiple-gpus-and-machines#5-data-parallel-training-with-1-bit-sgd
8 Data-Parallel Training with Parameter Server
2
0
Parameter server is a widely used framework in distributed machine learning. The most important benefit it brings is the asynchronous parallel training with many workers.
0
0
More new 70-774 Questions: https://drive.google.com/drive/folders/1WVXCup_qKNm0iitL4rKQ_hsgZd6M_dQD?usp=sharing
0
0