Which SGD algorithm should you use?

admin, December 2, 2017

You are building an Azure Machine Learning workflow by using Azure Machine Learning Studio. You create an Azure notebook that supports the Microsoft Cognitive Toolkit.
You need to ensure that the stochastic gradient descent (SGD) configuration maximizes the samples per second and supports parallel modeling that is managed by a parameter server.
Which SGD algorithm should you use?

A. DataParallelASGD

B. DataParallelSGD

C. ModelAveragingSGD

D. BlockMomentumSGD

4 Comments on “Which SGD algorithm should you use?”

Ruchita says:

December 14, 2017 at 2:19 am

If it is to be managed by a parameter server, then it is DataParallelASGD.

1. rai says:
  
  March 5, 2018 at 11:10 pm
  
  DataParallelASGD.
  
  https://docs.microsoft.com/en-us/cognitive-toolkit/multiple-gpus-and-machines#5-data-parallel-training-with-1-bit-sgd
  See section 8, "Data-Parallel Training with Parameter Server".
  
  1. rai says:
    
    March 15, 2018 at 3:24 am
    
    A parameter server is a widely used framework in distributed machine learning. Its most important benefit is asynchronous parallel training with many workers.
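To make the asynchronous idea concrete, here is a minimal, single-process sketch in plain Python. It is not CNTK code and does not use the CNTK API; the class names (`ParameterServer`, `Worker`), the toy regression problem, and all hyperparameters are assumptions chosen for illustration. Each worker computes gradients against a possibly stale copy of the weight and pushes them to the server, which applies every update immediately instead of waiting for the other workers.

```python
import random


class ParameterServer:
    """Hypothetical server: holds the shared weight and applies updates on arrival."""

    def __init__(self, w0, lr):
        self.w = w0
        self.lr = lr

    def pull(self):
        return self.w

    def push(self, grad):
        # No barrier: the update is applied without waiting for other workers.
        self.w -= self.lr * grad


class Worker:
    """Hypothetical worker: trains on its own data shard with a stale local weight."""

    def __init__(self, shard, pull_every):
        self.shard = shard
        self.pull_every = pull_every
        self.local_w = 0.0
        self.steps = 0

    def step(self, server, rng):
        # Refresh the local copy only every `pull_every` steps, so most
        # gradients are computed against slightly outdated parameters.
        if self.steps % self.pull_every == 0:
            self.local_w = server.pull()
        x, y = rng.choice(self.shard)
        grad = 2.0 * (self.local_w * x - y) * x  # d/dw of (w*x - y)^2
        server.push(grad)
        self.steps += 1


def train_async(num_workers=4, rounds=3000, pull_every=4, seed=0):
    rng = random.Random(seed)
    true_w = 3.0
    xs = [rng.uniform(-1.0, 1.0) for _ in range(200)]
    data = [(x, true_w * x) for x in xs]
    shards = [data[i::num_workers] for i in range(num_workers)]
    server = ParameterServer(w0=0.0, lr=0.02)
    workers = [Worker(shard, pull_every) for shard in shards]
    for _ in range(rounds):
        # A random arrival order stands in for workers running at different speeds.
        rng.choice(workers).step(server, rng)
    return server.w


if __name__ == "__main__":
    print(train_async())
```

In a synchronous data-parallel scheme, every worker would wait at a barrier until all gradients for the step had been aggregated. Here no worker ever waits, which is where the throughput gain comes from, at the cost of some gradients being computed on stale weights.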
    
Levo says:

December 26, 2017 at 5:05 am

More new 70-774 Questions: https://drive.google.com/drive/folders/1WVXCup_qKNm0iitL4rKQ_hsgZd6M_dQD?usp=sharing
