GD vs SGD

##GD

small number of model updates

accurate

each epoch may be expensive

easy to parallelize

##SGD

Requires lots of model updates

Not as accurate, but often good enough

A log of progress in one pass for big data

Not trivial to parallelize

最后編輯于
?著作權歸作者所有,轉載或內容合作請聯系作者
平臺聲明:文章內容(如有圖片或視頻亦包括在內)由作者上傳并發布,文章內容僅代表作者本人觀點,簡書系信息發布平臺,僅提供信息存儲服務。

推薦閱讀更多精彩內容