Author: Greg Brockman / Link: https://www.quora.com/What-are-the-best-ways-to-pick-up-Deep-Learning-skills-as-an-engineer/answer/Greg-Brockman

{From Quora} What are the best ways to pick up Deep Learning skills as an engineer?

Greg Brockman, Co-Founder & CTO @ OpenAI, previously CTO @ Stripe

Answered May 19, 2016 · Featured on HuffPost and 2 more · Upvoted by Anuran Roychowdhury, MS Computer Engineering & Machine Learning, University of Florida (2018) and Tao Xu, Built ML systems at Airbnb, Quora, Facebook and Microsoft.

Because deep learning started working so recently and is moving so quickly, it's a relatively shallow field (no pun intended) and can be picked up without much prior background.

That being said, different people will prefer different learning approaches.

If you want to read one main resource... the Goodfellow, Bengio, Courville book (available for free from http://www.deeplearningbook.org/) is an extremely comprehensive survey of the field. It contains essentially all the concepts and intuition needed for deep learning engineering (except reinforcement learning).

If you'd like to take courses... Pieter Abbeel and Wojciech Zaremba suggest the following course sequence:

- Linear Algebra — Stephen Boyd’s EE263 (Stanford)
- Neural Networks for Machine Learning — Geoff Hinton (Coursera)
- Neural Nets — Andrej Karpathy’s CS231N (Stanford)
- Advanced Robotics (the MDP / optimal control lectures) — Pieter Abbeel’s CS287 (Berkeley)
- Deep RL — John Schulman’s CS294-112 (Berkeley)

(Pieter also recommends the Cover & Thomas information theory book and the Nocedal & Wright nonlinear optimization book.)

If you'd like to get your hands dirty... Ilya Sutskever recommends implementing simple MNIST classifiers, small convnets, reimplementing char-rnn, and then playing with a big convnet. Personally, I started out by picking Kaggle competitions (especially the "Knowledge" ones) and using those as a source of problems. Implementing agents for OpenAI Gym (or algorithms for the set of research problems we’ll be releasing soon) could also be a good starting place.
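To give a feel for what the first of those exercises involves, here is a minimal sketch of a linear softmax classifier trained with plain SGD. It is only an illustration under stated assumptions: the data is a synthetic three-class toy set standing in for MNIST (so the script runs with the standard library alone), and every name in it (`make_toy_data`, `train`, `accuracy`, the learning rate, the epoch count) is hypothetical rather than taken from any course or library mentioned above.

```python
import math
import random


def softmax(scores):
    """Turn raw class scores into probabilities (shifted for numerical stability)."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_toy_data(n_per_class, rng):
    """Hypothetical stand-in for MNIST: 2-D points drawn around three class centers."""
    centers = [(-2.0, -2.0), (2.0, -2.0), (0.0, 2.0)]
    data = []
    for label, (cx, cy) in enumerate(centers):
        for _ in range(n_per_class):
            data.append(([rng.gauss(cx, 0.5), rng.gauss(cy, 0.5)], label))
    rng.shuffle(data)
    return data


def predict(weights, biases, x):
    """Class probabilities from one linear score per class."""
    scores = [sum(w * xi for w, xi in zip(row, x)) + b
              for row, b in zip(weights, biases)]
    return softmax(scores)


def train(data, n_classes, n_features, lr=0.1, epochs=20):
    """Plain SGD on the cross-entropy loss of a linear softmax classifier."""
    weights = [[0.0] * n_features for _ in range(n_classes)]
    biases = [0.0] * n_classes
    for _ in range(epochs):
        for x, label in data:
            probs = predict(weights, biases, x)
            for k in range(n_classes):
                # Gradient of cross-entropy w.r.t. score k is p_k - 1[k == label].
                grad = probs[k] - (1.0 if k == label else 0.0)
                for j in range(n_features):
                    weights[k][j] -= lr * grad * x[j]
                biases[k] -= lr * grad
    return weights, biases


def accuracy(weights, biases, data):
    """Fraction of examples whose most probable class matches the label."""
    correct = 0
    for x, label in data:
        probs = predict(weights, biases, x)
        if probs.index(max(probs)) == label:
            correct += 1
    return correct / len(data)


rng = random.Random(0)  # fixed seed so reruns are comparable
train_set = make_toy_data(50, rng)
test_set = make_toy_data(20, rng)
weights, biases = train(train_set, n_classes=3, n_features=2)
print(f"held-out accuracy: {accuracy(weights, biases, test_set):.2f}")
```

Swapping the toy points for real MNIST pixels (784 features, 10 classes) keeps the same structure, though a pure-Python loop would be slow at that size and a numerical library would be the natural next step.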

I recommend a mix of all of the above approaches. Deep learning requires a mix of theoretical and empirical understanding. It can be hard to know where to start, and looking up a term on Wikipedia usually just yields another six terms to look up. The books and courses can both solve this problem by giving you a well-paced introduction to the subject.

On the other hand, everything usually feels abstract until you start implementing. What matters most is implementing a variety of models and making them really work. As Ilya likes to say, you need to be prepared to suffer: expect hours of debugging models that refuse to learn, many passes restructuring your code, and building up your own conventions for changing various hyperparameters. But each time you suffer, know that you've built a little bit of skill that will be invaluable for the future.
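As one possible example of such a convention (an assumption for illustration, not something prescribed in this answer), hyperparameters can live in a single immutable record, with each experiment derived from a baseline by overriding one field. The `HParams` fields and the `run_name` helper below are hypothetical names.

```python
from dataclasses import asdict, dataclass, replace


@dataclass(frozen=True)
class HParams:
    """Hypothetical hyperparameter set; the field names are illustrative only."""
    learning_rate: float = 0.1
    batch_size: int = 32
    epochs: int = 20
    seed: int = 0


def run_name(hp, baseline):
    """Name a run by the fields that differ from the baseline."""
    base = asdict(baseline)
    diffs = [f"{k}={v}" for k, v in asdict(hp).items() if base[k] != v]
    return "_".join(diffs) or "baseline"


baseline = HParams()
# Override a single field per run so a change in behavior maps to one knob.
sweep = [replace(baseline, learning_rate=lr) for lr in (0.3, 0.03)]
for hp in [baseline] + sweep:
    print(run_name(hp, baseline), asdict(hp))
```

Because the record is frozen, a run cannot silently mutate its own settings, and the derived name makes log directories self-describing.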

