鍙傝冩枃妗
鍓嶈█
杩戞鏃堕棿锛宮l-agent鍗囩骇鍒颁簡v0.3鐗堟湰锛屽仛浜嗕竴浜涘彉鏇达紝瀵艰嚧涔嬪墠鐨勬枃妗cml-agent v0.2锛歐in10涓嬬幆澧冨畨瑁銆嬮儴鍒嗗唴瀹规棤娉曞湪ml-agent v0.3涓娇鐢ㄣ傛渶涓昏鐨勬槸ppo.ipynb鏂囦欢绉婚櫎锛屽鑷存棤娉曠敤ppo杩涜璁粌銆傛墍浠ヨ繖閲岃繘琛岄噸鏂版暣鐞嗐
杞欢瀹夎
鎺ㄨ崘鐨勭幆澧
Phython3 64浣嶏紙ml-agent v0.3涓嶅啀鏀寔Phython2锛
Jupyter notebook
TensorFlow
Visual Studio 2017
Unity3d 2017.1(鏈枃浣跨敤)/2018.1
1銆佸厠闅唌l-agent
浠嶨ithub缃戠珯涓https://github.com/Unity-Technologies/ml-agents鍏嬮殕锛堜笅杞斤級ml-agent锛屼唬鐮侊紝鏀惧湪浠绘剰浣嶇疆涓嬨(鏈枃鏀惧湪D:\ml-agent)
2銆佸畨瑁匳isual Studio/Unity3d
瀹夎鐣ヨ繃
3銆佸畨瑁匒naconda 64浣
Anaconda鍐呯疆浜哖hython3 64浣嶅拰Jupyter notebook浠ュ強鍏朵粬渚垮埄鐨勫姛鑳斤紝鎵浠ヨ繕鏄夋嫨瀹夎Anaconda绠鍖栨暣涓狿hython鐨勮繃绋嬨備笅杞藉湴鍧https://www.anaconda.com/download/#windows銆傚畨瑁呯暐杩囷紙鏈枃瀹夎鍦‵鐩橈級銆傚畨瑁呭畬鎴愬悗鍒╃敤Anaconda鐨凙naconda Navigator鍒涘缓涓涓幆澧(杩欓噷鐜鍚嶄负tensorflow)锛孭hython鐗堟湰閫夋嫨3.6銆
4銆佸畨瑁卪l-agent渚濊禆搴
鍦ㄥ紑濮嬭彍鍗曚腑鎵撳紑Anacoda Prompt锛屽湪鍛戒护琛屼腑杈撳叆涓涓嬪懡浠ゆ潵婵娲诲垰鍒氬垱寤虹殑鐜
activate tensorflow
杈撳叆鍛戒护鍒囨崲鍒癿l-agent鎵鍦ㄧ殑鐩綍涓璸ython鐩綍鐨勪綅缃傛瘮濡俶l-agent瀹夎鐩綍涓篋:\Git\ml-agent锛屽垯杈撳叆
cd D:\Git\ml-agent\python
濡傛灉浣犵殑Anaconda涓嶆槸瀹夎鍦╩l-agent鐩綍鐩稿悓鐨勭鐩橈紝閭d箞闇瑕佸垏鎹㈠埌ml-agen鎵鍦ㄧ殑纾佺洏銆傛瘮濡傝繖閲孉naconda鐨勫畨瑁呯洰褰曚负F鐩橈紝ml-agent瀹夎鐩綍涓篋:\ml-agent锛屽垯闇瑕佸垏鎹㈠埌D鐩橈紝杈撳叆
D:
鐒跺悗寮濮嬪畨瑁匘emo鎵闇鐨勭幆澧冿紝杈撳叆鍛戒护
pip install .
娉ㄦ剰锛屼笉瑕侀仐婕忔渶鍚庣殑鐐瑰彿銆傜瓑寰呭畨瑁呭畬鎴愬嵆鍙傝鍛戒护浼氬畨瑁呮墍鏈夌殑渚濊禆搴擄紝鍖呮嫭tensorflow銆傛鏃朵笉鐢ㄥ叧闂繖涓獥鍙c
5銆佺紪璇慤nity绋嬪簭
浣跨敤Unity2017鎵撳紑ml-agent涓媢nity-environment鏂囦欢澶广
鎵撳紑Assets\ML-Agents\Examples\3DBall鐩綍涓嬬殑3DBall鍦烘櫙鏂囦欢銆傚湪鍦烘櫙涓夋嫨Ball3DAcademy涓嬬殑Ball3DBrain鐗╀綋锛屽皢TypeOfBrain淇敼涓篍xternal锛岃〃绀轰粠Tensorflow涓幏鍙栨暟鎹
鑿滃崟涓夋嫨File->Build Setting锛屾坊鍔犲綋鍓嶆墍鍦ㄥ満鏅傦紙鍙互鍕鹃塂evelopment Build浠ヤ究鏌ョ湅杈撳嚭锛
鐐瑰嚮PlayerSeting锛屾鏌ヨ缃
Resolution and Presentation -> 鍕鹃塕un in Background
Resolution and Presentation -> Display Resolution Dialog璁剧疆涓篸isable
鍥炲埌Build Setting闈㈡澘锛岀偣鍑籅uild锛岀紪璇戝埌ml-agent鐨刾ython鐩綍涓傚悕涓3dball.exe
6銆佸紑濮嬭缁
娉ㄦ剰锛岃缁冩柟娉曞拰ml-agent v0.2涓嶅悓銆倂0.2浣跨敤Jupyter notebook杩愯ppo.ipynb鏂囦欢銆備絾鏄痸0.3鏀逛负浣跨敤鍛戒护琛岀殑鏂规硶銆
鎴戜滑鍥炲埌Anacoda Prompt锛岃緭鍏ヤ互涓嬪懡浠わ細
python learn.py 3dball --run-id=test --train
鍏朵腑
learn.py鍖呭惈浜嗗ぇ閲忕殑ml绠楁硶锛屽寘鎷琾po銆
3dball灏辨槸鍒氬垰鎴戜滑鐢╱nity鐢熸垚鐨別xe鏂囦欢鐨勫悕绉般
--run-id=test鍙互涓嶅啓锛屽彧鏄0鏄庤繖娆¤缁冪殑id銆傛瘮濡傚彲浠ョ敤tensorboard鏉ョ湅
--train琛ㄧず澹版槑鎵ц鐨勬槸璁粌妯″紡
濡傛灉杩欒繖浜涘懡浠ゅ弬鏁版劅鍏磋叮锛岃鍙傝Training ML-Agents
鐢变簬璁粌鐨凷tep涓5.0e4(5*10鐨4娆℃柟)锛屽鏋滅敤cpu绠楁瘮杈冩參锛屽彲浠ユ殏鏃朵慨鏀硅秴鍙傛暟閰嶇疆鏂囦欢trainer_config.yaml锛屽皢Ball3DBrain涓嬪鍔犱竴琛宮ax_steps: 2.0e4锛堟敞鎰忥紝鐢变簬璇ユ枃浠堕噰鐢▂aml鏍煎紡锛屽鏂囦欢鐨勭紪鐮佹牸寮忓拰绌烘牸瑕佹眰闈炲父涓ユ牸锛屽鏋滃紓甯革紝灏嗘棤娉曡繘琛岃缁冦俶ax鍓嶉潰鏈4涓┖鏍硷紝涓嶆槸tab銆傚啋鍙峰悗闈㈡湁涓涓┖鏍硷紝鏁翠釜鏂囦欢閲囩敤UTF8缂栫爜锛夈
璁粌缁撴灉鏁版嵁淇濆瓨鍦╩odels\test\涓
璁粌鐨勭粨鏋滆棰戯細Unity ml-agent v0.3瀹炶返