Top

このブログはなんですか Talk of tech innovation is bullsh*t. Shut up and get the work done – says Linus Torvalds Deep Leaning 勉強用ブログ - DeepLearningを勉強する人何を実装しましたか(Deep Learning) Generative DCGAN WGAN Reinforcement Lear…

2017-06-20

Pointer Networks

DeepLearning Python TensorFlow 自分用 RNN

[1506.03134] Pointer Networks 論文まとめ入力系列上のインデックスに対応した要素から成る出力系列の条件付き確率分布を学習するアーキテクチャ. この種の問題は、出力の各ステップでのターゲットクラスの数が、可変である入力長にいぞんしているので、Se…

2017-03-31

Prioritized Experience Replay

DeepLearning Python ReinforcementLearning TensorFlow 強化学習自分用

[1511.05952] Prioritized Experience Replay 論文まとめ Online RLの問題点遷移(transition)間の依存関係の影響が大きいレアな遷移をすぐに捨ててしまうそこで、 Experience Replay(ER) DQNでは、replay mem.からランダムサンプリングしたミニバッチを使…

2017-03-13

Deep Reinforcement Learning with Double Q-learning (Double DQN)

DeepLearning Python ReinforcementLearning TensorFlow 強化学習自分用

Deep Reinforcement Learning with Double Q-learning [1509.06461] Deep Reinforcement Learning with Double Q-learning 論文まとめ Q-learningは、maxを取っている関係上、action-valueを過大評価(overestimate)する傾向があることが知られている.これま…

2017-03-12

Deep Q Network (DQN)

DeepLearning Python ReinforcementLearning TensorFlow 強化学習

http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html [1312.5602] Playing Atari with Deep Reinforcement LearningQ-Learningにおいて、action-value functionをDNNで関数近似したもので、Deep RLの皮切りとなった. Q-Learningとはなん…

2017-02-28

強化学習基礎（メモ書き）

DeepLearning ReinforcementLearning 自分用強化学習

強化学習基礎 MDP→TD→Q-Learning→DQN手前まで、強化学習の基本的なことをかいつまんだまとめ（自分用の自己満メモ）素晴らしい講義 David Silver氏による強化学習講義これにほぼ対応した素晴らしい演習問題+α GitHub - dennybritz/reinforcement-learning:…

2017-02-28

Wasserstein GAN (WGAN)

DeepLearning GAN 自分用 Python TensorFlow

Wasserstein GAN (WGAN) [1701.07875] Wasserstein GAN ([1701.04862] Towards Principled Methods for Training Generative Adversarial Networks WGANの話の前にこの話がある) Martin Arjovsky氏の実装(Torch) GitHub - martinarjovsky/WassersteinGANWGAN…

2017-02-26

Deep Convolutional Generative Adversarial Networks (DCGAN)

DeepLearning GAN 自分用 Python TensorFlow

Deep Convolutional Generative Adversarial Networks [1511.06434] Unsupervised Representation Learning with Deep Convolutional Generative Adversarial NetworksDCGANをTensorflowで実装データはMNIST ちなみにTensorflowの経験はそんなにない（ので…

2017-02-26

Deep Leaning 勉強用ブログ

DeepLearning 自分用 Python TensorFlow

このブログについてザコい学生のブログこれから自分の勉強としてDeep Leaning関連の論文等を実装していき、その過程をブログとして残しておきたい. (モチベーションのためにも) Linuxカーネルの開発者であるLinux Benedict Torvalds氏も以下のように述べて…