谭升的博客

【强化学习】 1.4.0 “进化方法”和 “决策梯度方法” 概论

发表于 2018-09-20 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文介绍强化学习中的一些局限（limitation）和机遇（scope），介绍进化方法和决策梯度方法的区别和优劣
Keywords: Evolutionary Method，进化方法，Policy Gradient Methods，决策梯度方法

阅读全文 »

【强化学习】1.3 强化学习的基础元素

发表于 2018-09-12 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文介绍除了agent和environment以外的，对于强化学习最重要的最基础的四个元素。
Keywords: Policy，策略，Reward Signal，奖励，Value Function，评价函数，Model of Environment，环境模型

阅读全文 »

【强化学习】1-2 强化学习举例

发表于 2018-08-30 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文介绍几个对应于强化学习的生活中的例子，来具体化前面提到的名词和几个重要理论在自然界中的表现。
Keywords: 强化学习，强化学习举例，Agent，Environment，环境，Reaction，反应

阅读全文 »

【强化学习】1-1-4 强化学习和人工智能

发表于 2018-08-29 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文主要介绍强化学习现阶段的情况，以及未来的去向的一种预测。
Keywords: modern Reinforcement Learning，现代强化学习，Psychology，心理学，Neuroscience，神经系统科学，mathematics，数学

阅读全文 »

【强化学习】1-1-3 强化学习基本框架

发表于 2018-08-29 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文简要介绍强化学习的框架，以及框架中几个概念的基本关系
Keywords: agent，real-time，organism，robot，framwork

阅读全文 »

【强化学习】1-1-2 “探索”(Exploration)还是“ 利用”(Exploitation)都要“面向目标”(Goal-Direct)

发表于 2018-08-27 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文介绍强化学习中最重要的一个挑战—— “探索”(Exploration)还是“ 利用”(Exploitation)
Keywords: Trade-Off，Exploration，Exploitation，Goal-Direct，平衡，探索，利用，目标导向

阅读全文 »

【强化学习】1.1.1 强化学习、监督学习和非监督学习

发表于 2018-08-26 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文主要介绍强化学习，监督学习，非监督学习之间的不同。
Keywords: Supervised Learning，Unsupervised Learning，Reinforcement Learning

阅读全文 »

【强化学习】1.1.0 强化学习介绍

发表于 2018-08-25 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文介绍Reinforcement Learning的具体特点和与其他机器学习算法不同之处，本文是一个骨架性的文章，所有专有名词都保持英文原始单词，具体内容会在后续中给出详细解答。
Keywords: Reinforcement Learning，Situation，Action，Enviroment，Closed-loop，Optimal Control，Markov Decision Processes，MDPs

阅读全文 »

【强化学习】1.0 强化学习介绍

发表于 2018-08-22 | 分类于 Reinforcement Learning ， RL-An Introduction | 评论数： | 阅读次数：

Abstract: 本文是《Reinforcement Learning: An Introduction》的第一篇，介绍本书以及本系列的主要写作内容
Keywords: 学习的本质，诱发与结果(cause and effect)，计算近似，直接建模，强化学习

阅读全文 »

【PRML】Introduction 介绍

发表于 2018-07-17 | 分类于 Machine Learning ， PRML | 评论数： | 阅读次数：

Abstract: In this section, we introduct what is Machine Learning, pattern recognition, and some relative concepts.
Keywords: “Pattern Recognition and Machine Learning”, Introduction, Machine Learning

阅读全文 »