Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero
(eBook)

Book Cover
Average Rating
Published
Packt Publishing, 2018.
Format
eBook
Status
Available Online

Description

Loading Description...

Also in this Series

Checking series information...

More Like This

Loading more titles like this title...

More Details

Physical Description
0m 0s
Language
English
ISBN
9781788839303

Reviews from GoodReads

Loading GoodReads Reviews.

Citations

APA Citation, 7th Edition (style guide)

Maxim Lapan., & Maxim Lapan|AUTHOR. (2018). Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero . Packt Publishing.

Chicago / Turabian - Author Date Citation, 17th Edition (style guide)

Maxim Lapan and Maxim Lapan|AUTHOR. 2018. Deep Reinforcement Learning Hands-On: Apply Modern RL Methods, With Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero. Packt Publishing.

Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)

Maxim Lapan and Maxim Lapan|AUTHOR. Deep Reinforcement Learning Hands-On: Apply Modern RL Methods, With Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero Packt Publishing, 2018.

MLA Citation, 9th Edition (style guide)

Maxim Lapan, and Maxim Lapan|AUTHOR. Deep Reinforcement Learning Hands-On: Apply Modern RL Methods, With Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero Packt Publishing, 2018.

Note! Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy. Citation formats are based on standards as of August 2021.

Staff View

Go To Grouped Work

Grouping Information

Grouped Work IDb928d81b-c5be-9d6e-bba3-ba4b5b8cda39-eng
Full titledeep reinforcement learning hands on apply modern rl methods with deep q networks value iteration policy gradients trpo alphago zero
Authorlapan maxim
Grouping Categorybook
Last Update2024-06-25 21:00:43PM
Last Indexed2024-06-26 04:40:21AM

Hoopla Extract Information

stdClass Object
(
    [year] => 2018
    [artist] => Maxim Lapan
    [fiction] => 
    [coverImageUrl] => https://cover.hoopladigital.com/dra_9781788839303_270.jpeg
    [titleId] => 13589389
    [isbn] => 9781788839303
    [abridged] => 
    [language] => ENGLISH
    [profanity] => 
    [title] => Deep Reinforcement Learning Hands-On
    [demo] => 
    [segments] => Array
        (
        )

    [duration] => 0m 0s
    [children] => 
    [artists] => Array
        (
            [0] => stdClass Object
                (
                    [name] => Maxim Lapan
                    [artistFormal] => Lapan, Maxim
                    [relationship] => AUTHOR
                )

        )

    [genres] => Array
        (
            [0] => Algorithms
            [1] => Artificial Intelligence
            [2] => Computers
            [3] => Memory Management
            [4] => Natural Language Processing
            [5] => Programming
        )

    [price] => 1.35
    [id] => 13589389
    [edited] => 
    [kind] => EBOOK
    [active] => 1
    [upc] => 
    [synopsis] => New edition of the bestselling guide to deep reinforcement learning and how it's used to solve complex real-world problems. Revised and expanded to include multi-agent methods, discrete optimization, RL in robotics, advanced exploration techniques, and more

Key Features
Second edition of the bestselling introduction to deep reinforcement learning, expanded with six new chapters
Learn advanced exploration techniques including noisy networks, pseudo-count, and network distillation methods
Apply RL methods to cheap hardware robotics platforms
Book Description
Deep Reinforcement Learning Hands-On, Second Edition is an updated and expanded version of the bestselling guide to the very latest reinforcement learning (RL) tools and techniques. It provides you with an introduction to the fundamentals of RL, along with the hands-on ability to code intelligent learning agents to perform a range of practical tasks.

With six new chapters devoted to a variety of up-to-the-minute developments in RL, including discrete optimization (solving the Rubik's Cube), multi-agent methods, Microsoft's TextWorld environment, advanced exploration techniques, and more, you will come away from this book with a deep understanding of the latest innovations in this emerging field.

In addition, you will gain actionable insights into such topic areas as deep Q-networks, policy gradient methods, continuous control problems, and highly scalable, non-gradient methods. You will also discover how to build a real hardware robot trained with RL for less than $100 and solve the Pong environment in just 30 minutes of training using step-by-step code optimization.

In short, Deep Reinforcement Learning Hands-On, Second Edition, is your companion to navigating the exciting complexities of RL as it helps you attain experience and knowledge through real-world examples.

What you will learn
Understand the deep learning context of RL and implement complex deep learning models
Evaluate RL methods including cross-entropy, DQN, actor-critic, TRPO, PPO, DDPG, D4PG, and others
Build a practical hardware robot trained with RL methods for less than $100
Discover Microsoft's TextWorld environment, which is an interactive fiction games platform
Use discrete optimization in RL to solve a Rubik's Cube
Teach your agent to play Connect 4 using AlphaGo Zero
Explore the very latest deep RL research on topics including AI chatbots
Discover advanced exploration techniques, including noisy networks and network distillation techniques
Who this book is for
Some fluency in Python is assumed. Sound understanding of the fundamentals of deep learning will be helpful. This book is an introduction to deep RL and requires no background in RL.
    [url] => https://www.hoopladigital.com/title/13589389
    [pa] => 
    [subtitle] => Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero
    [publisher] => Packt Publishing
    [purchaseModel] => INSTANT
)