Skip to main content

What is Reinforcement learning?

Reinforcement learning is training of machine learning agent to make sequence of decisions. The machine learning agent learn to achieve its goal in uncertain and potentially complex environments. To make the machine do what the programmer wants, AI either get rewards or penalties based on the action it performs.

To understand reinforcement learning simply we can differentiate between preprogrammed agent and reinforcement learning agent.

So let's take the example of robot dog.
A robot dog which is preprogrammed already has the hard coded algorithm in it which includes various predefined methods like sit, stand, walk, etc.
So the robot can only perform the actions which are defined in it.

On the other hand if we take the example of robot dog which has reinforcement learning algorithm, that dog do not have any predefined methods which leads him to perform action.
The robot dog will learn  what the programmer wants based on getting rewards or penalties digitally.
The main goal of the machine is to maximize total number of records.

So the dog will perform the action like sit, stand, walk, etc to get rewards.
By avoiding the actions like falling down, etc which push towards penalties.

Currently reinforcement learning is very efficient in machines creativity.

Comments

Post a Comment

Popular posts from this blog

OOP USING C++ ROADMAP BY LOVE BABBAR

  What is Object-Oriented Programming? Object-oriented programming is a programming paradigm based on the concept of "objects", which can contain data and code: data in the form of fields, and code, in the form of procedures. A feature of objects is that an object's own procedures can access and often modify the data fields of itself. Object Oriented Programming is considered as a design methodology for building non-rigid software. In OOPS, every logic is written to get our work done, but represented in form of Objects. OOP allows us to break our problems into small unit of work that is represented via objects and their functions. We build functions around  objects.   There are mainly four pillars (features) of OOP. If all of these four features are presented in programming, the programming is called  perfect Object Oriented Programming. Abstraction Encapsulation Inheritance Polymorphism disadvantages of object-oriented programming include: S...

PI Academy Website Using HTML, CSS, PHP, Database.

Introduction: Website for a home tuition advertisement along with the student form and the details of the standard and classes supported. This website is officially hosted on the Internet,  click here  to check the website. It contains a form for the student details and the login credential only for the teacher who can login and check the student details who fill the form. Here a single database with one table is used to store the details of the student.     ` To get the code of the project click on the below links:  GitHub:   https://github.com/AmreenKhan1003/PI-Academy-Website.git

Mutimedia System Architecture, IMA, Network

Multimedia System Architecture Multimedia encompasses a large variety of technologies and integration of multiple architectures interacting in real-time. The   following figure describes the architecture of a multimedia workstation environment. The left side shows non-multimedia systems. The right side shows multimedia extensions, multimedia driver support, Add-on multimedia devices and peripherals for supporting multimedia applications. For each special devices such as scanners, video cameras, VCRs and sound equipment-, a software device driver needs to provide the interface from an application to the device. The GUI requires control extensions to support applications such as full-motion video. The various graphics standards such as MCA, GGA and XGA have demonstrated the increasing demands for higher resolutions for GUls. Combined graphics and imaging applications require functionality at three levels. They are provided...