What is Reinforcement Learning?

Reinforcement learning is the training of a machine learning agent to make a sequence of decisions. The agent learns to achieve its goal in an uncertain and potentially complex environment. To steer the agent toward what the programmer wants, it receives rewards or penalties based on the actions it performs.

A simple way to understand reinforcement learning is to compare a preprogrammed agent with a reinforcement learning agent.

Let's take the example of a robot dog.
A preprogrammed robot dog has a hard-coded algorithm that includes predefined methods such as sit, stand, and walk.
Such a robot can only perform the actions that are defined in it.
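
As a rough illustration of the preprogrammed case, here is a minimal Python sketch. The class name PreprogrammedDog, the perform method, and the returned strings are all hypothetical and chosen only for this example; the point is that every behavior is written by hand, and any command outside the fixed list is simply not understood.

```python
class PreprogrammedDog:
    """Hypothetical robot dog whose behaviors are all hard-coded."""

    def __init__(self):
        # The full set of things this dog can ever do is fixed here.
        self._actions = {
            "sit": self.sit,
            "stand": self.stand,
            "walk": self.walk,
        }

    def sit(self):
        return "sitting"

    def stand(self):
        return "standing"

    def walk(self):
        return "walking"

    def perform(self, command):
        # Look up the command; anything not predefined cannot be performed.
        action = self._actions.get(command)
        if action is None:
            return f"unknown command: {command}"
        return action()


dog = PreprogrammedDog()
print(dog.perform("sit"))
print(dog.perform("roll_over"))
```

The dog responds to "sit" because a method for it exists, but it has no way to handle "roll_over" unless a programmer adds that method.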

A robot dog driven by a reinforcement learning algorithm, on the other hand, has no predefined methods that tell it which action to perform.
Instead, it learns what the programmer wants from the digital rewards or penalties it receives.
The main goal of the machine is to maximize its total reward.

So the dog will perform actions like sit, stand, and walk to earn rewards,
while avoiding actions like falling down, which lead to penalties.
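
The reward-and-penalty idea can be sketched in a few lines of Python. This is only a toy, not how a real robot would be trained: the reward values (+1 for sit, stand, and walk, -1 for falling down), the train function, and the epsilon-greedy strategy (mostly pick the best-known action, occasionally try a random one) are assumptions made for this example. The dog starts out knowing nothing about which actions are good and builds up an estimate from the feedback it receives.

```python
import random

ACTIONS = ["sit", "stand", "walk", "fall_down"]

# Assumed reward signal for this sketch: +1 for desired actions, -1 for falling.
REWARDS = {"sit": 1.0, "stand": 1.0, "walk": 1.0, "fall_down": -1.0}


def train(episodes=500, epsilon=0.1, seed=0):
    """Learn an average reward estimate per action using epsilon-greedy choice."""
    rng = random.Random(seed)
    values = {a: 0.0 for a in ACTIONS}  # estimated reward of each action
    counts = {a: 0 for a in ACTIONS}    # how often each action was tried
    total_reward = 0.0

    for _ in range(episodes):
        if rng.random() < epsilon:
            # Explore: try a random action to see what feedback it gives.
            action = rng.choice(ACTIONS)
        else:
            # Exploit: pick the action with the highest estimated reward so far.
            action = max(ACTIONS, key=lambda a: values[a])

        reward = REWARDS[action]
        counts[action] += 1
        # Incremental update of the running average reward for this action.
        values[action] += (reward - values[action]) / counts[action]
        total_reward += reward

    return values, counts, total_reward


values, counts, total_reward = train()
print("estimated values:", values)
print("times tried:", counts)
print("total reward:", total_reward)
```

Nothing in the code says "do not fall down". The dog ends up avoiding that action only because its estimated reward is lower than the others, which is the difference from the preprogrammed approach.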

Today, reinforcement learning is one of the more effective ways to let machines find their own, sometimes creative, solutions to a task.
