[Get it solved] Consider the following grid environment. Starting from an...

Check Out Our Work & Get Yours Done

Submit Work

Download Sample

Enroll in the complete course for only $250 USD*

Order Now

Submit work Offers

Consider the following grid environment. Starting from any unshaded square, you can move up, down, left, or right.

computer science

Description

For submission instructions please refer to website. For all problems, if you use an existing result from either the literature or a textbook to solve the exercise, you need to cite the source.

1 Gridworld [15 pts]

Consider the following grid environment. Starting from any unshaded square, you can move up, down, left, or right. Actions are deterministic and always succeed (e.g. going left from state 16 goes to state 15) unless they will cause the agent to run into a wall. The thicker edges indicate walls, and attempting to move in the direction of a wall results in staying in the same square (e.g. going in any direction other than left from state 16 stays in 16). Taking any action from the green target square (no. 12) earns a reward of rg (so r(12, a) = rg ∀a) and ends the episode . Taking any action from the red square of death (no. 5) earns a reward of rr (so r(5, a) = rr ∀a) and ends the episode. Otherwise, from every other square, taking any action is associated with a reward rs ∈ {−1, 0, +1} (even if the action results in the agent staying in the same square). Assume the discount factor γ = 1, rg = +5, and rr = −5 unless otherwise specified.

(a) (3pts) Define the value of rs that would cause the optimal policy to return the shortest path to the green target square (no. 12). Using this rs, find the optimal value for each square.

(b) (3pts) Lets refer to the value function derived in (a) as V πg old and the policy as πg. Suppose we are now in a new gridworld where all the rewards (rs, rg, and rr) have +2 added to them. Consider still following πg of the original gridworld, what will the new values V πg new be in this second gridworld?

Related Questions in computer science category

These all need to be done using recursion and exactly as stated in the instructions. I would also like comments in the code.

case study on JAD Sessions

For this assignment you will write a Java class that manages an array list of SimpleStudent objects. The SimpleStudent class is supplied; make no changes to this class.

SAP GENERAL PURPOSE DATA STRUCTURES

Quota is the number of spaces you need to store, whether from or Linux Quota is the number of spaces you need to hold, whether from The volume of file storage space is determined by the storage space accessible on the ECS storage arrays as well as by the

A+ Solution not a D+ one the whole thing to reflect quality work

IT 511 Final Project Guidelines and Rubric (Solved)

Java application using NetBeans IDE

Design a class named NearestPoints that contains methods to solve the nearest point problem using both approaches-Naive and neighbor preserving hash functions

EHRs, EMRs, and PHRs: What Are They and How Do They Relate to Each Other?

Get Higher Grades Now

Tutors Online

Description

Drop Files Here Or Click to Upload

Get Free Quote!

421 Experts Online

We Provide Services Across The Globe

Disclaimer: The reference papers or solutions provided by Calltutors.com serve as model papers or solutions for students or professionals and are not to be submitted as it is to any institutions. These documents are intended to be used for research and reference purposes only. University and company's logo's are the property of respected owners. We don't have affiliation with the mentioned universities. By using our services means, you agree to our Honor Code , Privacy Policy , Terms & Conditions , Payment , Refund & Cancellation Policy.

Enroll in the complete course for only $250 USD*

Consider the following grid environment. Starting from any unshaded square, you can move up, down, left, or right.

computer science

Description

Get instant assignment help service

Related Questions in computer science category

Policy

Exploring

Other

Connect With Us

We Provide Services Across The Globe