Name: Robot Learning Homework 4 Solved
SKU: 28240
Price: 30.00 USD
Availability: InStock

Description

Name, Surname, ID Number

Problem 4.1 Trajectory Generation with Dynamical Systems [38 Points]

In this exercise we will use the Dynamic Motor Primitives (DMPs), described by the following dynamical system,

$\ddot{y} = \tau^2 \left( \alpha \left( \beta (g - y) - \dot{y}/\tau \right) + f_w(z) \right)$,	(1)
$\dot{z} = -\tau \alpha_z z$,	(2)

where y is the state of the system, and $\dot{y}$ and $\ddot{y}$ are its first and second time derivatives, respectively. The attractor's goal is denoted by g and the forcing function by $f_w$. The parameters $\alpha$ and $\beta$ control the spring-damper system. The phase variable is denoted by z and the temporal scaling coefficient by $\tau$. The forcing function $f_w$ is given by

$f_w(z) = \frac{\sum_{i=0}^{K} \phi_i(z) w_i z}{\sum_{j=0}^{K} \phi_j(z)} = \psi(z)^T w$, with $\psi_i(z) = \frac{\phi_i(z) z}{\sum_{j=0}^{K} \phi_j(z)}$,	(3)

where the basis functions $\phi_i(z)$ are Gaussians given by

$\phi_i(z) = \exp\left( -0.5 (z - c_i)^2 / h_i \right)$,	(4)

where the centers $c_i$ are equally distributed in the phase z, and the width $h_i$ is an open parameter. For the programming exercises, a basic environment of a double-link pendulum is provided, as well as the computation of the $\psi_i(z)$.
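As an illustration of Equations (1)-(4), the following is a minimal single-DoF sketch, not the provided environment. The function names `dmp_basis` and `dmp_rollout`, the center placement on [0, 1], the width choice, the initial phase z(0) = 1 and the semi-implicit Euler step are all assumptions made for this example.

```python
import numpy as np

def dmp_basis(z, centers, widths):
    """psi(z) of Eq. (3): Gaussians of Eq. (4), normalized and scaled by z."""
    z = np.atleast_1d(z)[:, None]                       # (T, 1)
    phi = np.exp(-0.5 * (z - centers) ** 2 / widths)    # (T, K)
    return phi * z / phi.sum(axis=1, keepdims=True)

def dmp_rollout(w, y0, g, n_steps, dt, centers, widths,
                alpha=25.0, beta=6.25, alpha_z=1.0, tau=1.0):
    """Integrate Eqs. (1)-(2) with a semi-implicit Euler step."""
    y, yd, z = float(y0), 0.0, 1.0       # assumption: start at rest, z(0) = 1
    ys = np.empty(n_steps)
    for k in range(n_steps):
        ys[k] = y
        f = (dmp_basis(z, centers, widths) @ w)[0]                  # f_w(z)
        ydd = tau**2 * (alpha * (beta * (g - y) - yd / tau) + f)    # Eq. (1)
        yd += ydd * dt
        y += yd * dt
        z += -tau * alpha_z * z * dt                                # Eq. (2)
    return ys

K = 50
centers = np.linspace(0.0, 1.0, K)                      # assumed placement in z
widths = np.full(K, (centers[1] - centers[0]) ** 2)     # assumed width h
# With zero weights the forcing term vanishes and y is pulled to the goal g.
ys = dmp_rollout(np.zeros(K), y0=0.0, g=1.0, n_steps=1500, dt=0.002,
                 centers=centers, widths=widths)
print(ys[-1])
```

With all weights set to zero the rollout reduces to the plain spring-damper, which is a convenient sanity check before any training is done.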

a) Similarities to a PD controller [2 Points]

Transform Equation (1) to have a structure similar to that of a PD controller,

$\ddot{y}_z = K_P \left( y_z^{des} - y_z \right) + K_D \left( \dot{y}_z^{des} - \dot{y}_z \right) + u_{ff}$,	(5)

and write down what the quantities $K_P$, $K_D$, $y_z^{des}$ and $\dot{y}_z^{des}$ look like in terms of the DMP parameters.
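One possible way to see the correspondence, offered as a sketch rather than the reference solution, is to multiply out the brackets of Equation (1) and group the terms by y and its derivative:

```latex
\ddot{y} = \underbrace{\tau^2 \alpha \beta}_{K_P} \, (g - y)
         + \underbrace{\tau \alpha}_{K_D} \, (0 - \dot{y})
         + \underbrace{\tau^2 f_w(z)}_{u_{ff}}
```

Under this reading the desired position is the goal g, the desired velocity is zero, and the scaled forcing function plays the role of the feed-forward term.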

b) Stability [2 Points]

Explain why the DMPs are stable as $t \to \infty$ and what the equilibrium point is.
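A quick numerical check can support the argument. The phase of the second DMP equation decays exponentially, so the forcing function (which is proportional to z) vanishes and only the linear spring-damper remains. The sketch below, using the parameter values given later in this exercise and a state vector chosen for this example, computes the poles of that remaining linear system.

```python
import numpy as np

alpha, beta, tau = 25.0, 6.25, 1.0   # values used later in this exercise

# State x = [y - g, y_dot]; with f_w = 0, the DMP becomes x_dot = A x.
A = np.array([[0.0, 1.0],
              [-tau**2 * alpha * beta, -tau * alpha]])
eigvals = np.linalg.eigvals(A)
print(eigvals)
```

Both poles lie at -alpha * tau / 2 = -12.5 (up to numerical error), because beta = alpha / 4 makes the system critically damped. The state therefore converges to y = g with zero velocity.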

c) Double Pendulum – Training [12 Points]

Implement the DMPs and test them on the double pendulum environment. In order to train the DMPs, you have to solve Equation (1) for the forcing function. Before starting the execution, set the goal position g to be the same as in the demonstration. Then, set the parameters to $\alpha = 25$, $\beta = 6.25$, $\alpha_z = 3/T$, $\tau = 1$. Use N = 50 basis functions, equally distributed in z. Use the learned DMPs to control the robot and plot both the demonstrated trajectory and the reproduction from the DMPs in the same figure. You need to implement the DMP-based controller (dmpCtl.py) and the training function for the controller parameters (dmpTrain.py). To plot your results you can use dmpComparison.py. Refer to example.py to see how to call it.
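The training step can be sketched as a ridge regression on the forcing function. This is an assumption-laden example for one DoF, not the expected content of dmpTrain.py: the name `dmp_train`, the synthetic minimum-jerk demonstration, the finite-difference derivatives, the ridge factor, the basis placement and z(0) = 1 are all choices made here for illustration.

```python
import numpy as np

def dmp_basis(z, centers, widths):
    """Normalized Gaussian basis scaled by the phase z."""
    z = np.atleast_1d(z)[:, None]
    phi = np.exp(-0.5 * (z - centers) ** 2 / widths)
    return phi * z / phi.sum(axis=1, keepdims=True)

def dmp_train(q, dt, centers, widths, alpha=25.0, beta=6.25, tau=1.0,
              ridge=1e-6):
    """Fit forcing-function weights to one demonstrated trajectory q."""
    T = dt * (len(q) - 1)
    alpha_z = 3.0 / T
    t = dt * np.arange(len(q))
    qd = np.gradient(q, dt)              # finite-difference velocity
    qdd = np.gradient(qd, dt)            # finite-difference acceleration
    g = q[-1]                            # goal = final demonstrated position
    # The DMP equation solved for the forcing function:
    f_target = qdd / tau**2 - alpha * (beta * (g - q) - qd / tau)
    z = np.exp(-tau * alpha_z * t)       # closed-form phase, z(0) = 1
    Psi = dmp_basis(z, centers, widths)  # (n_steps, K)
    A = Psi.T @ Psi + ridge * np.eye(len(centers))
    w = np.linalg.solve(A, Psi.T @ f_target)
    return w, g, Psi, f_target

# Synthetic stand-in for a demonstration: minimum-jerk motion from 0 to 1.
t = np.linspace(0.0, 3.0, 1501)
s = t / t[-1]
q = 10 * s**3 - 15 * s**4 + 6 * s**5
K = 50
centers = np.linspace(0.0, 1.0, K)
widths = np.full(K, (centers[1] - centers[0]) ** 2)
w, g, Psi, f_target = dmp_train(q, t[1] - t[0], centers, widths)
print(np.linalg.norm(Psi @ w - f_target) / np.linalg.norm(f_target))
```

The printed ratio is the relative fit error of the forcing function. For the double pendulum the same regression would be run once per joint.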

d) Double Pendulum – Conditioning on the Final Position [3 Points]

Using the trained DMPs from the previous question, simulate the system with different goal positions: first with $q_{t=end} = \{0, 0.2\}$ and then with $q_{t=end} = \{0.8, 0.5\}$. Generate one figure per DoF. In each figure, plot the demonstrated trajectory and the reproduced trajectories with the different goal positions. How do you interpret the result?

e) Double Pendulum – Temporal Modulation [3 Points]

Using the trained DMPs from the previous question, simulate the system with different temporal scaling factors $\tau = \{0.5, 1.5\}$. Generate one figure per DoF and explain the result.
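The effect of the temporal scaling factor can be previewed from the phase equation alone. Assuming z(0) = 1, the phase has the closed form z(t) = exp(-tau * alpha_z * t), so the time needed to reach a given phase value scales with 1 / tau. The helper name `time_to_phase` and the demonstration length T = 3 s are hypothetical choices for this sketch.

```python
import numpy as np

def time_to_phase(z_target, tau, alpha_z):
    """Time at which z(t) = exp(-tau * alpha_z * t) reaches z_target."""
    return -np.log(z_target) / (tau * alpha_z)

T = 3.0                  # assumed demonstration length in seconds
alpha_z = 3.0 / T
z_end = np.exp(-3.0)     # phase value reached at t = T when tau = 1
durations = {tau: time_to_phase(z_end, tau, alpha_z) for tau in (0.5, 1.0, 1.5)}
print(durations)
```

A factor of 0.5 stretches the movement to twice the demonstrated duration, while 1.5 compresses it to two thirds; the shape in phase space is unchanged.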

f) Probabilistic Movement Primitives – Radial Basis Function [3 Points]

We now want to use ProMPs. Before we train them, we need to define some basis functions. We decide to use N = 30 radial basis functions (RBFs) with centers uniformly distributed in the time interval $[0 - 2b, T + 2b]$, where T is the end time of the demonstrations. The bandwidth of the Gaussian basis (std) is set to b = 0.2. Implement these basis functions in getProMPBasis.py. Do not forget to normalize the basis functions so that at every time point they sum to one! Attach a plot showing the basis functions over time.
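A minimal sketch of such a basis is shown below. The function name `promp_basis`, the (n_basis, n_steps) layout, and the time grid of 1500 steps at 0.002 s are assumptions for this example and need not match the interface expected by getProMPBasis.py.

```python
import numpy as np

def promp_basis(t, n_basis=30, bandwidth=0.2):
    """Normalized Gaussian RBFs; returns Phi with shape (n_basis, len(t))."""
    T = t[-1]
    # Centers uniformly spaced on [0 - 2b, T + 2b].
    centers = np.linspace(0.0 - 2 * bandwidth, T + 2 * bandwidth, n_basis)
    phi = np.exp(-0.5 * ((t[None, :] - centers[:, None]) / bandwidth) ** 2)
    # Normalize so that the basis functions sum to one at every time point.
    return phi / phi.sum(axis=0, keepdims=True)

t = 0.002 * np.arange(1500)      # assumed time grid
Phi = promp_basis(t)
print(Phi.shape, Phi.sum(axis=0)[:3])
```

Plotting each row of `Phi` against `t` gives the requested figure of the basis functions over time.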

g) Probabilistic Movement Primitives – Training [7 Points]

In this exercise you will train the ProMPs using the imitation learning data from getImitationData.py and the RBFs defined in the previous question. Modify proMP.py to estimate the weight vectors $w_i$ that reproduce the different demonstrations. Then, fit a Gaussian using all the weight vectors. Generate a plot showing the desired trajectory distribution over time (mean and std) as well as the trajectories used for imitation.
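The two steps, per-demonstration regression followed by a Gaussian fit, can be sketched as follows. The demonstrations here are synthetic sine curves standing in for the output of getImitationData.py, and the ridge factor, time grid and helper name `promp_basis` are likewise assumptions.

```python
import numpy as np

def promp_basis(t, n_basis=30, bandwidth=0.2):
    """Normalized Gaussian RBFs with shape (n_basis, len(t))."""
    centers = np.linspace(-2 * bandwidth, t[-1] + 2 * bandwidth, n_basis)
    phi = np.exp(-0.5 * ((t[None, :] - centers[:, None]) / bandwidth) ** 2)
    return phi / phi.sum(axis=0, keepdims=True)

rng = np.random.default_rng(0)
t = 0.002 * np.arange(1500)
Phi = promp_basis(t)                                      # (30, 1500)

# Synthetic stand-in demonstrations, shape (n_demos, n_steps).
amp = 1.0 + 0.1 * rng.standard_normal((20, 1))
demos = amp * np.sin(2 * np.pi * t / t[-1])[None, :]

# Ridge regression: one weight vector per demonstration.
ridge = 1e-6
A = Phi @ Phi.T + ridge * np.eye(Phi.shape[0])
W = np.linalg.solve(A, Phi @ demos.T).T                   # (n_demos, 30)

# Gaussian over the weights.
mu_w = W.mean(axis=0)
Sigma_w = np.cov(W, rowvar=False)

# Trajectory distribution: mean and pointwise standard deviation.
mean_traj = Phi.T @ mu_w
var_traj = np.einsum('it,ij,jt->t', Phi, Sigma_w, Phi)
std_traj = np.sqrt(np.maximum(var_traj, 0.0))             # clip round-off
```

For the plot, `mean_traj` with a band of plus or minus two `std_traj` can be drawn over `t`, with the rows of `demos` overlaid.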

h) Probabilistic Movement Primitives – Number of Basis Functions [2 Points]

Evaluate the effects of using a reduced number of RBFs. Generate two plots showing the desired trajectory distribution and the trajectories used for imitation as in the previous exercise, but this time use N = 20 and N = 10 basis functions. Briefly analyze your results.

i) Probabilistic Movement Primitives – Conditioning [4 Points]

Using Gaussian conditioning, calculate the new distribution over the weight vectors $w_i$ such that the trajectory has a via-point at position y = 3 at time t_cond = 1150 with variance $\Sigma_y = 0.0002$. Again use 30 basis functions.

Assuming that the probability over the weights is given by $\mathcal{N}(w \mid \mu_w, \Sigma_w)$ and the probability of being at that position is given by $\mathcal{N}(y \mid \Phi w, \Sigma_y)$, show how the new distribution over w is computed (what do the mean and variance look like?).

Then, in a single plot, show the previous distribution (learned from imitation) and the new distribution (after conditioning). Additionally, sample K = 10 random weight vectors from the ProMP, compute the trajectories and plot them in the same plot. Analyze briefly your results.
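The conditioning step can be sketched with the standard Gaussian update for a scalar observation. Several things below are assumptions for illustration: the prior (zero mean, identity covariance) is a stand-in for the distribution learned from imitation, t_cond = 1150 is read as a time-step index on a 1500-step grid, phi denotes the basis vector at that step (so the observation model is y = phi^T w plus noise), and the names `promp_basis` and `condition_promp` are hypothetical.

```python
import numpy as np

def promp_basis(t, n_basis=30, bandwidth=0.2):
    """Normalized Gaussian RBFs with shape (n_basis, len(t))."""
    centers = np.linspace(-2 * bandwidth, t[-1] + 2 * bandwidth, n_basis)
    phi = np.exp(-0.5 * ((t[None, :] - centers[:, None]) / bandwidth) ** 2)
    return phi / phi.sum(axis=0, keepdims=True)

def condition_promp(mu_w, Sigma_w, phi, y_star, sigma_y):
    """Condition N(w | mu_w, Sigma_w) on y_star = phi^T w + N(0, sigma_y)."""
    s = sigma_y + phi @ Sigma_w @ phi            # predictive variance of y
    gain = Sigma_w @ phi / s                     # Sigma_w phi / s
    mu_new = mu_w + gain * (y_star - phi @ mu_w)
    Sigma_new = Sigma_w - np.outer(gain, phi @ Sigma_w)
    return mu_new, 0.5 * (Sigma_new + Sigma_new.T)   # symmetrize round-off

rng = np.random.default_rng(0)
t = 0.002 * np.arange(1500)
Phi = promp_basis(t)
mu_w, Sigma_w = np.zeros(30), np.eye(30)         # stand-in prior
t_cond = 1150                                    # assumed to be a step index
phi = Phi[:, t_cond]
mu_new, Sigma_new = condition_promp(mu_w, Sigma_w, phi,
                                    y_star=3.0, sigma_y=0.0002)

# Sample K = 10 weight vectors and map them to trajectories.
samples = rng.multivariate_normal(mu_new, Sigma_new, size=10) @ Phi
print(phi @ mu_new, phi @ Sigma_new @ phi)
```

After conditioning, the mean at the via-point moves close to the target and the variance there drops below the observation variance, while time steps far from t_cond are affected only through the correlations encoded in the weight covariance.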
