CDS 110b: Optimal Control: Difference between revisions

From Murray Wiki
Jump to navigationJump to search
No edit summary
Line 26: Line 26:


<p>Jeremy Gillula, 07 Jan 05</p>
<p>Jeremy Gillula, 07 Jan 05</p>
</blockquote>
'''Q: What do you mean by penalizing something, from Q>=0 "penalizes" state error?'''
<blockquote>
<p>According to the form of the quadratic cost function <math>J</math>, there are three quadratic terms such
as <math>x^T Q x</math>, <math>u^T R u</math>, and <math>x(T)^T P_1 x(T)</math>. When <math>Q \geq 0</math> and if <math>Q</math> is relative big, the value of <math>x</math> will have bigger contribution to the value of <math>J</math>. In order to keep <math>J</math> small, <math>x</math> must be relatively small. So  selecting a big <math>Q</math> can keep <math>x</math> in small value regions. This is what the "penalizing" means.</p>
<p>So in the optimal control design, the relative values of <math>Q</math>, <math>R</math>, and <math>P_1</math> represent how important  <math>X</math>, <math>U</math>, and <math>X(T)</math> are in the designer's concerns.</p>
<p>Zhipu Jin,13 Jan 03</p>
</blockquote>
</blockquote>

Revision as of 06:23, 15 January 2008

CDS 110b Schedule Project Course Text

This lecture provides an overview of optimal control theory. Beginning with a review of optimization, we introduce the notion of Lagrange multipliers and provide a summary of the Pontryagin's maximum principle.

References and Further Reading

Frequently Asked Questions

Q: In the example on Bang-Bang control discussed in the lecture, how is the control law for u obtained?

Pontryagin's Maximum Principle says that u has to be chosen to minimise the Hamiltonian H(x,u,λ) for given values of x and λ. In the example, H=1+(λTA)x+(λTB)u. At first glance, it seems that the more negative u is the more H will be minimised. And since the most negative value of u allowed is 1, u=1. However, the co-efficient of u may be of either sign. Therefore, the sign of u has to be chosen such that the sign of the term (λTB)u is negative. That's how we come up with u=sign(λTB).

Shaunak Sen, 12 Jan 06

Q: Notation question for you: In the Lecture notes from Wednesday, I'm assuming that T is the final time and T (superscript T) is a transpose operation. Am I correct in my assumption?

Yes, you are correct.

Jeremy Gillula, 07 Jan 05