Mikhail Yakovlevich Suslin: Difference between revisions

Revision as of 22:02, 26 October 2013

The theory of optimal control is concerned with operating a dynamic system at minimum cost. The case where the system dynamics are described by a set of linear differential equations and the cost is described by a quadratic function is called the LQ problem. One of the main results in the theory is that the solution is provided by the linear-quadratic regulator (LQR), a feedback controller whose equations are given below. The LQR is an important part of the solution to the LQG problem. Like the LQR problem itself, the LQG problem is one of the most fundamental problems in control theory.

General description

In LQR design, the settings of a (regulating) controller governing a machine or process (such as an airplane or a chemical reactor) are found by a mathematical algorithm that minimizes a cost function whose weighting factors are supplied by a human engineer. The cost function is often defined as a weighted sum of the squared deviations of key measurements from their desired values. The algorithm thus finds the controller settings that minimize undesired deviations, such as deviations from a desired altitude or process temperature. The magnitude of the control action itself is often included in the sum as well, which limits the energy expended by the control action.

In effect, the LQR algorithm takes care of the tedious work done by the control systems engineer in optimizing the controller. However, the engineer still needs to specify the weighting factors and compare the results with the specified design goals. Often this means that controller synthesis will still be an iterative process where the engineer judges the produced "optimal" controllers through simulation and then adjusts the weighting factors to get a controller more in line with the specified design goals.
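The effect of the weighting factors can be seen on a scalar example. The sketch below assumes a hypothetical first-order plant dx/dt = a x + b u with cost integrand q x^2 + r u^2 over an infinite horizon, for which the algebraic Riccati equation has a closed-form positive root; the plant numbers and the function name `scalar_lqr_gain` are illustrative only.

```python
import math

def scalar_lqr_gain(a, b, q, r):
    """Gain K for dx/dt = a*x + b*u with cost integrand q*x**2 + r*u**2."""
    # Positive root of the scalar algebraic Riccati equation
    #   2*a*p - (b**2 / r) * p**2 + q = 0
    p = r * (a + math.sqrt(a**2 + b**2 * q / r)) / b**2
    # Feedback gain K = R^-1 B^T P, written out for scalars.
    return b * p / r

# Illustrative, open-loop unstable plant (assumed values).
a, b, q = 1.0, 1.0, 1.0
results = []
for r in (0.01, 1.0, 100.0):
    k = scalar_lqr_gain(a, b, q, r)
    pole = a - b * k  # closed-loop pole of dx/dt = (a - b*K) x
    results.append((r, k, pole))
    print(f"r = {r:>6}: K = {k:.3f}, closed-loop pole = {pole:.3f}")
```

Raising the control weight r relative to q yields a smaller gain and a slower closed loop, which is the kind of trade-off the engineer adjusts between design iterations.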

The LQR algorithm is, at its core, just an automated way of finding an appropriate state-feedback controller. It is therefore not uncommon for control engineers to prefer alternative methods, such as full state feedback (also known as pole placement), in which the link between the adjusted parameters and the resulting changes in controller behavior is much clearer. The difficulty of finding the right weighting factors limits the application of LQR-based controller synthesis.

Finite-horizon, continuous-time LQR

For a continuous-time linear system, defined on $t \in [t_{0}, t_{1}]$, described by

\dot{x} = A x + B u

with a quadratic cost function defined as

J = \frac{1}{2} x^{T} (t_{1}) F (t_{1}) x (t_{1}) + \frac{1}{2} \int_{t_{0}}^{t_{1}} (x^{T} Q x + u^{T} R u) d t

the feedback control law that minimizes the value of the cost is

u = - K x

where $K$ is given by

K = R^{- 1} B^{T} P (t)

and $P$ is found by solving the continuous-time Riccati differential equation

A^{T} P (t) + P (t) A - P (t) B R^{- 1} B^{T} P (t) + Q = - \dot{P} (t)

with the boundary condition

P (t_{1}) = F (t_{1}) .
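Because the boundary condition is given at the final time, the Riccati differential equation is integrated backward from $t_{1}$ to $t_{0}$. The following is a minimal sketch using a fixed-step fourth-order Runge-Kutta scheme; the double-integrator plant, the weights, the step count, and the function names are assumptions chosen for illustration, and a production design would more likely use an adaptive ODE solver.

```python
import numpy as np

def riccati_rhs(P, A, B, Q, R_inv):
    """A'P + PA - P B R^-1 B' P + Q, which equals -dP/dt."""
    return A.T @ P + P @ A - P @ B @ R_inv @ B.T @ P + Q

def finite_horizon_lqr(A, B, Q, R, F, t0, t1, steps=2000):
    """Integrate the Riccati equation backward from P(t1) = F with RK4.

    Returns the time grid (ascending), the list of P(t), and the gains K(t).
    """
    R_inv = np.linalg.inv(R)
    h = (t1 - t0) / steps
    P = np.array(F, dtype=float)
    Ps = [P]
    for _ in range(steps):
        # In reversed time s = t1 - t, dP/ds = riccati_rhs(P).
        k1 = riccati_rhs(P, A, B, Q, R_inv)
        k2 = riccati_rhs(P + 0.5 * h * k1, A, B, Q, R_inv)
        k3 = riccati_rhs(P + 0.5 * h * k2, A, B, Q, R_inv)
        k4 = riccati_rhs(P + h * k3, A, B, Q, R_inv)
        P = P + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
        Ps.append(P)
    Ps.reverse()  # Ps[0] is P(t0), Ps[-1] is P(t1)
    ts = np.linspace(t0, t1, steps + 1)
    Ks = [R_inv @ B.T @ P_t for P_t in Ps]
    return ts, Ps, Ks

# Illustrative double integrator (assumed example, not from the text).
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
F = np.zeros((2, 2))
ts, Ps, Ks = finite_horizon_lqr(A, B, Q, R, F, t0=0.0, t1=10.0)
print("K(t0) =", Ks[0])
print("K(t1) =", Ks[-1])
```

The gain is time-varying: it vanishes at $t_{1}$ when $F = 0$, and far from the final time it settles toward a constant value.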

The first order conditions for J_min are

(i) State equation

\dot{x} = A x + B u

(ii) Co-state equation

- \dot{λ} = Q x + A^{T} λ

(iii) Stationary equation

0 = R u + B^{T} λ

(iv) Boundary conditions

x (t_{0}) = x_{0}

and $λ (t_{1}) = F (t_{1}) x (t_{1})$
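One way to connect conditions (iii) and (iv) to the feedback law is sketched here, under the assumptions that $R$ is invertible and that the co-state is linear in the state, $λ (t) = P (t) x (t)$ (a standard ansatz that is not stated explicitly above). Solving the stationary equation for $u$ and substituting the ansatz gives

u = - R^{- 1} B^{T} λ = - R^{- 1} B^{T} P (t) x = - K x

which is the feedback law with $K = R^{- 1} B^{T} P (t)$. Evaluating the ansatz at $t = t_{1}$ and comparing with the boundary condition $λ (t_{1}) = F (t_{1}) x (t_{1})$ gives $P (t_{1}) = F (t_{1})$, the boundary condition of the Riccati differential equation.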

Infinite-horizon, continuous-time LQR

For a continuous-time linear system described by

\dot{x} = A x + B u

with a cost functional defined as

J = \int_{0}^{\infty} (x^{T} Q x + u^{T} R u) d t

the feedback control law that minimizes the value of the cost is

u = - K x

where $K$ is given by

K = R^{- 1} B^{T} P

and $P$ is found by solving the continuous time algebraic Riccati equation

A^{T} P + P A - P B R^{- 1} B^{T} P + Q = 0
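The algebraic Riccati equation can be solved numerically in several ways. The sketch below uses one classical approach, the stable invariant subspace of the associated Hamiltonian matrix, with NumPy only; it assumes the Hamiltonian has exactly n eigenvalues in the open left half-plane (which holds, for example, when the pair (A, B) is stabilizable and the pair (A, Q) is detectable). The function name `solve_care` and the double-integrator example are illustrative. Libraries such as SciPy also provide a dedicated routine (`scipy.linalg.solve_continuous_are`).

```python
import numpy as np

def solve_care(A, B, Q, R):
    """Stabilizing solution P of A'P + PA - P B R^-1 B' P + Q = 0."""
    n = A.shape[0]
    R_inv = np.linalg.inv(R)
    # Hamiltonian matrix of the LQ problem.
    H = np.block([[A, -B @ R_inv @ B.T],
                  [-Q, -A.T]])
    eigvals, eigvecs = np.linalg.eig(H)
    # Keep the eigenvectors that belong to stable eigenvalues.
    stable = eigvecs[:, eigvals.real < 0]
    if stable.shape[1] != n:
        raise ValueError("expected exactly n stable eigenvalues")
    X1, X2 = stable[:n, :], stable[n:, :]
    P = np.real(X2 @ np.linalg.inv(X1))
    return 0.5 * (P + P.T)  # remove round-off asymmetry

# Illustrative double integrator (assumed example).
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])

P = solve_care(A, B, Q, R)
K = np.linalg.inv(R) @ B.T @ P
residual = A.T @ P + P @ A - P @ B @ np.linalg.inv(R) @ B.T @ P + Q
print("P =", P)
print("K =", K)
print("closed-loop eigenvalues:", np.linalg.eigvals(A - B @ K))
```

For this example the equation can also be solved by hand, which gives a convenient check on the numerical result.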

Finite-horizon, discrete-time LQR

For a discrete-time linear system described by ^[1]

x_{k} = A x_{k - 1} + B u_{k}

with a performance index defined as

J = \sum_{k = 0}^{N} (x_{k}^{T} Q x_{k} + u_{k}^{T} R u_{k})

the optimal control sequence minimizing the performance index is given by

u_{k} = - F_{k} x_{k - 1}

where

F_{k} = (R + B^{T} P_{k} B)^{- 1} B^{T} P_{k} A

and $P_{k}$ is found iteratively backwards in time by the dynamic Riccati equation

P_{k - 1} = Q + A^{T} (P_{k} - P_{k} B (R + B^{T} P_{k} B)^{- 1} B^{T} P_{k}) A

from the terminal condition $P_{N} = Q$.
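The backward recursion is short to implement. The sketch below follows the indexing used above, in which $u_{k}$ drives $x_{k - 1}$ to $x_{k}$; it assumes that the $k = 0$ term of the performance index contributes only $x_{0}^{T} Q x_{0}$, since no $u_{0}$ acts in this indexing. The plant matrices, the horizon, and the function names are illustrative. The simulation checks that the accumulated cost equals $x_{0}^{T} P_{0} x_{0}$, as dynamic programming predicts.

```python
import numpy as np

def finite_horizon_dlqr(A, B, Q, R, N):
    """Backward Riccati recursion; returns P_0..P_N and gains F_1..F_N."""
    P_seq = [None] * (N + 1)
    gains = [None] * (N + 1)  # gains[0] is unused in this indexing
    P_seq[N] = Q.copy()
    for k in range(N, 0, -1):
        Pk = P_seq[k]
        S = R + B.T @ Pk @ B
        gains[k] = np.linalg.solve(S, B.T @ Pk @ A)
        P_seq[k - 1] = Q + A.T @ (Pk - Pk @ B @ np.linalg.solve(S, B.T @ Pk)) @ A
    return P_seq, gains

def simulate(A, B, Q, R, gains, x0):
    """Apply u_k = -F_k x_{k-1} for k = 1..N and accumulate the cost."""
    x = x0
    cost = x @ Q @ x
    for k in range(1, len(gains)):
        u = -gains[k] @ x
        x = A @ x + B @ u
        cost += x @ Q @ x + u @ R @ u
    return x, cost

# Illustrative sampled double integrator (assumed values).
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.005], [0.1]])
Q = np.eye(2)
R = np.array([[0.1]])
N = 50
x0 = np.array([1.0, 0.0])

P_seq, gains = finite_horizon_dlqr(A, B, Q, R, N)
x_final, cost = simulate(A, B, Q, R, gains, x0)
print("simulated cost:", cost)
print("x0' P_0 x0:    ", x0 @ P_seq[0] @ x0)
```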

Infinite-horizon, discrete-time LQR

For a discrete-time linear system described by

x_{k + 1} = A x_{k} + B u_{k}

with a performance index defined as

J = \sum_{k = 0}^{\infty} (x_{k}^{T} Q x_{k} + u_{k}^{T} R u_{k})

the optimal control sequence minimizing the performance index is given by

u_{k} = - F x_{k}

where

F = (R + B^{T} P B)^{- 1} B^{T} P A

and $P$ is the unique positive definite solution to the discrete time algebraic Riccati equation (DARE)

P = Q + A^{T} (P - P B (R + B^{T} P B)^{- 1} B^{T} P) A

Note that one way to solve this equation is by iterating the dynamic Riccati equation of the finite-horizon case until it converges.
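A minimal sketch of that fixed-point iteration follows. The tolerance, the iteration cap, the starting point $P = Q$, and the scalar example are assumptions made for illustration; for the chosen scalar plant the DARE reduces to $P^{2} - P - 1 = 0$, so the result can be checked by hand.

```python
import numpy as np

def solve_dare_by_iteration(A, B, Q, R, tol=1e-12, max_iter=10000):
    """Iterate the dynamic Riccati equation until P stops changing."""
    P = Q.copy()
    for _ in range(max_iter):
        S = R + B.T @ P @ B
        P_next = Q + A.T @ (P - P @ B @ np.linalg.solve(S, B.T @ P)) @ A
        if np.max(np.abs(P_next - P)) < tol:
            return P_next
        P = P_next
    raise RuntimeError("Riccati iteration did not converge")

# Scalar example x_{k+1} = x_k + u_k with Q = R = 1 (assumed values).
A = np.array([[1.0]])
B = np.array([[1.0]])
Q = np.array([[1.0]])
R = np.array([[1.0]])

P = solve_dare_by_iteration(A, B, Q, R)
F = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
print("P =", P)
print("F =", F)
print("closed-loop eigenvalues:", np.linalg.eigvals(A - B @ F))
```

The iteration is simple but can converge slowly when the closed loop has eigenvalues near the unit circle; dedicated solvers such as `scipy.linalg.solve_discrete_are` avoid this.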

References

↑ Chow, Gregory C. (1986). Analysis and Control of Dynamic Economic Systems. Krieger Publ. Co. ISBN 0-89874-969-7.

Kwakernaak, Huibert; Sivan, Raphael (1972). Linear Optimal Control Systems. First Edition. Wiley-Interscience. ISBN 0-471-51110-2.

Sontag, Eduardo (1998). Mathematical Control Theory: Deterministic Finite Dimensional Systems. Second Edition. Springer. ISBN 0-387-98489-5.

External links

MATLAB function for Linear Quadratic Regulator design: http://www.mathworks.com/help/toolbox/control/ref/lqr.html

Mathematica function for Linear Quadratic Regulator design: http://reference.wolfram.com/mathematica/ref/LQRegulatorGains.html
