Kullback-Leibler-Quadratic Optimal Control

3 Apr 2020  ·  Neil Cammardella, Ana Bušić, Sean Meyn ·

This paper presents advances in Kullback-Leibler-Quadratic (KLQ) optimal control: a stochastic control framework for Markovian models. The motivation is distributed control of large networks... As in prior work, the objective function is composed of a state cost in the form of Kullback-Leibler divergence plus a quadratic control cost. With this choice of objective function, the optimal probability distribution of a population of agents over a finite time horizon is shown to be an exponential tilting of the nominal probability distribution. The same is true for the controlled transition matrices that induce the optimal probability distribution. However, one limitation of the previous work is that randomness can only be introduced via the control policy; all uncontrolled (natural) processes must be modeled as deterministic to render them immutable under an exponential tilting. In this work, only the controlled dynamics are subject to tilting, allowing for more general probabilistic models. Another advancement is a reduction in complexity based on lossy compression using transform techniques. This is motivated by the need to consider time horizons that are much longer than the inter-sampling times required for reliable control. Numerical experiments are performed in a power network setting. The results show that the KLQ method enables the aggregate power consumption of a collection of flexible loads to track a time-varying reference signal, while simultaneously ensuring each individual load satisfies its own quality of service constraints. read more

PDF Abstract
No code implementations yet. Submit your code now


Optimization and Control