Property:Abstract
From Murray Wiki
Jump to navigationJump to search
This is a property of type Text.
R
We consider the maneuver planning problem for automated vehicles when they share the road with human- driven cars and interact with each other using a finite set of maneuvers. Each maneuver is calculated considering input constraints, actuator disturbances and sensor noise, so that we can use a maneuver automaton to perform high-level planning that is robust against low-level effects. In order to model the behavior of human-driven cars in response to the intent of the automated vehicle, we use control improvisation to build a probabilistic model. To accommodate for potential mismatches between the learned human model and human driving behaviors, we use a conditional value-at-risk objective function to obtain the optimal policy for the automated vehicle. We demonstrate through simulations that our motion planning framework allows an automated vehicle to exploit human behaviors with different levels of robustness. +
A
We consider the problem of attitude stabilization using exclusively visual sensory input, and we look for a solution which can satisfy the constraints of a ``bio-plausible'' computation. We obtain a PD controller which is a bilinear form of the goal image, and the current and delayed visual input. Moreover, this controller can be learned using classic neural networks algorithms. The structure of the resulting computation, derived from general principles by imposing a bilinear computation, has striking resemblances with existing models for visual information processing in insects (Reichardt Correlators and lobula plate tangential cells). We validate the algorithms using faithful simulations of the fruit fly visual input. +
C
We consider the problem of average consensus seeking in networked multi-agent systems. Based
on local information and a simple distributed algorithm, states of all agents automatically converge to
the average value of the initial conditions, where the convergence speed is determined by the algebraic
connectivity of the underlying communication network. In order to achieve an average consensus quickly,
we propose a new type of consensus protocol, multi-hop relay protocol, in which each agent expands
its knowledge by employing multi-hop communication links. We explicitly show that multi-hop relay
protocol increases the convergence speed without physically changing the network topology. Moreover,
accumulated delays along communication links are discussed. We show that, for multi-hop relay protocol,
the faster the protocol converges, the more sensitive it is to the delay. This tradeoff is identified when
we investigate the stable delay margin using frequency sweep method. +
D
We consider the problem of controlling a linear time invariant process when the controller is located at a location remote from where the sensor measurements are being generated. The communication from the sensor to the controller is supported by a communication network with arbitrary topology composed of analog erasure channels. Using a separation principle, we prove that the optimal LQG controller consists of an LQ optimal regulator along with an estimator that estimates the state of the process across the communication network mentioned above. We then determine the optimal information processing strategy that should be followed by each node in the network so that the estimator is able to compute the best possible estimate in the minimum mean squared error sense. The algorithm is optimal for any packet-dropping process and at every time step, even though it is recursive and hence requires a constant amount of memory, processing and transmission at every node in the network per time step. For the case when the packet drop processes are memoryless and independent across links, we analyze the stability properties and the performance of the closed loop system. The algorithm is an attempt to escape the more commonly used viewpoint of treating a network of communication links as a single end-to-end link with the probability of successful transmission determined by some measure of the reliability of the network. +
We consider the problem of designing distributed control protocols -for aircraft vehicle management systems- that cooperatively allocate electric power while meeting certain higher level goals and requirements, and dynamically reacting to the changes in the internal system state and external environment. A decentralized control problem is posed where each power distribution unit is equipped with a controller that implements a local protocol to allocate power to a certain subset of loads. We use linear temporal logic as the specification language for describing correct behaviors of the system (e.g., safe operating conditions) as well as the admissible dynamic behavior of the environment due to, for example, wind gusts and changes in system health. We start with a global specification and decompose it into local ones. These decompositions allow the protocols for each local controller to be separately synthe- sized and locally implemented while guaranteeing the global specifications to hold. Through a design example, we show that by refining the interface rules between power distribution units, it is possible to reduce the total power requirement. +
R
We consider the problem of designing policies for partially observable Markov decision processes (POMDPs) with dynamic coherent risk objectives. Synthesizing risk-averse optimal policies for POMDPs requires infinite memory and thus undecidable. To overcome this difficulty, we propose a method based on bounded policy iteration for designing stochastic but finite state (memory) controllers, which takes advantage of standard convex optimization methods. Given a memory budget and optimality criterion, the proposed method modifies the stochastic finite state controller leading to sub-optimal solutions with lower coherent risk. +
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic coherent risk objectives and constraints. We begin by formulating the problem in a Lagrangian framework. Under the assumption that the risk objectives and constraints can be represented by a Markov risk transition mapping, we propose an optimization-based method to synthesize Markovian policies that lower-bound the constrained risk-averse problem. We demonstrate that the formulated optimization problems are in the form of difference convex programs (DCPs) and can be solved by the disciplined convex-concave programming (DCCP) framework. We show that these results generalize linear programs for constrained MDPs with total discounted expected costs and constraints. Finally, we illustrate the effectiveness of the proposed method with numerical experiments on a rover navigatio +
C
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic coherent risk objectives and constraints. We begin by formulating the problem in a Lagrangian framework. Under the assumption that the risk objectives and constraints can be represented by a Markov risk transition mapping, we propose an optimization-based method to synthesize Markovian policies that lower-bound the constrained risk-averse problem. We demonstrate that the formulated optimization problems are in the form of difference convex programs (DCPs) and can be solved by the disciplined convex-concave programming (DCCP) framework. We show that these results generalize linear programs for constrained MDPs with total discounted expected costs and constraints. Finally, we illustrate the effectiveness of the proposed method with numerical experiments on a rover navigation problem involving conditional-value-at-risk (CVaR) and entropic-value-at-risk (EVaR) coherent risk measures. +
D
Dynamic State Estimation in Distributed Aircraft Electric Control Systems via Adaptive Submodularity +
We consider the problem of estimating the discrete state of an aircraft electric system under a distributed control architecture through active sensing. The main idea is to use a set of controllable switches to reconfigure the system in order to gather more information about the unknown state. By adaptively making a sequence of reconfiguration decisions with uncertain outcome and by correlating the measurements and prior information to make the next decision, we aim to reduce the uncertainty. A greedy strategy is developed that maximizes the one-step expected uncertainty reduction. By exploiting recent results on adaptive submodularity, we give theoretical guarantees on the worst-case performance of the greedy strategy. We apply the proposed method in a fault de- tection scenario where the discrete state captures possible faults in various circuit components. In addition, simple abstraction rules are proposed to alleviate state space explosion and to scale up the strategy. Finally, the efficiency of the proposed method is demonstrated empirically on different circuits. +
E
We consider the problem of estimating the subset of test conditions under which a simplified model—or set of simplified models—accurately approximates the behavior of a true system. We approach the problem by proposing a compact set of possible test conditions, and an unknown but samplable continous validity function over that set that quantifies the accuracy of the model under each possible condition. We propose a novel Bayes estimator that optimally directs function sampling to greedily minimize the expected posterior misclassification rate of the valid set, which we call minimum posterior misclassification sampling (GP-MPM), and we show that the the method can be extended to approximate the valid sets of a partially ordered set of models, with sample complexity growing sublinearly with the number of models. In testing against a safety-focused model, we show that the algorithm’s estimated valid set approaches the true valid set much more quickly than undirected sampling, even with small sample sizes. +
M
We consider the problem of planning motions in observations space, based on learned models of the dynamics that associate to each action a diffeomorphism of the observations domain. For an arbitrary set of diffeomorphisms, this problem must be formulated as a generic search problem. We adapt established algorithms of the graph search family. In this scenario, node expansion is very costly, as each node in the graph is associated to an uncertain diffeomorphism and corresponding predicted observations. We describe several improvements that ameliorate performance: the introduction of better image similarities to use as heuristics; a method to reduce the number of expanded nodes by preliminarily identifying redundant plans; and a method to pre-compute composite actions that make the search efficient in all directions. +
A
We consider the problem of purely visual pose stabilization (also known as servoing) of a second-order rigid- body system with six degrees of freedom: how to choose forces and torques, based on the current view and a memorized goal image, to steer the pose towards a desired one. Emphasis has been given to the bio-plausibility of the computation, in the sense that the control laws could be in principle implemented on the neural substrate of simple insects. We show that stabilizing laws can be realized by bilinear/quadratic operations on the visual input. This particular computational structure has several numerically favorable characteristics (sparse, local, and parallel), and thus permits an efficient engineering implementation. We show results of the control law tested on an indoor helicopter platform. +
We consider the problem of purely visual pose stabilization of a second-order rigid-body system: how to choose forces and torques, based on the visual input alone, such that the view converges to a memorized goal image. Emphasis has been given to the bio-plausibility of the computation, in the sense that the control laws could be in principle implemented on the neural substrate of simple insects. We show that stabilizing laws can be realized by bilinear/quadratic operations on the visual input. Moreover, the control laws can be ``bootstrapped'' (learned unsupervisedly) from experience, which further substantiate the bio-plausibility of such computation. +
S
We consider the problem of reasoning about networked and layered control systems using assume-guarantee specifications. As these systems are formed by the interconnection of components that operate under various clocks, we introduce a new logic, Multiclock Logic (MCL), to be able to express the requirements of components form the point of view of their local clocks. Specifying components locally promotes independent design and component reuse. We carry out a contract-based analysis of a control system implemented via two control algorithms (model predictive control and feedback linearization) running on their own processors and clocks. Then we implement each of the contracts to build a system. The system performs as desired when the requirements derived from our system-level analysis are respected. Violating the constraints required by the contract-based analysis of the system leads to error. +
T
We consider the problem of synthesizing hier- archical controllers for discrete-time switched affine systems subject to exogenous disturbances that guarantee that the trajectories of the system satisfy a high-level specification expressed as a linear temporal logic formula. Our method builds upon recent results on temporal logic planning and embedded controller synthesis. First, the control problem is lifted to a discrete level by constructing a finite transition system that abstracts the behavior of the underlying switched system. At the discrete level, we recast the problem as a two player temporal logic game by treating the environment driven switches as adversaries. The solution strategy for the game (i.e. the discrete plan) is then implemented at the continuous level by solving finite-horizon optimal control problems that establish reachability between discrete states and that compensate the effects of continuous disturbances. We also extend the earlier work by making efficient use of propositions in the temporal logic formula to drive the abstraction procedure and to facilitate the computation of continuous input at implementation time.
An aircraft fuel system example is formulated; and solved using the proposed method. This sample problem demonstrates the applicability of the abstraction procedure and correct-by-construction controllers to regulate the fuel levels in multiple tanks during interesting operations like aerial refueling. +
D
We consider the problem of synthesizing control protocols for smart camera networks where the goal is to guarantee that certain linear temporal logic (LTL) specifications related to a given surveillance task are met. We first present a centralized control architecture for assigning pan-tilt-zoom (PTZ) cameras to targets so that the specification is met for any admissible behavior of the targets. Then, in order to alleviate the computational complexity associated with LTL synthesis and to enable implementation of local control protocols on individual PTZ cameras, we propose a distributed synthe- sis methodology. The main idea is to decompose the global specification into local specifications for each PTZ camera. A thorough design example is presented to illustrate the steps of the proposed procedure. +
H
We consider the problem of synthesizing robot controllers to realize a task that unpredictably changes with time. Tasks are formally expressed in the GR(1) fragment of temporal logic, in which some of the variables are set by an adversary. The task changes by the addition or removal of goals, which occurs online (i.e., at run-time). We present an algorithm for mending control strategies to realize tasks after the addition of goals, while avoiding global re-synthesis of the strategy. Experiments are presented for a planar surveillance task in which new regions of interest are incrementally added. Run-times are empirically shown to be extremely favorable compared to re-synthesizing from scratch. We also present an algorithm for mending control strategies for the removal of goals. While in this setting the original strategy is still feasible (as we prove), our algorithm provides a more satisfying solution by âtightening loose ends.â Both algorithms are shown to yield so-called reach annotations, and thus the control strategies are easily amenable to other algorithms concerning incremental synthesis, e.g., as in previous work by the authors for robot navigation in uncertain environments. +
E
We define a notion of controllability for mechanical systems which determines
the configurations which are accessible from a given configuration. We
derive sufficient conditions for this notion of controllability in terms of
the given inputs, their Lie brackets, and their covariant derivatives. +
R
We define the real time trajectory generation problem to be how to generate,
possibly with some delay, a full state space and input trajectory in real
time from an output trajectory that is given online, while allowing a
tradeoff between stability and performance. We
propose two algorithms that solve the real time trajectory generation
problem for flat systems with zero dynamics, and discuss some interesting
properties. We explicitly address the tradeoff between stability and
performance. The algorithms are validated in simulations and experiments
with a thrust vectored ducted fan aircraft. +
A
A two-state ribosome and protein model can robustly capture the chemical reaction dynamics of gene expression +
We derive phenomenological models of gene expression from a mechanistic description of chemical reactions using an automated model reduction method. Using this method, we get analytical descriptions and computational performance guarantees to compare the reduced dynamics with the full models. We develop a new two-state model with the dynamics of the available free ribosomes in the system and the protein concentration. We show that this new two-state model captures the detailed mass-action kinetics of the chemical reaction network under various biologically plausible conditions on model parameters. On comparing the performance of this model with the commonly used mRNA transcript-protein dynamical model for gene expression, we analytically show that the free ribosome and protein model has superior error and robustness performance. +