Anisotropic Template Ansätze for Robust Positive Invariance under State-Dependent Uncertainty

Abdelrahman Ramadan, Melissa Greeff, Sidney Givigi

Interactive companion to the extended author version of an article accepted in IEEE Control Systems Letters

Keywords: Data-driven control, set invariance, Gaussian processes

Accepted article: IEEE Xplore  |  Simulator: quadrotor demo

Abstract We establish sufficient conditions for robust positive invariance under state- and input-dependent disturbances with anisotropic covariance structure. The proposed ansätze map a fixed ellipsoidal template through a GP-derived positive-definite matrix field, subsuming scalar homothetic scaling while retaining finite graph-based verification. The resulting LMI conditions couple the learned field to Schur-stable dynamics; an isotropic fallback with inflation factor $r=1/(1-\gamma_{\mathrm{cl}})$ proves admissibility. During each learning epoch the field is frozen, so online tube evaluation is one GP covariance query and a small matrix square root, with no online set iteration or LMI solve. Quadrotor simulations show a $195\times$ reduction in 3D velocity-tube volume and a $2.1{\times}10^5$ reduction in the joint 7D velocity-control subspace relative to a non-adaptive homothetic baseline. This companion document expands the derivations behind the extended manuscript, including proof sketches, intuition boxes, controller-sweep figures, contraction studies, and projection-area comparisons.

Sets & Spaces

SymbolDefinitionSignificance
$\mathbb{R}^n$, $\mathbb{R}^m$Euclidean spaces of dimension $n$ (states) and $m$ (inputs)Ambient spaces for all dynamics
$\mathbb{R}_+$Non-negative realsDomain for operator-monotone functions
$\mathbb{S}^n_{++}$, $\mathbb{S}^n_+$Symmetric positive-definite / positive-semidefinite matricesCodomain of anisotropy field $\mathbf{P}$; GP posterior covariance space
$\mathcal{X}$, $\mathcal{U}$State and input constraint sets (compact, convex)Operating envelope for the system
$\mathcal{Z}:=\mathcal{X}\times\mathcal{U}$Joint operating spaceDomain of $\mathbf{P}(\cdot)$ and GP posteriors
$\mathcal{B}_2^n$Euclidean unit ball in $\mathbb{R}^n$Fixed template shape $\bar{\mathbf{W}} = \mathcal{B}_2^n$
$\mathbb{W}$Fixed, state-independent disturbance setClassical RPI formulation
$\mathbb{W}(\mathbf{x},\mathbf{u})$State- and input-dependent disturbance setCore object: parameterized by $\mathbf{P}$
$\bar{\mathbf{W}}$Fixed template disturbance set (chosen as $\mathcal{B}_2^n$)Ansatz base shape; $\mathbf{0}\in\mathrm{int}(\bar{\mathbf{W}})$
$\boldsymbol{\Omega}$Robust Positively Invariant (RPI) setGlobal invariant set (classical formulation)
$\bar{\boldsymbol{\Omega}} = r\mathcal{B}_2^n$Tube template (inflated ball)Isotropic RPI template; scaled by $r$
$\boldsymbol{\Omega}(\mathbf{x},\mathbf{u})$Local tube cross-section $= \mathbf{P}(\mathbf{x},\mathbf{u})\bar{\boldsymbol{\Omega}}$Anisotropic tube field (main object of the paper)
$\boldsymbol{\Omega}^{(q)}(\mathbf{z})$Tube field at learning epoch $q$Contracts monotonically as data accumulate
$\mathcal{E}(\mathbf{x},\mathbf{u})$GP credible ellipsoid$= \hat{\boldsymbol{\mu}}_w \oplus c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w^{1/2}\mathcal{B}_2^n$
$\mathcal{D}$Training dataset $\{(\mathbf{z}^{(j)}, \mathbf{w}^{(j)})\}_{j=1}^N$GP training observations
$\mathcal{G}=(\mathcal{V},\mathcal{A})$Directed graph (vertices, arcs)Discretization for finite verification
$\mathcal{R}_i$Partition region (cluster) around node $i$Continuous region represented by a graph node
$\mathscr{A}$Admissible parameter field set$\{\mathbf{P}:\mathcal{Z}\to\mathbb{S}^n_{++} \mid \text{spectral condition holds}\}$

Operators & Set Operations

SymbolDefinitionSignificance
$A \oplus B$Minkowski sum: $\{a+b : a\in A, b\in B\}$Inflates set $A$ by shape $B$; central RPI operation
$A \ominus B$Pontryagin difference: $\{x : x+B \subseteq A\}$Shrinks/erodes set $A$ by $B$
$\preceq$, $\succeq$, $\succ$Löwner (semidefinite) ordering on $\mathbb{S}^n$$\mathbf{A}\preceq\mathbf{B} \Leftrightarrow \mathbf{B}-\mathbf{A}\succeq\mathbf{0}$; drives monotone contraction
$\gamma_{\mathbf{S}}(\mathbf{w})$Gauge function: $\inf\{\lambda\ge 0 : \mathbf{w}\in\lambda\mathbf{S}\}$Measures distance relative to set $\mathbf{S}$
$\mathrm{int}(\cdot)$Interior of a set$\mathbf{0}\in\mathrm{int}(\bar{\mathbf{W}})$ needed for strict contraction
$\mathrm{diag}(\cdot)$Diagonal matrixDiagonal ansatz: $\mathbf{P}=\mathrm{diag}(\sigma_1,\ldots,\sigma_n)$
$\mathrm{diam}(\mathcal{R}_i)$Diameter of region $i$Controls Lipschitz deviation bound $\delta_i$

Matrices

SymbolDefinitionSignificance
$\mathbf{A}$, $\mathbf{B}$State transition and input matricesDynamics: $\mathbf{x}_{k+1}=\mathbf{A}\mathbf{x}_k+\mathbf{B}\mathbf{u}_k+\mathbf{w}$
$\mathbf{A}_{\mathrm{cl}}$Closed-loop matrix $:= \mathbf{A}+\mathbf{B}\mathbf{K}$Must be Schur stable ($\rho < 1$); drives contraction
$\mathbf{K}$Stabilizing feedback gain $\in\mathbb{R}^{m\times n}$$\mathbf{u}=\mathbf{K}\mathbf{x}+\mathbf{v}$
$\mathbf{P}(\mathbf{x},\mathbf{u})$Anisotropy matrix field $:\mathcal{Z}\to\mathbb{S}^n_{++}$Central object: transforms template to local disturbance/tube set
$\mathbf{P}' := \mathbf{P}(\mathbf{x}',\mathbf{u}')$Successor anisotropy matrixAt the next operating point after transition
$\hat{\boldsymbol{\Sigma}}_w(\mathbf{x},\mathbf{u})$GP posterior covariance $\in\mathbb{S}^n_+$Source of anisotropy information
$\hat{\boldsymbol{\Sigma}}_w^{1/2}$Symmetric positive-definite matrix square rootOperator monotone: $\Sigma_1 \preceq \Sigma_2 \Rightarrow \Sigma_1^{1/2}\preceq\Sigma_2^{1/2}$
$\mathbf{M}$Transformed dynamics $:= \mathbf{P}'^{-1}\mathbf{A}_{\mathrm{cl}}\mathbf{P}$Template-coordinate closed-loop operator
$\mathbf{N}$Transformed disturbance $:= \frac{1}{r}\mathbf{P}'^{-1}\mathbf{P}$Template-coordinate disturbance scaling
$\mathbf{I}_n$$n\times n$ identity matrixReference for Löwner bounds, S-procedure
$\mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^\top$Spectral decomposition$\boldsymbol{\Lambda}=\mathrm{diag}(\lambda_1,\ldots,\lambda_n)$; eigenvectors in $\mathbf{V}$
$\mathbf{G}$ (Gram matrix)Kernel matrix $[\mathbf{G}]_{jl}=k(\mathbf{z}^{(j)},\mathbf{z}^{(l)})$Training covariance; conditions GP posterior
$\mathbf{k}_*$Cross-covariance vector $[k(\mathbf{z}_*,\mathbf{z}^{(j)})]_{j=1}^N$Links test point to training data
$\boldsymbol{\Sigma}_{**}$Prior covariance at test pointFixed reference; posterior is $\boldsymbol{\Sigma}_{**}-\hat{\boldsymbol{\Sigma}}_w^{-1}$-update
$\boldsymbol{\Phi}^{(q)}$Explained variance $:= \boldsymbol{\Sigma}_{**} - \hat{\boldsymbol{\Sigma}}_w^{(q)}$Increases monotonically with data (variance reduction)

Vectors & State Variables

SymbolDefinitionSignificance
$\mathbf{x}_k$, $\mathbf{u}_k$State and control input at time $k$System state and applied control
$\hat{\mathbf{x}}_k$Nominal (predicted) stateReference trajectory for tube MPC
$\mathbf{e}_k := \mathbf{x}_k - \hat{\mathbf{x}}_k$Error between actual and nominal stateTube is built around error dynamics
$\mathbf{w}(\mathbf{x},\mathbf{u})$Disturbance vector at $(\mathbf{x},\mathbf{u})$Model mismatch; state-and-input-dependent
$\mathbf{v}$Feed-forward input ($\mathbf{u}=\mathbf{K}\mathbf{x}+\mathbf{v}$)Nominal MPC decision variable
$\mathbf{z} := (\mathbf{x},\mathbf{u}) \in \mathcal{Z}$Operating pointDomain variable for $\mathbf{P}(\cdot)$ and GP evaluation
$\boldsymbol{\xi}$, $\boldsymbol{\eta}$Template-coordinate error and disturbance$\mathbf{e}_k = \mathbf{P}_k\boldsymbol{\xi}$ with $\boldsymbol{\xi}\in\bar{\boldsymbol{\Omega}}$, $\boldsymbol{\eta}\in\bar{\mathbf{W}}$
$\hat{\boldsymbol{\mu}}_w$GP posterior mean vectorPredicted disturbance center at operating point

Scalars & Parameters

SymbolDefinitionSignificance
$n$, $m$State and input dimensionsProblem size
$r = \frac{1}{1-\gamma_{\mathrm{cl}}}$Template inflation factorKey parameter: minimal scalar such that $r\mathcal{B}_2^n$ is RPI
$\rho(\mathbf{A})$Spectral radius $:= \max_i|\lambda_i(\mathbf{A})|$Schur stability criterion: $\rho(\mathbf{A}_{\mathrm{cl}}) < 1$
$\gamma$Lyapunov contraction rate$\rho_{\bar{\boldsymbol{\Omega}}} < 1$ ensures tube contraction
$\alpha$GP confidence level $\in(0,1)$Controls credible-region size; higher $\alpha$ = larger set
$c_{n,\alpha} = \sqrt{\chi^2_{n,1-\alpha}}$Confidence scaling factorConverts $\chi^2$ quantile to ellipsoid radius
$\alpha^*$Certifiable confidence levelLargest $\alpha$ such that invariance holds
$p_{\min}$, $p_{\max}$Global spectral bounds on $\mathbf{P}$$p_{\min}\mathbf{I}\preceq\mathbf{P}(\mathbf{z})\preceq p_{\max}\mathbf{I}$
$p_i^+$, $p_i^-$Local upper/lower spectral bounds at node $i$$\lambda_{\max}(\mathbf{P}_i)\pm\delta_i$; finite verification inputs
$\kappa_P = p_{\max}/p_{\min}$Condition number of anisotropy fieldMeasures spread of anisotropy across operating space
$L_P$Lipschitz constant of $\mathbf{P}(\cdot)$Controls discretization error; inherited from GP kernel
$\delta_i = L_P\cdot\mathrm{diam}(\mathcal{R}_i)$Lipschitz deviation at node $i$Bounds eigenvalue excursion within partition cell
$\lambda_1,\lambda_2 \ge 0$S-procedure multipliers with $\lambda_1+\lambda_2\le1$LMI decision variables (Theorems 2 and 4)
$\sigma_n^2$GP observation noise varianceIrreducible noise floor
$\sigma_f^2$, $\ell$Kernel signal variance and length scaleSE kernel hyperparameters
$N$Number of GP training pointsDataset size; grows across epochs
$N_{\mathrm{gp}}$Active GP dictionary size (capped)Finite by Assumption 3; controls Schur-complement bound
$\underline{\sigma}_0^2$Uniform lower bound on LMC prior covariance$\boldsymbol{\Sigma}_0(\mathbf{z}) \succeq \underline{\sigma}_0^2 \mathbf{I}_n$
$\bar{k}_0$Upper bound on prior cross-covariance blocksUsed in Schur-complement lower bound
$\underline{\lambda}$$\frac{\sigma_n^2\,\underline{\sigma}_0^2}{\sigma_n^2 + N_{\mathrm{gp}}\bar{k}_0}$Uniform lower bound on $\lambda_{\min}(\hat{\boldsymbol{\Sigma}}_w)$; ensures $p_{\min} > 0$
$\alpha_k$Homothetic scaling factor (scalar ansatz)Baseline: $\alpha_k\bar{\boldsymbol{\Omega}}$
$\boldsymbol{\gamma}_k \in \mathbb{R}^p$Per-facet scaling vector (elastic tubes)Richer than scalar but still stage-indexed

Norms & Functions

SymbolDefinitionSignificance
$\|\cdot\|_2$Euclidean / spectral normPrimary norm for vectors and matrices throughout
$\|\cdot\|_F$Frobenius normUsed in Lipschitz analysis of $\mathbf{P}$
$k(\mathbf{z},\mathbf{z}')$Kernel (covariance) functionGP building block; encodes smoothness prior
$k_{\mathrm{SE}}$Squared-exponential kernel: $\sigma_f^2\exp(-\|\mathbf{z}-\mathbf{z}'\|^2/2\ell^2)$Standard smooth kernel satisfying Lipschitz assumptions
$\mathcal{GP}(\mu,k)$Gaussian Process priorDistribution over functions; $f\sim\mathcal{GP}(\mu,k)$
$\lambda_{\max}(\cdot)$, $\lambda_{\min}(\cdot)$Maximum / minimum eigenvalueSpectral bounds for Löwner and LMI conditions

Key Equations & Conditions

EquationNameSignificance
$\mathbb{W}(\mathbf{x},\mathbf{u})=\mathbf{P}(\mathbf{x},\mathbf{u})\bar{\mathbf{W}}$Anisotropic template ansatzCore parameterization: local set = matrix × template
$\mathbf{x}_{k+1}=\mathbf{A}\mathbf{x}_k+\mathbf{B}\mathbf{u}_k+\mathbf{w}$System dynamicsDiscrete-time model with additive disturbance
$\mathbf{e}_{k+1}=\mathbf{A}_{\mathrm{cl}}\mathbf{e}_k+\mathbf{w}$Error dynamicsTube is invariant for the error system
$\mathbf{A}_{\mathrm{cl}}\boldsymbol{\Omega}\oplus\mathbb{W}\subseteq\boldsymbol{\Omega}$RPI condition (classical)Set containment that defines invariance
$\|\mathbf{M}\boldsymbol{\xi}+\mathbf{N}\boldsymbol{\eta}\|_2 \le 1$Spectral invariance condition (Thm 1)Template-coordinate sufficient condition
$\begin{bmatrix}\lambda_1\mathbf{I}{-}\mathbf{M}^\top\mathbf{M} & {-}\mathbf{M}^\top\mathbf{N}\\ {-}\mathbf{N}^\top\mathbf{M} & \lambda_2\mathbf{I}{-}\mathbf{N}^\top\mathbf{N}\end{bmatrix}\succeq\mathbf{0},\quad \lambda_1+\lambda_2\le1$LMI condition (Thm 2)Computationally verifiable via SDP
$\frac{p_i^+}{p_j^-}\bigl(\gamma_{\mathrm{cl}}+\frac{1}{r}\bigr)\le 1$Arc-wise spectral condition (Thm 3)Reduces infinite check to finite graph arcs
$\hat{\boldsymbol{\Sigma}}_w^{(q+1)}\preceq\hat{\boldsymbol{\Sigma}}_w^{(q)}$Monotone variance contraction (Thm 5)Posterior shrinks in Löwner order with more data

Simulation Parameters (Section V)

SymbolDefinitionValue / Context
$\beta_1$, $\beta_2$Aerodynamic drag coefficient; actuator inefficiency$0.15$; $0.08$
$\sigma_w$Wind standard deviation$0.05$
$T_s$Sampling period$0.02$ s
$Q_p, Q_v, Q_a, Q_r$LQR state weights (position, velocity, attitude, rate)Sweep parameters across 7 layers
$R_T, R_\tau$LQR input weights (thrust, torque)Sweep parameters
$\|Q_v^{1/2} P(\mathbf{z})\|_2$Weighted invariance normMust be $< 1$ for RPI certification
$\delta\mathbf{v}\in\mathbb{R}^3$, $\delta\mathbf{u}\in\mathbb{R}^4$Velocity error and control deviationJoint 7D subspace for tube volume comparison

Contents

I. Introduction

Robust Model Predictive Control (MPC) requires disturbance-invariant sets for constraint satisfaction under uncertainty. The standard formulation assumes a fixed, state-independent disturbance set $\mathbb{W}\subset\mathbb{R}^n$ and computes a Robust Positively Invariant (RPI) set $\boldsymbol{\Omega}$ satisfying $\mathbf{A}_{\mathrm{cl}}\boldsymbol{\Omega}\oplus\mathbb{W}\subseteq\boldsymbol{\Omega}$. This is computationally tractable but conservative: a single $\mathbb{W}$ must bound worst-case disturbances uniformly, so spatial variation in uncertainty magnitude and orientation is lost.

Minkowski Sum. Given two sets $A,B\subset\mathbb{R}^n$, the Minkowski sum is $A\oplus B:=\{a+b: a\in A,\;b\in B\}$. Geometrically, it "inflates" $A$ by the shape of $B$.

Pontryagin Difference. The Pontryagin difference (or erosion) is $A\ominus B:=\{x\in\mathbb{R}^n: x+B\subseteq A\}$. It "shrinks" $A$ by $B$. Key property: $A\ominus B \subseteq A$ whenever $\mathbf{0}\in B$.

RPI connection. A set $\boldsymbol{\Omega}$ is RPI for $(\mathbf{A}_{\mathrm{cl}},\mathbb{W})$ when $\mathbf{A}_{\mathrm{cl}}\boldsymbol{\Omega}\oplus\mathbb{W}\subseteq\boldsymbol{\Omega}$, meaning at every point the "pushed-forward" set plus the disturbance set stays within $\boldsymbol{\Omega}$. Equivalently, $\mathbf{A}_{\mathrm{cl}}\boldsymbol{\Omega}\subseteq\boldsymbol{\Omega}\ominus\mathbb{W}$.

We observe that methods addressing this conservatism share a common structure: parameterization of disturbance or tube sets as transformations of a fixed template shape. We formalize this as a template ansatz, a structured trial form wherein $(\mathbf{x},\mathbf{u})$-dependence is encoded in transformation parameters rather than set geometry.

Scalar ansätze. Homothetic tube MPC uses $\alpha_k\bar{\boldsymbol{\Omega}}$ with scalar $\alpha_k$ optimized over the prediction horizon. Extensions learn stage-indexed scalars through Lipschitz analysis or the scenario approach. These methods capture magnitude variation but not directional structure.

Per-facet and shape-free ansätze. Elastic tubes generalize to per-facet scaling $\boldsymbol{\gamma}_k\in\mathbb{R}^p$, and configuration-constrained tubes optimize polytope vertices online. These permit richer geometric variation but remain stage-indexed: adaptation tracks the nominal trajectory rather than the current operating point.

Diagonal ansätze. GP-based methods construct confidence regions from posteriors, producing axis-aligned hyperboxes $\mathbb{W}(\mathbf{x})=\{\mathbf{w}:|w_j|\le\beta\sigma_{n,j}(\mathbf{x})\}$, equivalent to $\mathbf{P}=\mathrm{diag}(\sigma_1,\ldots,\sigma_n)$, but cannot represent inter-component correlations.

The gap. No existing method provides a full matrix $\mathbf{P}(\mathbf{x},\mathbf{u})\in\mathbb{S}^n_{++}$ that varies with the current operating point and is learned from data.

Table I: Template ansätze in the literature and the proposed generalization.
MethodTransformationDepends onLearned
HomotheticScalar $\alpha$Stage $k$No
ElasticPer-facet $\boldsymbol{\gamma}$Stage $k$No
CCTMPCFull shapeStage $k$No
Kohler et al.Scalar$(\mathbf{x},\mathbf{u})$No
GP-MPCDiagonal$\mathbf{x}$ onlyYes
ScenarioScalarStage $k$Yes
ProposedFull matrix $\mathbf{P}$$(\mathbf{x},\mathbf{u})$Yes

Relation to prior work. State- and input-dependent disturbances induce a circular dependence: verifying that states remain within a candidate invariant set requires knowledge of reachable states, which depends on the invariant set. Our prior work resolves this by lifting the dynamics to an augmented space $\mathbb{R}^{2n+m}$, where the disturbance law becomes a graph constraint, and then computing RPI sets as fixed points of a set-valued operator. That lifted fixed-point method is exact but computationally expensive: polytope iterations require seconds with GPU acceleration and can take minutes without it. The present work takes a different route. The base template pair $(\bar{\mathbf{W}}, \bar{\boldsymbol{\Omega}})$ is computed once at initialization; thereafter, invariance is enforced by constraining the parameter field $\mathbf{P}(\cdot)$ rather than by recomputing set geometry.

Richer transformation classes. The template ansatz framework naturally suggests a hierarchy of increasing expressiveness: scalar (homothetic), diagonal, symmetric positive-definite (present work), general linear, and polynomial. The latter would parameterize disturbance sets as semialgebraic sublevel sets $\mathbb{W} = \{\mathbf{w} : p(\mathbf{w}; \mathbf{x}, \mathbf{u}) \leq 1\}$ with verification via sum-of-squares programming. This could capture non-ellipsoidal uncertainty structure at higher computational cost.

Contributions. We introduce the anisotropic template ansatz

$$\mathbb{W}(\mathbf{x},\mathbf{u})=\mathbf{P}(\mathbf{x},\mathbf{u})\,\bar{\mathbf{W}},$$

where $\bar{\mathbf{W}}\subset\mathbb{R}^n$ is a fixed template and $\mathbf{P}(\mathbf{x},\mathbf{u})\in\mathbb{S}^n_{++}$ encodes local anisotropy learned from GP posteriors via $\mathbf{P}=c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w^{1/2}$.

  1. Template ansatz formalism. Unification of scalar, diagonal, and full-matrix parameterizations under a single framework indexed by transformation class.
  2. Parameter-space invariance conditions. Sufficient conditions for RPI formulated as LMIs coupling $\mathbf{P}(\cdot)$ to closed-loop dynamics, bypassing set-valued iteration.
  3. Graph-based finite verification. Discretization of the continuous operating space into a finite graph with arc-wise spectral conditions.
  4. Single-query online evaluation. After offline GP training and graph verification, each online tube cross-section is obtained by one GP covariance query and one small matrix square root, independent of horizon length, graph size, and iteration count. The base template pair $(\bar{\mathbf{W}}, \bar{\boldsymbol{\Omega}})$ is fixed after initialization; subsequent learning epochs refine only $\mathbf{P}^{(q)}(\cdot)$, and tubes contract monotonically as data accumulate.

II. Anisotropic Template Geometry

Consider a discrete-time system with nominal linear dynamics and additive state- and input-dependent uncertainty:

$$\mathbf{x}_{k+1}=\mathbf{A}\mathbf{x}_k+\mathbf{B}\mathbf{u}_k+\mathbf{w}(\mathbf{x}_k,\mathbf{u}_k),$$

where $\mathbf{x}_k\in\mathcal{X}\subseteq\mathbb{R}^n$, $\mathbf{u}_k\in\mathcal{U}\subseteq\mathbb{R}^m$, $(\mathbf{A},\mathbf{B})$ are nominal matrices, and $\mathbf{w}(\mathbf{x},\mathbf{u})\in\mathbb{W}(\mathbf{x},\mathbf{u})\subset\mathbb{R}^n$ is a state- and input-dependent residual. Thus $\mathbf{A},\mathbf{B}$ are assumed available from modeling, local linearization, or identification; in the latter case $\mathbf{w}$ also absorbs identification error.

A matrix $\mathbf{A}\in\mathbb{R}^{n\times n}$ is Schur stable if all its eigenvalues lie strictly inside the unit circle: $\rho(\mathbf{A}):=\max_i|\lambda_i(\mathbf{A})|<1$. The number $\rho(\mathbf{A})$ is the spectral radius.

Why it matters here. For the error dynamics $\mathbf{e}_{k+1}=\mathbf{A}_{\mathrm{cl}}\mathbf{e}_k+\mathbf{w}_k$, Schur stability of $\mathbf{A}_{\mathrm{cl}}$ ensures that disturbance effects do not accumulate unboundedly. Without it, no bounded invariant set can exist (the error can grow arbitrarily). With $\rho(\mathbf{A}_{\mathrm{cl}})<1$, the "contraction" from $\mathbf{A}_{\mathrm{cl}}$ can compensate for the "expansion" from $\mathbf{w}_k$, enabling bounded tubes.

Discrete vs. continuous. In continuous time the analogous concept is Hurwitz stability (eigenvalues in the open left half-plane). In discrete time we use Schur stability (eigenvalues inside the unit disk).

Assumption 1 (Standing Hypotheses)

The constraint sets $\mathcal{X}$, $\mathcal{U}$ are compact and convex. A stabilizing gain $\mathbf{K}\in\mathbb{R}^{m\times n}$ exists such that $\mathbf{A}_{\mathrm{cl}}:=\mathbf{A}+\mathbf{B}\mathbf{K}$ is Schur stable, i.e., $\rho(\mathbf{A}_{\mathrm{cl}})<1$.

Under affine feedback $\mathbf{u}=\mathbf{K}\mathbf{x}+\mathbf{v}$, the error $\mathbf{e}_k:=\mathbf{x}_k-\hat{\mathbf{x}}_k$ relative to a nominal trajectory satisfies

$$\mathbf{e}_{k+1}=\mathbf{A}_{\mathrm{cl}}\mathbf{e}_k+\mathbf{w}(\mathbf{x}_k,\mathbf{u}_k).$$

The Anisotropic Template Ansatz

Definition 1 (Anisotropic Template Ansatz)

An anisotropic template ansatz is a family $\mathbb{W}(\mathbf{x},\mathbf{u})=\mathbf{P}(\mathbf{x},\mathbf{u})\bar{\mathbf{W}}:=\{\mathbf{P}(\mathbf{x},\mathbf{u})\mathbf{w}:\mathbf{w}\in\bar{\mathbf{W}}\}$ where $\bar{\mathbf{W}}\subset\mathbb{R}^n$ is a fixed template with $\mathbf{0}\in\mathrm{int}(\bar{\mathbf{W}})$ and $\mathbf{P}:\mathcal{X}\times\mathcal{U}\to\mathbb{S}^n_{++}$ maps to the cone of symmetric positive-definite matrices.

A symmetric matrix $\mathbf{P}\in\mathbb{R}^{n\times n}$ (i.e., $\mathbf{P}=\mathbf{P}^\top$) belongs to $\mathbb{S}^n_{++}$ (positive definite) if $\mathbf{v}^\top\mathbf{P}\mathbf{v}>0$ for all nonzero $\mathbf{v}\in\mathbb{R}^n$, and to $\mathbb{S}^n_+$ (positive semidefinite) if $\mathbf{v}^\top\mathbf{P}\mathbf{v}\ge 0$.

Spectral decomposition. Any symmetric $\mathbf{P}$ admits the decomposition $\mathbf{P}=\mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^\top$ where $\mathbf{V}$ is orthogonal (columns are eigenvectors) and $\boldsymbol{\Lambda}=\mathrm{diag}(\lambda_1,\ldots,\lambda_n)$ are the eigenvalues. The matrix is positive definite iff all $\lambda_i>0$.

Geometric meaning. The set $\mathbf{P}\mathcal{B}_2^n$ (image of the unit ball under $\mathbf{P}$) is an ellipsoid whose principal axes have directions $\mathbf{v}_i$ and lengths $\lambda_i$. The matrix $\mathbf{P}$ thus encodes both orientation and shape of this ellipsoid, which is why it captures anisotropic (direction-dependent) uncertainty.

Matrix square root. If $\mathbf{P}\in\mathbb{S}^n_{++}$, there exists a unique $\mathbf{P}^{1/2}\in\mathbb{S}^n_{++}$ such that $(\mathbf{P}^{1/2})^2=\mathbf{P}$. Via the spectral decomposition: $\mathbf{P}^{1/2}=\mathbf{V}\boldsymbol{\Lambda}^{1/2}\mathbf{V}^\top$.

The matrix $\mathbf{P}(\mathbf{x},\mathbf{u})$ encodes local anisotropy via its spectral decomposition: eigenvectors define principal uncertainty directions; eigenvalues set magnitudes along these axes. Setting $\mathbf{P}=\sigma\mathbf{I}_n$ recovers isotropic scaling; $\mathbf{P}=\mathrm{diag}(\sigma_1,\ldots,\sigma_n)$ yields axis-aligned boxes; the general case admits rotation and shear.

GP Posteriors as Anisotropy Fields

From trajectory data, disturbance samples $\mathbf{w}^{(j)}=\mathbf{x}^{(j+1)}-\mathbf{A}\mathbf{x}^{(j)}-\mathbf{B}\mathbf{u}^{(j)}$ yield dataset $\mathcal{D}=\{(\mathbf{z}^{(j)},\mathbf{w}^{(j)})\}_{j=1}^N$ with $\mathbf{z}:=(\mathbf{x},\mathbf{u})\in\mathcal{Z}:=\mathcal{X}\times\mathcal{U}$. The GP posterior covariance $\hat{\boldsymbol{\Sigma}}_w(\mathbf{x},\mathbf{u})\in\mathbb{S}^n_+$ yields the $(1-\alpha)$ credible region:

$$\mathcal{E}(\mathbf{x},\mathbf{u})=\hat{\boldsymbol{\mu}}_w\oplus c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w^{1/2}\mathcal{B}_2^n,$$

where $c_{n,\alpha}:=\sqrt{\chi^2_{n,1-\alpha}}$ and $\mathcal{B}_2^n$ is the Euclidean unit ball. The GP-induced anisotropy field is $\mathbf{P}(\mathbf{x},\mathbf{u}):=c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w(\mathbf{x},\mathbf{u})^{1/2}$.

Base Template and Invariance Geometry

The base template pair $(\bar{\mathbf{W}},\bar{\boldsymbol{\Omega}})$ is computed once at initialization. We fix $\bar{\mathbf{W}} = \mathcal{B}_2^n$ and set the tube template to $\bar{\boldsymbol{\Omega}} = r\mathcal{B}_2^n$, with template inflation factor

$$r \;:=\; \frac{1}{1 - \gamma_{\mathrm{cl}}}.$$

Here the closed-loop contraction rate $\gamma_{\mathrm{cl}} < 1$ is defined as follows: if $\|\mathbf{A}_{\mathrm{cl}}\|_2 < 1$ set $\gamma_{\mathrm{cl}} := \|\mathbf{A}_{\mathrm{cl}}\|_2$; otherwise set $\gamma_{\mathrm{cl}} := \sqrt{\rho(\mathbf{P}_{\mathrm{d}}^{-1}\mathbf{A}_{\mathrm{cl}}^\top\mathbf{P}_{\mathrm{d}}\,\mathbf{A}_{\mathrm{cl}})}$ where $\mathbf{P}_{\mathrm{d}} \succ \mathbf{0}$ is the DARE terminal-cost matrix. Schur stability guarantees $\gamma_{\mathrm{cl}} < 1$ in both cases.

The template inflation factor $r$ is the minimal scalar ensuring $\gamma_{\mathrm{cl}}\,r + 1 \leq r$, i.e., $r\mathcal{B}_2^n$ is RPI for $(\mathbf{A}_{\mathrm{cl}}, \mathcal{B}_2^n)$. The tube template $\bar{\boldsymbol{\Omega}}$ is strictly larger than $\bar{\mathbf{W}}$; this separation provides the geometric headroom that makes the invariance conditions feasible. Subsequent analysis constrains only $\mathbf{P}(\cdot)$.

Remark (Lyapunov Coordinate Change)

When $\|\mathbf{A}_{\mathrm{cl}}\|_2 < 1$, one sets $\mathbf{P}_{\mathrm{d}} = \mathbf{I}_n$ and $\gamma_{\mathrm{cl}} = \|\mathbf{A}_{\mathrm{cl}}\|_2$. When $\|\mathbf{A}_{\mathrm{cl}}\|_2 \geq 1$ (e.g. a 12-state quadrotor with $\rho(\mathbf{A}_{\mathrm{cl}}) < 1$), the change of coordinates $\tilde{\mathbf{e}} = \mathbf{P}_{\mathrm{d}}^{1/2}\mathbf{e}$ yields $\|\tilde{\mathbf{A}}_{\mathrm{cl}}\|_2 = \gamma_{\mathrm{cl}} < 1$, reducing to the same framework. All spectral conditions in Sections III–IV use $\gamma_{\mathrm{cl}}$ accordingly, with spectral bounds $p_i^{\pm}$ understood in the $\mathbf{P}_{\mathrm{d}}$-weighted norm.

Assumption 2 (Anisotropy Field Regularity)

The map $\mathbf{P}:\mathcal{Z}\to\mathbb{S}^n_{++}$ satisfies uniform Löwner bounds $p_{\min}\mathbf{I}_n\preceq\mathbf{P}(\mathbf{z})\preceq p_{\max}\mathbf{I}_n$ with $0\lt p_{\min}\le p_{\max}\lt\infty$, and is Lipschitz continuous with constant $L_P\gt 0$.

Local RPI Fields and the Invariance Problem

For a compact convex set $\mathbf{S}\subset\mathbb{R}^n$ with $\mathbf{0}\in\mathrm{int}(\mathbf{S})$, the gauge function is

$$\gamma_{\mathbf{S}}(\mathbf{w}):=\inf\{\lambda\ge 0:\mathbf{w}\in\lambda\mathbf{S}\}.$$

It measures "how far" $\mathbf{w}$ is from the origin relative to $\mathbf{S}$: $\gamma_{\mathbf{S}}(\mathbf{w})\le 1$ iff $\mathbf{w}\in\mathbf{S}$; $\gamma_{\mathbf{S}}(\mathbf{w})=2$ means $\mathbf{w}$ is twice as far as the boundary of $\mathbf{S}$.

Properties: (i) $\gamma_{\mathbf{S}}(\alpha\mathbf{w})=\alpha\,\gamma_{\mathbf{S}}(\mathbf{w})$ for $\alpha\ge 0$ (positive homogeneity); (ii) $\gamma_{\mathbf{S}}(\mathbf{w}_1+\mathbf{w}_2)\le\gamma_{\mathbf{S}}(\mathbf{w}_1)+\gamma_{\mathbf{S}}(\mathbf{w}_2)$ (subadditivity); (iii) for invertible $\mathbf{M}$: $\gamma_{\mathbf{M}\mathbf{S}}(\mathbf{w})=\gamma_{\mathbf{S}}(\mathbf{M}^{-1}\mathbf{w})$.

Special case. For the unit ball $\mathcal{B}_2^n$, the gauge is the Euclidean norm: $\gamma_{\mathcal{B}_2^n}(\mathbf{w})=\|\mathbf{w}\|_2$. For the ellipsoid $\mathbf{P}\mathcal{B}_2^n$ with $\mathbf{P}\succ\mathbf{0}$: $\gamma_{\mathbf{P}\mathcal{B}_2^n}(\mathbf{w})=\|\mathbf{P}^{-1}\mathbf{w}\|_2$.

Definition 2 (Template Contraction Coefficient)

For Schur-stable $\mathbf{A}_{\mathrm{cl}}$ and template $\bar{\boldsymbol{\Omega}}\ni\mathbf{0}$, the contraction coefficient is $\rho_{\bar{\boldsymbol{\Omega}}}:=\sup_{\mathbf{e}\in\bar{\boldsymbol{\Omega}}}\gamma_{\bar{\boldsymbol{\Omega}}}(\mathbf{A}_{\mathrm{cl}}\mathbf{e})$.

Lemma 1 (Strict Contraction)

If $\bar{\boldsymbol{\Omega}}$ is RPI for $(\mathbf{A}_{\mathrm{cl}},\bar{\mathbf{W}})$ with $\mathbf{0}\in\mathrm{int}(\bar{\mathbf{W}})$, then $\rho_{\bar{\boldsymbol{\Omega}}}<1$.

Definition 3 (Local RPI Field)

A map $\mathbf{x}\mapsto\mathcal{B}(\mathbf{x})$ is a local RPI field if the graph $\mathcal{O}:=\bigcup_{\mathbf{x}\in\mathcal{X}}(\mathbf{x}+\mathcal{B}(\mathbf{x}))$ is RPI for the closed-loop system: $\mathbf{x}_k\in\mathcal{O}\Rightarrow\mathbf{x}_{k+1}\in\mathcal{O}$ for all admissible $\mathbf{w}_k\in\mathbb{W}(\mathbf{x}_k,\mathbf{u}_k)$.

Problem 1 (Anisotropic Template Invariance)

Given dynamics, template pair $(\bar{\mathbf{W}}, \bar{\boldsymbol{\Omega}}) = (\mathcal{B}_2^n, r\mathcal{B}_2^n)$ with $r = 1/(1-\gamma_{\mathrm{cl}})$, and GP-induced field $\mathbf{P}(\mathbf{x},\mathbf{u})$ satisfying Assumption 2, find conditions on $\mathbf{P}(\cdot)$ such that $\boldsymbol{\Omega}(\mathbf{x},\mathbf{u}):=\mathbf{P}(\mathbf{x},\mathbf{u})\bar{\boldsymbol{\Omega}}$ is a local RPI field.

III. Learning the Anisotropy Field from Data

This section constructs $\mathbf{P}(\mathbf{x},\mathbf{u})$ from trajectory data via Gaussian Process regression and establishes regularity properties required for invariance.

A Gaussian Process (GP) is a collection of random variables, any finite number of which have a joint Gaussian distribution. A GP is fully specified by a mean function $\mu(\mathbf{z})$ and a covariance (kernel) function $k(\mathbf{z},\mathbf{z}')$: $f\sim\mathcal{GP}(\mu,k)$.

Posterior. Given $N$ observations $\mathcal{D}=\{(\mathbf{z}^{(j)},y^{(j)})\}$ with noise $y=f(\mathbf{z})+\varepsilon$, $\varepsilon\sim\mathcal{N}(0,\sigma_n^2)$, the posterior is also a GP with:

$$\hat{\mu}(\mathbf{z}_*)=\mathbf{k}_*^\top(\mathbf{G}+\sigma_n^2\mathbf{I})^{-1}\mathbf{y},\quad\hat{\sigma}^2(\mathbf{z}_*)=k(\mathbf{z}_*,\mathbf{z}_*)-\mathbf{k}_*^\top(\mathbf{G}+\sigma_n^2\mathbf{I})^{-1}\mathbf{k}_*,$$

where $\mathbf{k}_*=[k(\mathbf{z}_*,\mathbf{z}^{(j)})]_{j=1}^N$ and $[\mathbf{G}]_{jl}=k(\mathbf{z}^{(j)},\mathbf{z}^{(l)})$ is the Gram matrix.

Key property. The posterior variance $\hat{\sigma}^2(\mathbf{z}_*)$ is always $\le$ the prior variance $k(\mathbf{z}_*,\mathbf{z}_*)$, because the subtracted term $\mathbf{k}_*^\top(\mathbf{G}+\sigma_n^2\mathbf{I})^{-1}\mathbf{k}_*\ge 0$. Data never increases uncertainty - this is the foundation for monotone tube contraction.

Multi-output GPs. For vector-valued $\mathbf{w}\in\mathbb{R}^n$, the Linear Model of Coregionalization (LMC) uses mixing matrices to produce a full posterior covariance matrix $\hat{\boldsymbol{\Sigma}}_w(\mathbf{z})\in\mathbb{S}^n_+$ that captures cross-correlations between components.

Data Model and GP Posterior

From observed transitions $\{(\mathbf{x}^{(j)},\mathbf{u}^{(j)},\mathbf{x}^{(j+1)})\}$, disturbance samples $\mathbf{w}^{(j)}:=\mathbf{x}^{(j+1)}-\mathbf{A}\mathbf{x}^{(j)}-\mathbf{B}\mathbf{u}^{(j)}$ capture model mismatch at $\mathbf{z}^{(j)}:=(\mathbf{x}^{(j)},\mathbf{u}^{(j)})$. The GP therefore learns residual uncertainty around a nominal model, not a fully model-free controller.

Assumption 3 (Kernel Regularity)

The kernel $k$ is bounded ($k(\mathbf{z},\mathbf{z})\le\bar{k}$ for all $\mathbf{z}\in\mathcal{Z}$) and Lipschitz ($|k(\mathbf{z}_1,\mathbf{z}')-k(\mathbf{z}_2,\mathbf{z}')|\le L_k\|\mathbf{z}_1-\mathbf{z}_2\|$ for all $\mathbf{z}_1,\mathbf{z}_2,\mathbf{z}'\in\mathcal{Z}$). The active GP dictionary is finite, $N_{\mathrm{gp}}<\infty$, the observation noise satisfies $\sigma_n^2>0$, and the LMC prior covariance is uniformly nondegenerate on $\mathcal{Z}$. The squared-exponential kernel $k_{\mathrm{SE}}(\mathbf{z},\mathbf{z}')=\sigma_f^2\exp(-\|\mathbf{z}-\mathbf{z}'\|^2/2\ell^2)$ satisfies both with $L_k = \sigma_f^2/\ell$.

From Posterior Covariance to Anisotropy Matrix

Lemma 2 (Credible Ellipsoid Decomposition)

The credible ellipsoid admits the representation $\mathcal{E}(\mathbf{z})=\hat{\boldsymbol{\mu}}_w\oplus c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w^{1/2}\mathcal{B}_2^n$ where $c_{n,\alpha}:=\sqrt{\chi^2_{n,1-\alpha}}$.

Definition 4 (GP-Induced Anisotropy Matrix)

Given posterior covariance $\hat{\boldsymbol{\Sigma}}_w(\mathbf{x},\mathbf{u})$ and confidence level $\alpha\in(0,1)$, the anisotropy matrix is $\mathbf{P}(\mathbf{x},\mathbf{u}):=c_{n,\alpha}\,\hat{\boldsymbol{\Sigma}}_w(\mathbf{x},\mathbf{u})^{1/2}$.

Regularity of the Learned Field

Lemma 3 (Boundedness)

Under Assumption 3, there exist $0\lt p_{\min}\le p_{\max}\lt\infty$ such that $p_{\min}\mathbf{I}_n\preceq\mathbf{P}(\mathbf{x},\mathbf{u})\preceq p_{\max}\mathbf{I}_n$ for all $(\mathbf{x},\mathbf{u})\in\mathcal{Z}$.

Lemma 4 (Lipschitz Continuity)

Under Assumption 3, the map $\mathbf{P}:\mathcal{Z}\to\mathbb{S}^n_{++}$ is Lipschitz: $\|\mathbf{P}(\mathbf{z}_1)-\mathbf{P}(\mathbf{z}_2)\|_F\le L_P\|\mathbf{z}_1-\mathbf{z}_2\|$.

Remark 1 (Verification of Assumption 2)

Lemmas 3-4 establish that the GP-induced field satisfies the regularity conditions of Assumption 2, closing the loop between the learning and invariance frameworks.

Learning Pipeline

Algorithm 1: GP-Induced Anisotropy Field Learning

Require: Trajectory data $\mathcal{D}$, confidence $\alpha$, kernel $k$, nominal model $(\mathbf{A},\mathbf{B})$, stabilizing gain $\mathbf{K}$

Ensure: Template pair $(\bar{\mathbf{W}},\bar{\boldsymbol{\Omega}})$, anisotropy field $\mathbf{P}(\cdot)$

1. Disturbance extraction: $\mathbf{w}^{(j)}\gets\mathbf{x}^{(j+1)}-\mathbf{A}\mathbf{x}^{(j)}-\mathbf{B}\mathbf{u}^{(j)}$
2. GP training: Fit hyperparameters via marginal likelihood; compute $\hat{\boldsymbol{\Sigma}}_w(\cdot)$
3. Template pair: compute $\gamma_{\mathrm{cl}}$; $r\gets 1/(1-\gamma_{\mathrm{cl}})$; $\bar{\boldsymbol{\Omega}}\gets r\mathcal{B}_2^n$; $\bar{\mathbf{W}}\gets\mathcal{B}_2^n$
4. Bounds: $p_{\max}\gets\sup_{(\mathbf{x},\mathbf{u})\in\mathcal{Z}}\|\mathbf{P}(\mathbf{x},\mathbf{u})\|_2$ (Lemma 3)
5. Anisotropy field: $\mathbf{P}(\mathbf{x},\mathbf{u})\gets c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w(\mathbf{x},\mathbf{u})^{1/2}$
6. Return $(\bar{\mathbf{W}},\bar{\boldsymbol{\Omega}},r),\;\mathbf{P}(\cdot)$
Proposition 1 (Learned Field Properties)

The field $\mathbf{P}(\mathbf{x},\mathbf{u})$ from Algorithm 1 satisfies: (i) $\mathbf{P}\in\mathbb{S}^n_{++}$; (ii) uniform bounds per Lemma 3; (iii) Lipschitz continuity per Lemma 4; (iv) $\mathbf{P}(\mathbf{x},\mathbf{u})\bar{\mathbf{W}}$ contains the true disturbance with probability $\ge 1-\alpha$.

IV. Invariance Conditions for Anisotropic Tubes

This section derives sufficient conditions for the tube field $\boldsymbol{\Omega}(\mathbf{x},\mathbf{u})=\mathbf{P}(\mathbf{x},\mathbf{u})\bar{\boldsymbol{\Omega}}$ to be locally RPI.

Gauge Functions and Set Membership

Definition 5 (Gauge Function)

For a compact convex set $\mathbf{S}\subset\mathbb{R}^n$ with $\mathbf{0}\in\mathrm{int}(\mathbf{S})$, the gauge function is $\gamma_{\mathbf{S}}(\mathbf{w}):=\inf\{\lambda\ge 0:\mathbf{w}\in\lambda\mathbf{S}\}$.

Lemma 5 (Gauge Function Properties)

Let $\mathbf{S}$ be compact, convex with $\mathbf{0}\in\mathrm{int}(\mathbf{S})$. Then: (i) $\mathbf{S}=\{\mathbf{w}:\gamma_{\mathbf{S}}(\mathbf{w})\le 1\}$; (ii) $\gamma_{\mathbf{S}}(\alpha\mathbf{w})=\alpha\gamma_{\mathbf{S}}(\mathbf{w})$ for $\alpha\ge 0$; (iii) $\gamma_{\mathbf{S}}(\mathbf{w}_1+\mathbf{w}_2)\le\gamma_{\mathbf{S}}(\mathbf{w}_1)+\gamma_{\mathbf{S}}(\mathbf{w}_2)$; (iv) $\gamma_{\mathbf{M}\mathbf{S}}(\mathbf{w})=\gamma_{\mathbf{S}}(\mathbf{M}^{-1}\mathbf{w})$ for invertible $\mathbf{M}$.

Anisotropic RPI via Coordinate Transformation

Geometrically, $\mathbf{P}'^{-1}\mathbf{A}_{\mathrm{cl}}\mathbf{P}$ is the closed-loop error map expressed in the successor ellipsoid's coordinates, while $\mathbf{P}'^{-1}\mathbf{P}$ is the relative disturbance injection. Invariance requires their Minkowski sum to fit inside the unit template. The key idea is to work in template coordinates: if $\mathbf{e}_k \in \mathbf{P}_k\bar{\boldsymbol{\Omega}}$, write $\mathbf{e}_k = \mathbf{P}_k\boldsymbol{\xi}$ for $\boldsymbol{\xi} \in \bar{\boldsymbol{\Omega}}$, so the condition becomes entirely about the fixed template shape.
Theorem 1 (Anisotropic RPI Condition)

Let $\mathbf{P}(\mathbf{x},\mathbf{u})$ satisfy Assumption 2, and let $\bar{\boldsymbol{\Omega}}$ be RPI for $(\mathbf{A}_{\mathrm{cl}},\bar{\mathbf{W}})$ with $\rho_{\bar{\boldsymbol{\Omega}}}<1$. If for all transitions $(\mathbf{x},\mathbf{u})\to(\mathbf{x}',\mathbf{u}')$:

$$\mathbf{P}'^{-1}\mathbf{A}_{\mathrm{cl}}\mathbf{P}\,\bar{\boldsymbol{\Omega}}\oplus\mathbf{P}'^{-1}\mathbf{P}\,\bar{\mathbf{W}}\subseteq\bar{\boldsymbol{\Omega}},$$

where $\mathbf{P}:=\mathbf{P}(\mathbf{x},\mathbf{u})$ and $\mathbf{P}':=\mathbf{P}(\mathbf{x}',\mathbf{u}')$, then $\boldsymbol{\Omega}(\mathbf{x},\mathbf{u})=\mathbf{P}(\mathbf{x},\mathbf{u})\bar{\boldsymbol{\Omega}}$ is locally RPI.

LMI Formulation via the S-Procedure

The S-procedure (or S-lemma) is a technique from convex optimization for certifying quadratic implications. The basic idea: suppose we want to show that whenever a quadratic constraint $f_1(\mathbf{z})\ge 0$ holds, another $f_0(\mathbf{z})\ge 0$ also holds.

Statement. Let $f_i(\mathbf{z})=\mathbf{z}^\top\mathbf{A}_i\mathbf{z}+2\mathbf{b}_i^\top\mathbf{z}+c_i$ for $i=0,1$. If there exists $\tau\ge 0$ such that $f_0(\mathbf{z})\ge\tau f_1(\mathbf{z})$ for all $\mathbf{z}$, then $f_1(\mathbf{z})\ge 0\Rightarrow f_0(\mathbf{z})\ge 0$.

Multi-constraint version. With constraints $f_1(\mathbf{z})\ge 0, f_2(\mathbf{z})\ge 0$, we seek $\tau_1,\tau_2\ge 0$ with $f_0(\mathbf{z})\ge\tau_1 f_1(\mathbf{z})+\tau_2 f_2(\mathbf{z})$ for all $\mathbf{z}$. This is a sufficient (not always necessary) condition, but it converts a logical implication into a matrix inequality (LMI) that can be checked computationally.

In this paper. The S-procedure converts the invariance question "if $\|\boldsymbol{\xi}\|\le 1$ and $\|\boldsymbol{\eta}\|\le 1$, then $\|\mathbf{M}\boldsymbol{\xi}+\mathbf{N}\boldsymbol{\eta}\|\le 1$" into a single LMI.

Theorem 2 (LMI Condition for Anisotropic RPI)

The field $\boldsymbol{\Omega}(\mathbf{x},\mathbf{u})$ is locally RPI if for all transitions there exist $\lambda_1,\lambda_2\ge 0$ with $\lambda_1+\lambda_2\le 1$ such that:

$$\begin{bmatrix}\lambda_1\mathbf{I}_n-\mathbf{M}^\top\mathbf{M}&-\mathbf{M}^\top\mathbf{N}\\-\mathbf{N}^\top\mathbf{M}&\lambda_2\mathbf{I}_n-\mathbf{N}^\top\mathbf{N}\end{bmatrix}\succeq\mathbf{0},$$

where $\mathbf{M}:=\mathbf{P}'^{-1}\mathbf{A}_{\mathrm{cl}}\mathbf{P}$ and $\mathbf{N}:=\tfrac{1}{r}\mathbf{P}'^{-1}\mathbf{P}$.

Corollary 1 (Spectral Sufficient Condition)

Local RPI holds if $\|\mathbf{P}'^{-1}\mathbf{A}_{\mathrm{cl}}\mathbf{P}\|_2+\tfrac{1}{r}\|\mathbf{P}'^{-1}\mathbf{P}\|_2\le 1$ for all transitions. With uniform bounds, this is implied by $(\gamma_{\mathrm{cl}}+1/r)\kappa_P\le 1$ where $\kappa_P:=p_{\max}/p_{\min}$.

Remark 2 (Uniform vs. Arc-Wise Verification)

The uniform condition $(\gamma_{\mathrm{cl}}+1/r)\kappa_P\le 1$ is restrictive: since $\kappa_P := p_{\max}/p_{\min} \geq 1$ by definition, it requires $\gamma_{\mathrm{cl}} + 1/r \leq 1$, i.e., $r \geq 1/(1-\gamma_{\mathrm{cl}})$, which is exactly the template inflation factor. Thus the uniform condition is satisfiable with $\kappa_P = 1$ (isotropic fields) but becomes restrictive for highly anisotropic fields with $\kappa_P \gg 1$. The operative verification is the arc-wise criterion of Theorem 3, which exploits local ratios $p_i^+/p_j^-$ between neighboring regions. On a sufficiently fine grid, these local ratios approach unity even when the global condition number is large.

Graph-Based Discretization for Finite Verification

Assumption 4 (Graph Completeness)

The graph $\mathcal{G}=(\mathcal{V},\mathcal{A})$ captures all dynamically feasible transitions: for any closed-loop trajectory, if $(\mathbf{x}_k,\mathbf{u}_k)\in\mathcal{R}_i$ and $(\mathbf{x}_{k+1},\mathbf{u}_{k+1})\in\mathcal{R}_j$, then $(i,j)\in\mathcal{A}$.

Remark (Constructive graph construction)

In the implementation, $\mathcal{V}$ is constructed by clustering the sampled operating points using $k$-means, and arcs are initialized from consecutive closed-loop transitions in the rollout data. To make Assumption 4 constructive rather than empirical, each observed arc is enlarged by an interval over-approximation of the reachable successor set using the spectral bounds below; this may add spurious arcs, but spurious arcs only strengthen the verification problem because every added arc must also satisfy the invariance test. For the quadrotor benchmark: $k$-means with $N_v = 30$ yields $|\mathcal{A}| = 63$ arcs.

Definition 6 (Region Diameter and Lipschitz Deviation)

For each region $\mathcal{R}_i$: (1) diameter: $\mathrm{diam}(\mathcal{R}_i):=\sup_{\mathbf{z},\mathbf{z}'\in\mathcal{R}_i}\|\mathbf{z}-\mathbf{z}'\|_2$; (2) Lipschitz deviation bound: $\delta_i:=L_P\cdot\mathrm{diam}(\mathcal{R}_i)$.

Lemma 6 (Eigenvalue Bounds within Regions)

For any $\mathbf{z}\in\mathcal{R}_i$: $\lambda_{\max}(\mathbf{P}(\mathbf{z}))\le\lambda_{\max}(\mathbf{P}_i)+\delta_i$ and $\lambda_{\min}(\mathbf{P}(\mathbf{z}))\ge\lambda_{\min}(\mathbf{P}_i)-\delta_i$.

Weyl's inequality bounds how much eigenvalues can change under matrix perturbation. For symmetric matrices $\mathbf{A},\mathbf{B}\in\mathbb{S}^n$:

$$|\lambda_i(\mathbf{A}+\mathbf{B})-\lambda_i(\mathbf{A})|\le\|\mathbf{B}\|_2\quad\text{for }i=1,\ldots,n.$$

Here $\lambda_i$ denotes the $i$-th eigenvalue (ordered). In particular, both the maximum and minimum eigenvalues can shift by at most $\|\mathbf{B}\|_2$.

Why it matters. We know $\mathbf{P}(\mathbf{z})$ exactly only at grid nodes. Within each region, Weyl's inequality (combined with Lipschitz continuity) ensures eigenvalues stay within known bounds, enabling finite verification of invariance over the entire continuous domain.

Definition 7 (Spectral Bound Parameters)

For each node $i\in\mathcal{V}$: $p_i^+:=\lambda_{\max}(\mathbf{P}_i)+\delta_i$ (upper bound on $\|\mathbf{P}(\mathbf{z})\|_2$ for $\mathbf{z}\in\mathcal{R}_i$) and $p_i^-:=\lambda_{\min}(\mathbf{P}_i)-\delta_i$ (lower bound on $\lambda_{\min}(\mathbf{P}(\mathbf{z}))$ for $\mathbf{z}\in\mathcal{R}_i$).

Assumption 5 (Grid Fineness)

The partition $\{\mathcal{R}_i\}$ is sufficiently fine such that $\delta_i<\lambda_{\min}(\mathbf{P}_i)$ for all $i\in\mathcal{V}$, ensuring $p_i^->0$.

Lemma 7 (Spectral Norm Bounds for Matrix Products)

For any $\mathbf{z}\in\mathcal{R}_i$ and $\mathbf{z}'\in\mathcal{R}_j$: $\|\mathbf{M}\|_2\le\frac{p_i^+}{p_j^-}\gamma_{\mathrm{cl}}$ and $\|\mathbf{N}\|_2\le\frac{1}{r}\frac{p_i^+}{p_j^-}$.

Main Discretization Theorem

Theorem 3 (Discrete Graph Invariance via Spectral Bounds)

Let $\mathcal{G}=(\mathcal{V},\mathcal{A})$ satisfy Assumptions 4-5. If for every arc $(i,j)\in\mathcal{A}$:

$$\frac{p_i^+}{p_j^-}\cdot\bigl(\gamma_{\mathrm{cl}}+\tfrac{1}{r}\bigr)\le 1$$

then the tube field $\boldsymbol{\Omega}(\mathbf{x},\mathbf{u})=\mathbf{P}(\mathbf{x},\mathbf{u})\bar{\boldsymbol{\Omega}}$ with $\bar{\boldsymbol{\Omega}}=r\mathcal{B}_2^n$ is locally RPI over $\mathcal{Z}$.

Remark 3 (Comparison with the Full LMI Condition)

The arc-wise spectral condition is a conservative upper-bound version of the full LMI in Theorem 2. It replaces the exact matrices $\mathbf{M}$ and $\mathbf{N}$ by norm bounds and then applies the triangle inequality. This loses directional information in the Minkowski sum, but gives a scalar certificate that is easy to check on graph arcs. In the quadrotor benchmark, the per-point weighted norms lie in $[0.279,\,0.639]$, so the verified instances remain well below the unit threshold despite the conservative bound.

Remark (Scalability)

Verification is offline and scales with the feature dimension of $\mathbf{P}(\cdot)$, not the full state-input space. For the 12-state quadrotor, disturbances enter only the 3D velocity subspace, so the effective partition dimension is $3 + 4 = 7$ (velocity + control features), not $12 + 4 = 16$. A full dense grid in this subspace is subject to the curse of dimensionality, motivating disturbance-active subspaces, adaptive clustering, or sparse graph refinement. The quadrotor uses a 30-node, 63-arc graph in the three velocity components. Online, the graph is not traversed: evaluation is one cached GP covariance query and one $n_w \times n_w$ square root, about 0.1 ms for $N_{\mathrm{gp}} \le 300$, $n_w = 3$.

Interactive Visualization: Graph-Based Finite Verification

LMI Verification (Optional Tighter Condition)

For applications requiring tighter bounds, we provide an LMI-based alternative with explicit conservatism quantification.

Theorem 4 (Scalar-Bound LMI Verification)

Let the spectral bounds $\{p_i^+, p_i^-\}$ be as in Definition 7. Define

$$\bar{M}_{ij} := \frac{p_i^+}{p_j^-}\gamma_{\mathrm{cl}}, \qquad \bar{N}_{ij} := \frac{1}{r}\frac{p_i^+}{p_j^-}.$$

If for every arc $(i,j) \in \mathcal{A}$ the LMI

$$\begin{bmatrix} \lambda_1\mathbf{I}_n - \bar{M}_{ij}^2 \mathbf{I}_n & -\bar{M}_{ij}\bar{N}_{ij}\mathbf{I}_n \\ -\bar{M}_{ij}\bar{N}_{ij}\mathbf{I}_n & \lambda_2\mathbf{I}_n - \bar{N}_{ij}^2 \mathbf{I}_n \end{bmatrix} \succeq \mathbf{0}$$

is feasible for some $\lambda_1,\lambda_2 \geq 0$ with $\lambda_1+\lambda_2\leq 1$, then the tube field is locally RPI.

Existence of Admissible Parameter Fields

We establish that the set of admissible parameter fields is non-empty and carries a partial order under the Löwner ordering, with smaller fields yielding tighter tubes. Define the admissible set

$$\mathscr{A} := \bigl\{\mathbf{P}: \mathcal{Z} \to \mathbb{S}^n_{++} \;\big|\; \text{the spectral condition holds for all } (i,j) \in \mathcal{A}\bigr\}.$$
Theorem (Existence via Isotropic Fallback)

Let $\gamma_{\mathrm{cl}} < 1$ and let $r = 1/(1 - \gamma_{\mathrm{cl}})$. Then for any $p > 0$, the constant isotropic field $\mathbf{P}(\mathbf{z}) \equiv p\,\mathbf{I}_n$ is admissible: $p\,\mathbf{I}_n \in \mathscr{A}$.

Remark (Role of the Template Inflation Factor)

The equality $\gamma_{\mathrm{cl}} + 1/r = 1$ is not coincidental: the template inflation factor $r = 1/(1-\gamma_{\mathrm{cl}})$ is the minimal inflation ensuring $r\mathcal{B}_2^n$ is RPI for $(\mathbf{A}_{\mathrm{cl}}, \mathcal{B}_2^n)$, and the isotropic fallback saturates this bound with equality. Any non-constant field must therefore satisfy $p_i^+/p_j^- \leq 1/(\gamma_{\mathrm{cl}} + 1/r) = 1$, extracting invariance margin from the local regularity of $\mathbf{P}(\cdot)$ (i.e., from $\delta_i \to 0$ on fine grids) rather than from the template headroom. In particular, the Existence Theorem guarantees $\mathscr{A} \neq \emptyset$ but the isotropic field captures no directional structure; the GP-derived field must be certified separately via the arc-wise spectral condition.

Corollary (Admissibility of GP-Derived Field)

If the GP-induced field $\mathbf{P}^{\mathrm{GP}}(\mathbf{z})=c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w(\mathbf{z})^{1/2}$ satisfies Assumption 2 and the arc-wise spectral condition holds for all $(i,j)\in\mathcal{A}$, then $\mathbf{P}^{\mathrm{GP}} \in \mathscr{A}$.

Partial Order Structure

The Löwner order generalizes "less than or equal to" from scalars to symmetric matrices. For $\mathbf{A},\mathbf{B}\in\mathbb{S}^n$:

$$\mathbf{A}\preceq\mathbf{B}\quad\Longleftrightarrow\quad\mathbf{B}-\mathbf{A}\succeq\mathbf{0}\quad\Longleftrightarrow\quad\mathbf{v}^\top\mathbf{B}\mathbf{v}\ge\mathbf{v}^\top\mathbf{A}\mathbf{v}\;\;\forall\mathbf{v}\in\mathbb{R}^n.$$

Why ordinary comparison fails. Consider $\mathbf{A}=\mathrm{diag}(5,1)$ and $\mathbf{B}=\mathrm{diag}(2,3)$. Entry-wise: $5>2$ but $1<3$. There is no consistent scalar ranking. The Löwner order resolves this by requiring the difference to be positive semidefinite in all directions simultaneously.

Partial order. Unlike scalars (which are totally ordered), matrices under $\preceq$ form only a partial order: some pairs are incomparable. But for the GP posterior covariance, data accumulation pushes the ordering consistently: $\hat{\boldsymbol{\Sigma}}_w^{(q+1)}\preceq\hat{\boldsymbol{\Sigma}}_w^{(q)}$ is guaranteed for all points.

Geometric meaning. For covariance matrices $\boldsymbol{\Sigma}_1\preceq\boldsymbol{\Sigma}_2$ (both positive definite), the ellipsoid $\boldsymbol{\Sigma}_1^{1/2}\mathcal{B}_2^n$ is contained inside $\boldsymbol{\Sigma}_2^{1/2}\mathcal{B}_2^n$ (see Proposition 2 for the proof). In our context: when the GP posterior covariance shrinks in the Löwner sense, the local tube ellipsoid shrinks with it.

Caution. For generic SPD matrices, $\mathbf{P}_1\preceq\mathbf{P}_2$ does not in general imply $\mathbf{P}_1\mathcal{B}_2^n\subseteq\mathbf{P}_2\mathcal{B}_2^n$. The containment requires the additional structure $\mathbf{P}_i=c\,\boldsymbol{\Sigma}_i^{1/2}$ with $\boldsymbol{\Sigma}_1\preceq\boldsymbol{\Sigma}_2$; see the warning box after Proposition 2.

Proposition 2 (Ordering of Admissible Fields)

Let $\boldsymbol{\Sigma}_1,\boldsymbol{\Sigma}_2:\mathcal{Z}\to\mathbb{S}^n_{++}$ with $\boldsymbol{\Sigma}_1(\mathbf{z})\preceq\boldsymbol{\Sigma}_2(\mathbf{z})$ pointwise, and let $\mathbf{P}_i := c\,\boldsymbol{\Sigma}_i^{1/2}$ for a scalar $c > 0$. If both $\mathbf{P}_1,\mathbf{P}_2$ satisfy the spectral invariance condition, then: (1) $\boldsymbol{\Omega}_1(\mathbf{z})\subseteq\boldsymbol{\Omega}_2(\mathbf{z})$ for all $\mathbf{z}$; (2) both are locally RPI but $\boldsymbol{\Omega}_1$ gives tighter tubes.

Why the hypothesis is on $\boldsymbol{\Sigma}_1 \preceq \boldsymbol{\Sigma}_2$ rather than $\mathbf{P}_1 \preceq \mathbf{P}_2$. For general SPD matrices, the Löwner ordering $\mathbf{P}_1 \preceq \mathbf{P}_2$ does not imply $\mathbf{P}_1\mathcal{B}_2^n \subseteq \mathbf{P}_2\mathcal{B}_2^n$. The matrix $\mathbf{P}_2^{-1}\mathbf{P}_1$ is generically non-symmetric, so its eigenvalues being $\le 1$ does not bound its spectral norm (largest singular value). The hypothesis $\boldsymbol{\Sigma}_1 \preceq \boldsymbol{\Sigma}_2$ with $\mathbf{P}_i = c\,\boldsymbol{\Sigma}_i^{1/2}$ provides the additional algebraic structure needed: the key matrix $\boldsymbol{\Sigma}_2^{-1/2}\boldsymbol{\Sigma}_1^{1/2}$ has singular values bounded by 1 because its $\mathbf{C}\mathbf{C}^\top$ form is SPD and $\preceq \mathbf{I}$. This is exactly the structure present in the paper, where $\mathbf{P} = c_{n,\alpha}\hat{\boldsymbol{\Sigma}}_w^{1/2}$ and the GP variance reduction gives $\hat{\boldsymbol{\Sigma}}_w^{(q+1)} \preceq \hat{\boldsymbol{\Sigma}}_w^{(q)}$ directly.

Monotone Tube Contraction

A function $f:\mathbb{R}_+\to\mathbb{R}_+$ is operator monotone if, whenever $\mathbf{A}\preceq\mathbf{B}$ (in the Löwner order), we also have $f(\mathbf{A})\preceq f(\mathbf{B})$. Here $f(\mathbf{A})$ is defined via the spectral theorem: if $\mathbf{A}=\mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^\top$, then $f(\mathbf{A})=\mathbf{V}f(\boldsymbol{\Lambda})\mathbf{V}^\top$.

Key fact (Löwner-Heinz theorem): The map $t\mapsto t^r$ is operator monotone for $r\in[0,1]$. In particular, the square root ($r=1/2$) is operator monotone: $\mathbf{A}\preceq\mathbf{B}\Rightarrow\mathbf{A}^{1/2}\preceq\mathbf{B}^{1/2}$.

Caution: For $r>1$, operator monotonicity fails. For example, $\mathbf{A}\preceq\mathbf{B}$ does NOT imply $\mathbf{A}^2\preceq\mathbf{B}^2$ in general. This is why the square root step is essential and non-trivial.

Role in this paper. From $\hat{\boldsymbol{\Sigma}}_w^{(q+1)}\preceq\hat{\boldsymbol{\Sigma}}_w^{(q)}$ (posterior variance reduces with data), operator monotonicity gives $[\hat{\boldsymbol{\Sigma}}_w^{(q+1)}]^{1/2}\preceq[\hat{\boldsymbol{\Sigma}}_w^{(q)}]^{1/2}$, and hence $\mathbf{P}^{(q+1)}\preceq\mathbf{P}^{(q)}$, which yields tube contraction.

Theorem 5 (Monotone Tube Contraction under Data Accumulation)

As data accumulate across learning epochs $q=0,1,2,\ldots$:

  1. $\hat{\boldsymbol{\Sigma}}_w^{(q+1)}(\mathbf{z})\preceq\hat{\boldsymbol{\Sigma}}_w^{(q)}(\mathbf{z})$ for all $\mathbf{z}\in\mathcal{Z}$ (variance reduction).
  2. $\mathbf{P}^{(q+1)}(\mathbf{z})\preceq\mathbf{P}^{(q)}(\mathbf{z})$ for all $\mathbf{z}\in\mathcal{Z}$ (anisotropy field shrinkage).
  3. $\boldsymbol{\Omega}^{(q+1)}(\mathbf{z})\subseteq\boldsymbol{\Omega}^{(q)}(\mathbf{z})$ for all $\mathbf{z}\in\mathcal{Z}$ (tube contraction).
Remark 5 (Invariance Re-verification)

The spectral condition is not automatically preserved under tube contraction. Since $\mathbf{P}^{(q+1)}\preceq\mathbf{P}^{(q)}$ implies both $\lambda_{\max}$ and $\lambda_{\min}$ decrease, the ratio $p_i^+/p_j^-$ may increase or decrease depending on relative rates. The condition must be re-verified at each learning epoch.

Interactive Visualization: Monotone Tube Contraction
Proposition 3 (Sufficient Condition for Invariance Preservation)

If at epoch $q$ the uniform condition $\frac{p_{\max}^{(q)}}{p_{\min}^{(q)}}\cdot(\|\mathbf{A}_{\mathrm{cl}}\|_2+1/r)\le 1$ holds, then the arc-wise condition is satisfied for all $(i,j)\in\mathcal{A}$.

Remark 6 (Practical Persistence of Invariance)

Although the arc-wise condition requires re-verification, the uniform condition often persists in practice. The noise floor $\sigma_n^2$ prevents the denominator from vanishing, and as data accumulate in high-uncertainty regions, $p_{\max}^{(q)}$ typically decreases faster than $p_{\min}^{(q)}$, improving the condition number.

Two-Time-Scale Architecture

The verification condition requires the successor $(\mathbf{x}',\mathbf{u}')$ to evaluate $\mathbf{P}'$, yet $\mathbf{x}'$ depends on $\mathbf{w}\in\mathbb{W}(\mathbf{x},\mathbf{u})$, a circular dependency. We resolve this by freezing the GP-derived field $\mathbf{P}^{(q)}(\cdot)$ during each learning epoch $q$:

  1. Train the GP to obtain $\hat{\boldsymbol{\Sigma}}_w^{(q)}$ and construct $\mathbf{P}^{(q)}=c_{n,\alpha}[\hat{\boldsymbol{\Sigma}}_w^{(q)}]^{1/2}$.
  2. With $\mathbf{P}^{(q)}$ fixed, verify invariance via the graph-based spectral condition of Theorem 3.
  3. Collect new data during operation.
  4. Increment $q$ and repeat.
Unlike the lifted fixed-point method, which recomputes set geometry through iterative polytope operations, the proposed method updates only the anisotropy matrix field. Online tube evaluation reduces to one GP covariance query and one small matrix square root per operating point ($O(N_{\mathrm{gp}}^2)$ with cached Cholesky factors, where $N_{\mathrm{gp}}$ is the capped training subset size), independent of the MPC horizon, graph size, and learning epoch count.

Computational Complexity Analysis

We now provide a detailed phase-by-phase complexity breakdown. Let $n_x$, $n_u$, $n_w$ denote the state, input, and disturbance dimensions respectively, $N_{\mathrm{gp}}$ the (capped) GP training set size, $R$ the number of latent GPs in the LMC kernel, $|\mathcal{V}|$ the number of graph vertices, and $|\mathcal{A}|$ the number of graph arcs. For the quadrotor system: $n_x{=}12$, $n_u{=}4$, $n_w{=}3$, $R{=}3$, $N_{\mathrm{gp}}{\le}300$, $|\mathcal{V}|{=}30$, $|\mathcal{A}|{=}63$.

I. Offline Phase (One-Time or Per-Epoch)

The offline phase runs once at initialization (Steps 1–2) and is repeated each learning epoch (Steps 3–4).

StepOperationCostNotes
1. Template pair Solve DARE for $\mathbf{S}_\infty$; compute $\bar{\mathbf{T}},\bar{\mathbf{T}}_w$ via~\eqref{eq:template_inflation} $O(n_x^3)$ One-time; standard Schur decomposition. For $n_x{=}12$: negligible (${\lt}1\,$ms).
2. Controller synthesis LQR gain $\mathbf{K}$ from $(A,B,Q,R)$ $O(n_x^3)$ One-time; produces $\mathbf{A}_\mathrm{cl}=\mathbf{A}+\mathbf{B}\mathbf{K}$.
3. GP training LMC kernel hyperparameter optimization via marginal likelihood $O(E_{\mathrm{gp}}\,N_{\mathrm{gp}}^3\,R\,n_w)$ $E_{\mathrm{gp}}{=}500$ L-BFGS epochs. Dominated by Cholesky factorization of the $N_{\mathrm{gp}}\times N_{\mathrm{gp}}$ kernel matrix. With $N_{\mathrm{gp}}{\le}300$: typically ${\sim}30\,$s on GPU (GPyTorch).
4. Graph construction $k$-means clustering of training data into $|\mathcal{V}|$ clusters; arcs from consecutive trajectory transitions $O(N_{\mathrm{gp}}\,|\mathcal{V}|\,d)$ $d{=}n_x{+}n_u{=}16$ feature dimensions. For $N_{\mathrm{gp}}{=}300$, $|\mathcal{V}|{=}30$: ${\lt}1\,$ms.
5. Graph verification Per-arc spectral condition: eigendecomposition of $(\mathbf{P}_i^+)^{-1}\mathbf{P}_j^-$ $O(|\mathcal{A}|\,n_w^3)$ $63$ arcs $\times$ $3{\times}3$ eigendecompositions: ${\lt}1\,$ms total.
Offline total. GP training dominates the offline cost at $O(N_{\mathrm{gp}}^3)$ per epoch. With the training set capped at $N_{\mathrm{gp}}{\le}300$, this remains tractable (${\sim}30\,$s on a consumer GPU). All other offline steps are negligible by comparison. Crucially, the template pair (Step 1) and controller (Step 2) are computed once and reused across all epochs—only the GP field $\mathbf{P}^{(q)}(\cdot)$ is updated.

II. Online Phase (Per MPC Step)

StepOperationCostNotes
1. GP covariance query Query the frozen GP covariance $\hat{\boldsymbol{\Sigma}}_w(\mathbf{z}_*)$ at the current operating point $\mathbf{z}_*$ $O(N_{\mathrm{gp}}^2)$ Uses cached Cholesky factor $\mathbf{L}$ from training: $\boldsymbol{\alpha}=\mathbf{L}^{-\top}\mathbf{L}^{-1}\mathbf{y}$ precomputed, prediction is $\mathbf{k}_*^\top\boldsymbol{\alpha}$. For $N_{\mathrm{gp}}{=}300$: ${\sim}0.1\,$ms.
2. Template evaluation $\mathbf{P}(\mathbf{z}_*)=c_{n_w,\alpha}\,[\hat{\boldsymbol{\Sigma}}_w(\mathbf{z}_*)]^{1/2}$; compute tube cross-section $\bar{\mathbf{T}}_w\,\mathbf{P}(\mathbf{z}_*)$ $O(n_w^3)$ $3{\times}3$ Cholesky: negligible.
3. Total per step $O(N_{\mathrm{gp}}^2)$ Dominated by GP kernel-vector product. Independent of MPC horizon $H$, graph size $|\mathcal{V}|$, and learning epoch $q$.
Single-query evaluation. The online cost is one GP covariance query and one small matrix square root per operating point: no set iteration, no online LMI solve, and no graph traversal. The phrase "single-query" refers to one evaluation of the frozen GP covariance surrogate; the $O(N_{\mathrm{gp}}^2)$ cost reflects the kernel computation against the capped active set, fixed at $N_{\mathrm{gp}}{\le}300$ for the reported deployment.

III. Comparison with the Lifted-RPI Baseline

Our prior lifted-RPI method iterates a set-valued fixed-point operator. Each iteration requires clipping a polytope against $N_{\mathrm{gp}}$ disturbance ellipsoids—a cost that scales as $O(N_{\mathrm{gp}}\,n_{\mathrm{facets}})$ per iteration, where $n_{\mathrm{facets}}$ grows combinatorially with iteration count. With GPU acceleration and Nyström surrogates, the full $57$-iteration pipeline converges in $\sim\!4.2\,$s on the quadrotor benchmark.

MetricLifted-RPI fixed-point baselineAnsatz-RPI (proposed)
Iterations to convergence$57$$0$ (direct evaluation)
Per-iteration cost${\sim}0.07\,$s (GPU + Nyström)N/A
Total offline time${\sim}4.2\,$s (GPU)${\sim}30\,$s (GP training)
Online query cost$O(n_{\mathrm{facets}}\,N_{\mathrm{gp}})$$O(N_{\mathrm{gp}}^2)$
Online query timeSeconds–minutes${\sim}0.1\,$ms
Set representationPolytope ($n_{\mathrm{facets}}$ grows)Ellipsoid ($n_w(n_w{+}1)/2$ parameters)
Dimensional scaling. The lifted-RPI polytope facet count grows combinatorially with dimension and iteration count, making it impractical beyond low-dimensional systems. The ansatz approach parameterizes each tube cross-section with $n_w(n_w{+}1)/2{=}6$ Cholesky entries (for $n_w{=}3$), regardless of iteration count. This fixed representation size is what enables scaling to the 12-state quadrotor.

V. Simulation and Results

We validate the anisotropic template ansatz on a full nonlinear 12-state quadrotor linearized about hover; the residual $\mathbf{w}$ is state-and-input-dependent and has three terms: (i) velocity-dependent aerodynamic drag $-( \beta_1 / m )\|\mathbf{v}\|\mathbf{v}$, (ii) actuator inefficiency $-\beta_2\,\mathbf{a}_{\mathrm{cmd}}$, and (iii) stochastic wind $\sigma_w\boldsymbol{\varepsilon}$, $\boldsymbol{\varepsilon}\sim\mathcal{N}(\mathbf{0},\mathbf{I}_3)$ ($\beta_1{=}0.15$, $\beta_2{=}0.08$, $\sigma_w{=}0.05$, $T_s{=}0.02\,$s). The pipeline proceeds through GP learning, controller synthesis, invariance verification, graph discretization, monotone contraction analysis, and method comparison.

A. Interactive Quadrotor Simulator

The embedded simulator below shows a full nonlinear 12-state quadrotor with RK4 integration at 200 Hz, PID tracking controller, anisotropic disturbance model, and real-time RPI tube visualization. Use the control panel to toggle disturbance ellipsoids, RPI tubes, drag & input tube unions, and adjust physical parameters.

Figure 1. Interactive 3D quadrotor simulator with Three.js — shows anisotropic disturbance ellipsoids, RPI tube, and tube union accumulation in real time. Open full-screen →

B. System and Controller Parameters

The closed-loop system uses a 7-layer adaptive LQR sweep (50,365 unique configurations) to jointly optimize position, velocity, attitude, rate, thrust, and torque weights. The best configuration achieves $\rho(\mathbf{A}_{\mathrm{cl}}) = 0.9918$ with Lyapunov contraction rate $\gamma = 0.9995$ and template inflation factor $r = 1/(1-\gamma) = 2{,}144$. At the primary confidence level $\alpha^* = 0.95$, the invariance norm is $\max_z\|Q_v^{1/2}P(z)\|_2 = 0.167$ (margin 83.3% below 1), and the configuration also certifies at $\alpha^* = 0.99$.

Table 1. LQR Controller Gains — Best Configuration from 7-Layer Sweep
$Q_p$$Q_v$$Q_a$$Q_r$$R_T$$R_{\tau}$$\gamma$$r$
3.13×10−46.25×10−60.250.056.25×10−50.0250.999532 144
Controller Tuning Sweep — 7-Layer Summary

A 7-layer adaptive sweep evaluated 50,365 unique LQR weight configurations across $Q_p$, $Q_v$, $Q_a$, $Q_r$, $R_T$, and $R_\tau$, covering coarse factorial (L1), attitude/rate refinement (L2), fine zoom (L3), asymmetric axes (L4), Latin Hypercube sampling in 8D (L5), ultra-fine polish (L6), and a dedicated 60×60 heatmap grid (L7).

The safest configuration achieves $\max_z\|Q_v^{1/2}P(z)\|_2 = 0.167$ (83.3% margin). The boundary (most aggressive passing) config reaches $\max = 0.99998$ (0.002% margin), $\gamma = 0.9993$.

B.1.  Pareto Sweep Analysis

The following panels characterize the full sweep landscape. Each figure is independently generated from the 7-layer result set.

Pareto: confidence vs norm

(a) $\alpha^*$ vs. $\max_z\|Q_v^{1/2}P(z)\|_2$

Pareto: contraction vs norm

(b) Contraction rate $\gamma$ vs. weighted norm

Figure 2a. (a) The Pareto frontier (blue) traces the achievable trade-off between certifiable confidence $\alpha^*$ and the invariance norm. All 40,975 passing configurations (green) cluster well below the invariance limit $\|Q_v^{1/2}P\|_2 = 1$. (b) Contraction rate vs. norm, colored by $\alpha^*$. Lower $\gamma$ (faster contraction) correlates with lower norm margin, highlighting the inherent trade-off between contraction speed and safety margin.

Alpha distribution

(c) Distribution of $\alpha^*$

Multi-alpha envelope

(d) Multi-$\alpha$ confidence envelope

Figure 2b. (c) The $\alpha^*$ histogram shows that the vast majority of evaluated configurations achieve $\alpha^* \ge 0.90$, with a pronounced spike at $\alpha^* = 0.95$ (the sweep's primary target). (d) The confidence envelope for passing configs demonstrates tight separation between $\alpha^*_{0.90}$, $\alpha^*_{0.95}$, and $\alpha^*_{0.99}$ — the 0.99 curve diverges cleanly at higher norms, quantifying the cost of increased confidence.

Layer contributions

(e) Sweep layer contributions

Heatmap smooth

(f) $\alpha^*$ heatmap over $Q_p \times Q_v$

Figure 2c. (e) Each sweep layer's contribution color-coded in ($\|Q_v^{1/2}P\|_2$, $\alpha^*$) space. L6 (polish) dominates volume owing to its 33,278 fine-grid evaluations around the global optimum, while L5 (LHS) provides broad 8D coverage that discovers isolated feasible islands. (f) The dedicated 60×60 heatmap grid (L7) shows $\alpha^*$ as a function of $\log_{10}(Q_p)$ and $\log_{10}(Q_v)$ with remaining weights fixed at their global-best values. The star marks the grid-optimal configuration. A clear ridge of high $\alpha^*$ emerges at low $Q_p$ and $Q_v$, corresponding to controllers with aggressive disturbance rejection.

Top 20 configs

(g) Top-20 configurations (bar chart)

Asymmetric z

(h) Asymmetric $z$-weight effect

Figure 2d. (g) The top-20 passing configurations ranked by $\alpha^*$, with bars showing the invariance norm at both $\alpha = 0.95$ (colored) and $\alpha = 0.99$ (gray). All top configs certify well below the invariance limit. (h) Effect of asymmetric position weighting ($Q_{p,z} / Q_{p,xy}$) on $\alpha^*$: both passing (green) and failing (red) configs are shown; values near the ratio of 1 (isotropic) generally achieve the best confidence, though moderate asymmetry remains certifiable.

Key Observation: Sweep Landscape

The sweep reveals that 81% of evaluated configurations pass the invariance test at $\alpha = 0.95$. The feasible region in LQR weight space is broad. The dense Pareto frontier and smooth heatmap landscape confirm that the invariance norm varies continuously, and the problem admits well-conditioned tuning. Layers L2 (attitude refinement) and L6 (polish) contribute the most passing configurations, consistent with the observation that secondary weights ($Q_a$, $Q_r$, $R_\tau$) have less influence on invariance than the primary position/velocity weights.

C. GP Learning — LMC vs. Independent

The Linear Model of Coregionalization (LMC, $R{=}3$ latent GPs) captures cross-correlations between velocity-axis disturbance components. RMSE comparison:

Table 2. GP Prediction Accuracy (RMSE)
TaskIndependent GPLMC ($R{=}3$)Improvement
$w_{v_x}$0.04770.0480−0.6%
$w_{v_y}$0.05240.0520+0.6%
$w_{v_z}$0.05940.0588+1.1%
LMC matches or improves per-task RMSE, while also producing a joint covariance $\Sigma(\mathbf{z})$ that captures off-diagonal correlations — the key ingredient for anisotropic template shaping.
LMC vs Independent

Figure 3. LMC vs. independent GP — joint posterior captures inter-axis correlations for anisotropic template construction.

D. Per-Point Invariance Verification

The spectral invariance condition $\|Q_v^{1/2} P(\mathbf{z})\|_2 < 1$ is verified at 100 test points, with weighted norms ranging from 0.279 to 0.639 — all well below 1.

Invariance Test — Per-Point (Theorem 2)
Field vs Constraint

Figure 4. Anisotropy field $\|\mathbf{Q}_v^{1/2}\mathbf{P}(\mathbf{z})\|_2$ across operating space — all values remain safely below 1 (invariance threshold).

E. Graph-Based Discretization (Theorem 3)

The operating space is discretized into 30 KMeans nodes with 63 edges. Lipschitz constants: $L_P = 0.283$, $L_{Pw} = 0.342$. The LP tube-sizing problem yields $\alpha^*/r \in [0.605, 0.612]$ — all 30 nodes feasible.

RPI Contraction

Figure 5. RPI contraction map — per-node tube sizing shows uniform feasibility across the graph.

F. Monotone Tube Contraction (Theorem 5)

Across 5 learning epochs (80 → 400 training points), the posterior covariance contracts monotonically and the invariance margin improves:

Table 3. Monotone Tube Contraction Across Epochs
Epoch$N_{\mathrm{train}}$$p_{\max}$$\kappa_P$Pass
0801.0507.1098/100
11600.3832.76100/100
22400.2311.68100/100
44000.1941.44100/100
Löwner Ordering Verification
$\Sigma^{(k+1)}(\mathbf{z}) \preceq \Sigma^{(k)}(\mathbf{z})$ verified for all epochs, all test points — strict monotone decrease confirmed.
Monotone Contraction

Figure 6. Monotone tube contraction — (a) tube volume, (b) $p_{\max}/p_{\min}$ evolution, (c) condition number $\kappa_P$, (d) Löwner ordering min eigenvalue, (e) spectral pass rate, (f) uniform condition.

G. Comparison: Anisotropic vs. Isotropic vs. Diagonal vs. Homothetic

We compare four template parameterizations and assess how the anisotropic ansatz reduces tube volume relative to simpler methods. The dimensional compounding effect is dramatic:

Table 4. Tube Volume Ratios — Baseline / Proposed (Anisotropic)
Method3D (velocity)7D (vel+ctrl)Amplification (7D/3D)
Isotropic1.3×2.1×1.6×
Diagonal4.3×31.2×7.3×
Homothetic195×2.1×1051.1×103
Dimensional compounding: Even modest per-axis reductions compound dramatically in the joint velocity–control subspace. In the 7D subspace ($\delta\mathbf{v}\in\mathbb{R}^3$, $\delta\mathbf{u}=\mathbf{K}\mathbf{e}\in\mathbb{R}^4$), the fixed non-adaptive homothetic baseline inflates tube volume by 2.1×105 relative to the anisotropic ansatz, a thousand-fold amplification from the 3D velocity-only ratio.
Dimensional Compounding — Velocity + Control

Figure 6b. Dimensional compounding — volume ratios (baseline/proposed) as state dimension increases from 3D (velocity only) to 7D (velocity + control), showing exponential amplification for coarser parameterizations.

G.1.  Velocity-Space Disturbance & RPI Projections

vx-vy disturbance

$v_x$–$v_y$ disturbance projection

vx-vz disturbance

$v_x$–$v_z$ disturbance projection

vy-vz disturbance

$v_y$–$v_z$ disturbance projection

Figure 7. Per-operating-point disturbance ellipses (colored by flight speed) with RPI union boundary in each velocity plane.

dT-dtphi control

$\delta T$–$\delta\tau_\phi$ control projection

evx-dT cross

$e_{v_x}$–$\delta T$ cross-space projection

evy-dtphi cross

$e_{v_y}$–$\delta\tau_\phi$ cross-space projection

Figure 8. Control- and cross-state-space RPI projections — the anisotropic template captures directional coupling between velocity errors and control authority.

G.2.  Four-Method Comparison (2D Projections)

Table 5. 2D Projection Areas (m²/s²)
ProjectionOurs (Anisotropic)HomotheticDiagonalIsotropic
Dist. $v_x$–$v_y$1.1002.8491.6361.161
RPI $e_{v_x}$–$e_{v_y}$4.27877.518.3254.911
RPI $e_{v_x}$–$e_{v_z}$2.75435.585.4763.196
RPI $e_{v_y}$–$e_{v_z}$2.68448.845.7713.540
4-method dist vx-vy

Dist. $v_x$–$v_y$

4-method rpi vx-vy

RPI $e_{v_x}$–$e_{v_y}$

4-method dist vx-vz

Dist. $v_x$–$v_z$

4-method rpi vx-vz

RPI $e_{v_x}$–$e_{v_z}$

4-method dist vy-vz

Dist. $v_y$–$v_z$

4-method rpi vy-vz

RPI $e_{v_y}$–$e_{v_z}$

Figure 9. Four-method comparison — Disturbance (left) and RPI error (right) boundaries in 3 velocity-plane projections. The anisotropic ansatz stays close to the operating region, while the fixed non-adaptive homothetic baseline gives a much larger over-approximation.

G.3.  3D Union Boundaries (Marching Cubes)

3D disturbance union

3D disturbance union mesh + cross-section insets

3D RPI union

3D RPI error union mesh + constraint ellipsoid wireframe

Figure 10. 3D $(v_x, v_y, v_z)$ union boundary meshes via marching cubes — the anisotropic set hugs the true disturbance region while baselines over-approximate.

H. Tube MPC Integration [Work in Progress]

Section Under Development
This section is currently being developed. The anisotropic RPI tubes will be integrated into a tube MPC controller with obstacle avoidance. Preliminary results exist but are being re-validated with the updated 7-D velocity+control pipeline. Check back for the full Tube MPC demonstration with trajectory tracking figures.

I. Interactive Visualizations

The following interactive (Plotly) visualizations allow 3D rotation, zoom, and toggling of individual method surfaces. Click to expand.

J. Summary Comparison Panels

Combined Summary

Figure 13. Combined summary — disturbance, RPI, and control-space projections across all methods in a single multi-panel display.

Aniso vs Iso vs Independent

Anisotropic vs. isotropic vs. independent GP

Proposed vs Rakovic

Proposed ansatz vs. fixed non-adaptive homothetic baseline

Baseline Comparison

Full baseline comparison across all parameterizations

RPI Comparison

RPI set comparison — tube tightness per method

Figure 14. Summary comparisons — the anisotropic ansatz consistently produces the tightest tubes, with the gap widening exponentially in state dimension.

RPI Tube Comparison

Figure 15. RPI tube comparison — per-method tube radius evolution and tightness analysis.

All Projections

Figure 16. All velocity-plane projections — comprehensive view of disturbance and RPI sets across every 2D projection.

VI. Conclusion

This work introduced the anisotropic template ansatz for robust positive invariance under state- and input-dependent uncertainty. A full positive-definite matrix field $\mathbf{P}(\mathbf{x},\mathbf{u})$, learned from GP posteriors, captures directional uncertainty structure that scalar and diagonal parameterizations miss. The invariance conditions (Theorems 1-4) reduce to arc-wise spectral inequalities on a finite graph; the isotropic fallback guarantees a non-empty admissible set; and posterior covariance monotonicity drives tube contraction across learning epochs (Theorem 5). Online evaluation requires only one GP covariance query and one small matrix square root per operating point, independent of horizon, graph size, and iteration count.

Future directions: (i) tighter verification via sum-of-squares relaxations; (ii) contraction-based nonlinear generalizations beyond the linearization-plus-residual decomposition; (iii) joint optimization of the feedback gain $\mathbf{K}$ and anisotropy field $\mathbf{P}(\cdot)$ for minimal tube volume; (iv) adversarial validation outside GP credible sets.

Simulations on a 12-state quadrotor confirm a $195\times$ tube-volume reduction in the velocity subspace and a $2.1{\times}10^5$ reduction in the joint velocity-control subspace relative to a non-adaptive homothetic baseline, with the advantage compounding across dimensions. The reported epochs also show contraction of $p_{\max}$ from 1.050 to 0.194, while verified online evaluation remains one GP covariance query and one small matrix square root, about 0.1 ms in the reported implementation.


Interactive companion to the extended author version of the IEEE Control Systems Letters article.