1. Introduction to Mean-Field Games
1.1 From N-Player Games to the Mean Field
Consider \(N\) portfolio managers, each controlling their allocation \(\alpha_t^i \in \mathbb{R}^d\). The portfolio value of agent \(i\) follows:
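In standard form (our reconstruction of the elided display, with \(\mu_t^N\) the empirical measure of the \(N\) states and \(b\) an individual drift), the dynamics read:

\[
dX_t^i = \Big[ b\big(X_t^i, \alpha_t^i\big) + g\big(X_t^i, \mu_t^N\big) \Big]\, dt + \sigma\, dW_t^i,
\qquad \mu_t^N = \frac{1}{N}\sum_{j=1}^{N} \delta_{X_t^j},
\]

where the \(W^i\) are independent Brownian motions.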
The interaction term \(g(x, \mu)\) captures market impact: when all agents buy the same asset, the price rises and future returns diminish. This problem is an N-player stochastic differential game whose complexity explodes with \(N\).
The founding idea of Lasry & Lions (2006) and Huang, Malhamé & Caines (2006): when \(N \to \infty\), we replace the empirical measure \(\frac{1}{N}\sum_{j}\delta_{X_t^j}\) by a deterministic probability measure \(m_t\) — the mean field. Each agent solves an optimal control problem against this distribution, and the distribution emerges from the aggregation of individual optimal strategies.
1.2 Mean-Field Nash Equilibrium
A mean-field Nash equilibrium (MFE) is a pair \((\alpha^*, m^*)\) such that:
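Written out, the two defining conditions (a standard formulation, consistent with the notation explained next) are:

\[
\alpha^* \in \operatorname*{arg\,min}_{\alpha} J(\alpha, m^*),
\qquad
m_t^* = \mathcal{L}\big(X_t^{\alpha^*}\big) \quad \text{for all } t \in [0, T].
\]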
where \(J(\alpha, m)\) is the cost of an agent using strategy \(\alpha\) when the population distribution is \(m\), and \(\mathcal{L}(X^{\alpha^*})\) is the law of the state process induced by the optimal strategy. This is a fixed-point problem in the space of probability measures.
2. McKean-Vlasov Dynamics
2.1 The McKean-Vlasov SDE
The representative agent follows a McKean-Vlasov stochastic differential equation (MV-SDE):
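In its generic form (reconstructed here to match the discussion of \(b\) and \(\sigma\) below):

\[
dX_t = b\big(X_t, \mathcal{L}(X_t), \alpha_t\big)\, dt + \sigma\big(X_t, \mathcal{L}(X_t)\big)\, dW_t,
\qquad m_t = \mathcal{L}(X_t).
\]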
The crucial specificity is that both the drift \(b\) and the diffusion \(\sigma\) depend on the law of \(X_t\), not just its value. This nonlinear coupling distinguishes McKean-Vlasov from classical SDEs.
Application in finance
For portfolio optimization, a typical specification is:
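One plausible explicit form matching the two interaction terms discussed below (our reconstruction; \(\sigma_0\) denotes a baseline volatility and is our notation):

\[
dX_t = \Big( \alpha_t - \lambda \int \alpha \, d\mu_t \Big)\, dt
+ \Big( \sigma_0 + \eta \, \mathrm{Var}_{\mu_t}[X] \Big)\, dW_t.
\]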
The term \(\lambda \int \alpha\,d\mu\) captures aggregate market impact: when the crowd invests heavily (high average of \(\alpha\) under \(\mu\)), the excess return diminishes. The volatility term \(\eta\,\text{Var}_\mu[X]\) models endogenous volatility: high dispersion of positions increases systemic risk.
2.2 Optimal Measure and Control Problem
The representative agent minimizes a cost of the form:
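In standard form, consistent with the three terms described next:

\[
J(\alpha, m) = \mathbb{E}\left[ \int_0^T \Big( \tfrac{1}{2}\,|\alpha_t|^2 + f(X_t, m_t) \Big)\, dt + g(X_T, m_T) \right].
\]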
The running cost \(f(x, \mu)\) penalizes deviations from a benchmark (e.g. \(f(x, \mu) = (x - \bar{\mu})^2\) for mean tracking), and the terminal cost \(g(X_T, m_T)\) encodes the final target. The term \(\frac{1}{2}|\alpha|^2\) is a quadratic trading cost, the cost-functional counterpart of the linear temporary market impact of Almgren-Chriss.
3. The Coupled HJB–Fokker-Planck System
3.1 PDE Formulation
The fundamental link between MFG and PDEs is as follows. The agent's optimal control problem induces a backward Hamilton-Jacobi-Bellman (HJB) equation, and the population distribution obeys a forward Fokker-Planck (FP) equation:
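The coupled system reads (standard form, matching the visual guide below):

\[
\begin{cases}
-\,\partial_t u - \nu \Delta u + H(x, \nabla u) = f(x, m) & \text{(HJB, backward)} \\
\;\;\partial_t m - \nu \Delta m - \mathrm{div}\big( m \, \nabla_p H(x, \nabla u) \big) = 0 & \text{(FP, forward)}
\end{cases}
\]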
with conditions:
- \(u(T, x) = g(x, m_T)\) — terminal condition (HJB is backward)
- \(m(0, x) = m_0(x)\) — initial condition (FP is forward)
Here \(u(t,x)\) is the value function (the optimal cost for the agent starting at \(x\) at time \(t\)), and \(H\) is the Hamiltonian of the control problem. For a quadratic control cost \(\frac{1}{2}|\alpha|^2\):
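The Hamiltonian is the Legendre transform of the control cost; for the quadratic case:

\[
H(x, p) = \sup_{\alpha \in \mathbb{R}^d} \Big( -\,\alpha \cdot p - \tfrac{1}{2}\,|\alpha|^2 \Big) = \tfrac{1}{2}\,|p|^2,
\]

attained at \(\alpha = -p\).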
The optimal control is given by the gradient of the value function: \(\alpha^* = -\nabla u\). This is the control we inject into the Fokker-Planck equation to propagate the distribution.
Visual Guide: Forward-Backward Loop
−∂ₜu − ν∆u + ½|∇u|² = f(x,m) | u(T,x) = g(x)
∂ₜm − ν∆m − div(m·∇u) = 0 | m(0,x) = m₀(x)
3.2 Hamiltonian Types
The choice of Hamiltonian determines the structure of the control problem. optimiz-rs supports the following variants:
| Type | \(H(p)\) | \(\alpha^*(p)\) | Application |
|---|---|---|---|
| Quadratic | \(\frac{1}{2}|p|^2\) | \(-p\) | Linear impact (Almgren-Chriss) |
| Linear | \(|p|\) | \(-\text{sign}(p)\) | Bang-bang control |
| PowerLaw | \(\frac{|p|^\alpha}{\alpha}\) | \(-|p|^{\alpha-2}p\) | Non-linear impact (power law) |
| Custom | User-defined | — | Proprietary models |
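To make the table concrete, here is a small NumPy sanity check (illustrative only, not the optimiz-rs API): each \(\alpha^*(p)\) entry equals \(-H'(p)\), which we verify by finite differences away from the kink of the linear Hamiltonian. The exponent `alpha` of the power law is distinct from the control \(\alpha\).

```python
import numpy as np

def hamiltonian(p, kind="quadratic", alpha=1.5):
    """H(p) for the control-cost structures in the table above."""
    if kind == "quadratic":   # H(p) = ½|p|²
        return 0.5 * p**2
    if kind == "linear":      # H(p) = |p|
        return np.abs(p)
    if kind == "powerlaw":    # H(p) = |p|^α / α
        return np.abs(p) ** alpha / alpha
    raise ValueError(kind)

def optimal_control(p, kind="quadratic", alpha=1.5):
    """Feedback control α*(p) = -∇H(p)."""
    if kind == "quadratic":
        return -p
    if kind == "linear":
        return -np.sign(p)
    if kind == "powerlaw":
        return -np.abs(p) ** (alpha - 2) * p
    raise ValueError(kind)

# Check α* = -H'(p) by central differences (p ≠ 0 so the
# linear Hamiltonian is differentiable at the test point)
p, h = 0.7, 1e-6
for kind in ("quadratic", "linear", "powerlaw"):
    dH = (hamiltonian(p + h, kind) - hamiltonian(p - h, kind)) / (2 * h)
    assert abs(optimal_control(p, kind) + dH) < 1e-4
```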
4. Particle Methods & Optimal Quantization
4.1 Particle System Approximation
To numerically solve the MFG system in high dimensions (when the finite difference grid becomes prohibitive), we approximate the distribution \(m_t\) by a system of \(N\) interacting particles:
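The particle system takes the standard form (our reconstruction, consistent with the MV-SDE above), where the law of \(X_t\) is replaced by the empirical measure of the particles:

\[
dX_t^i = b\big(X_t^i, \mu_t^N, \alpha^*(t, X_t^i)\big)\, dt + \sigma\big(X_t^i, \mu_t^N\big)\, dW_t^i,
\qquad \mu_t^N = \frac{1}{N}\sum_{j=1}^{N} \delta_{X_t^j}.
\]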
This system converges to the MFG solution in the sense of propagation of chaos (Sznitman, 1991): for any fixed \(k\), the first \(k\) particles become asymptotically independent as \(N \to \infty\), and their marginal law converges to \(m_t^*\). The convergence is quantified in the Wasserstein-2 distance \(W_2\), at a rate whose constant \(C\) depends on the dimension and the regularity of the coefficients.
4.2 Optimal Quantization
Optimal quantization (Pagès, 1998; Bally & Pagès, 2003) offers an alternative to naive Monte Carlo particles. The idea: replace \(m_t\) by a finitely supported measure:
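That is, \(m_t\) is approximated by a weighted sum of Dirac masses:

\[
\hat{m}_t = \sum_{k=1}^{K} w_k \, \delta_{x_k},
\qquad w_k \ge 0, \quad \sum_{k=1}^{K} w_k = 1.
\]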
The optimal quantizers \((x_k, w_k)\) minimize the quadratic quantization error. The crucial advantage: the error decays deterministically as \(O(K^{-2/d})\), with no Monte Carlo sampling noise, giving a considerable gain in moderate dimensions (\(d \leq 5\)).
Figure: Lloyd's algorithm for quantization. Voronoi tessellation of the quantizers; convergence \(O(K^{-2/d})\) vs \(O(N^{-1/2})\) for Monte Carlo.
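A minimal 1D sketch of Lloyd's algorithm (illustrative, independent of the optimiz-rs internals): alternate Voronoi assignment of samples to quantizers with centroid updates, then read off the weights \(w_k\) as cell frequencies.

```python
import numpy as np

def lloyd_1d(samples, k, iters=50, seed=0):
    """1D Lloyd iteration: returns quantizer locations x_k and weights w_k."""
    rng = np.random.default_rng(seed)
    centers = rng.choice(samples, size=k, replace=False)
    for _ in range(iters):
        # Voronoi cells: each sample goes to its nearest quantizer
        cells = np.argmin(np.abs(samples[:, None] - centers[None, :]), axis=1)
        for j in range(k):
            members = samples[cells == j]
            if members.size > 0:
                centers[j] = members.mean()  # centroid update
    # Final assignment gives the cell weights w_k
    cells = np.argmin(np.abs(samples[:, None] - centers[None, :]), axis=1)
    weights = np.bincount(cells, minlength=k) / samples.size
    order = np.argsort(centers)
    return centers[order], weights[order]

rng = np.random.default_rng(1)
x_k, w_k = lloyd_1d(rng.normal(size=5000), k=8)  # quantize a Gaussian cloud
```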
4.3 Optimal Transport and Wasserstein Distance
The distance between two successive distributions in the MFG iteration is measured by the Wasserstein distance. For two discrete measures \(\mu = \sum_i a_i \delta_{x_i}\) and \(\nu = \sum_j b_j \delta_{y_j}\), the optimal transport (Monge-Kantorovich) problem is:
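With weight vectors \(a\) and \(b\), the discrete problem is the linear program over transport plans \(\pi\):

\[
W_2^2(\mu, \nu) = \min_{\pi \ge 0} \; \sum_{i,j} \pi_{ij}\, \|x_i - y_j\|^2
\quad \text{s.t.} \quad \sum_j \pi_{ij} = a_i, \qquad \sum_i \pi_{ij} = b_j.
\]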
The Sinkhorn divergence (\(\varepsilon\)-entropic regularization) reduces the cost to roughly \(O(n^2/\varepsilon)\), versus \(O(n^3 \log n)\) for exact network-simplex transport, making the computation tractable inside MFG iterations.
optimiz-rs provides a \(W_1\) approximation and a stub for Sinkhorn divergence. Full regularized optimal transport integration with GPU support (via ot-rs) is on the v0.8 roadmap.
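For readers who want the Sinkhorn iterations spelled out, here is a minimal NumPy reference implementation (ours, not the optimiz-rs stub); each iteration is two matrix-vector products, hence \(O(n^2)\) work:

```python
import numpy as np

def sinkhorn(a, b, C, eps=0.05, iters=500):
    """Entropic OT: returns the regularized transport cost <P, C>.

    a, b: weight vectors of the two discrete measures; C: cost matrix.
    """
    K = np.exp(-C / eps)             # Gibbs kernel
    u = np.ones_like(a)
    for _ in range(iters):
        v = b / (K.T @ u)            # scale columns to hit marginal b
        u = a / (K @ v)              # scale rows to hit marginal a
    P = u[:, None] * K * v[None, :]  # transport plan
    return float((P * C).sum())

# Two small measures on the line; cost = squared distance,
# so the value approximates W2² up to the ε-entropic bias
x = np.array([0.0, 1.0, 2.0])
y = np.array([0.5, 1.5])
a = np.full(3, 1 / 3)
b = np.full(2, 1 / 2)
C = (x[:, None] - y[None, :]) ** 2
w2_eps = sinkhorn(a, b, C)
```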
5. Implementation in optimiz-rs
5.1 Mean-Field Module Architecture
optimiz-rs is our Rust numerical optimization library with Python bindings via PyO3. The mean_field module implements the MFG solver via finite differences with rayon parallelization:
5.2 MFG Solver Configuration
The solver is configured via MFGConfig with physical and numerical parameters:
/// Configuration for 1D Mean-Field Game solver
pub struct MFGConfig {
pub dim: usize, // Spatial dimension (currently 1)
pub nx: usize, // Grid points in space (default: 200)
pub nt: usize, // Grid points in time (default: 100)
pub x_min: f64, // Domain lower bound
pub x_max: f64, // Domain upper bound
pub time_horizon: f64, // Terminal time T
pub viscosity: f64, // Diffusion coefficient ν
pub tolerance: f64, // Convergence threshold (default: 1e-6)
pub max_iterations: usize,// Max fixed-point iterations (default: 500)
pub relaxation: f64, // Relaxation parameter ω ∈ (0,1]
}
/// Hamiltonian types for different control cost structures
pub enum HamiltonianType {
Quadratic, // H(p) = ½|p|² — linear market impact
Linear, // H(p) = |p| — bang-bang control
PowerLaw { alpha: f64 }, // H(p) = |p|^α / α — power-law impact
Custom, // User-defined via callback
}
5.3 The Backward HJB Solver
The HJB equation is solved backward in time, from \(t = T\) to \(t = 0\), using an upwind (upstream-biased) scheme for numerical stability:
/// Solve HJB backward in time: -∂ₜu - ν∆u + H(x,∇u) = f(x,m)
pub fn solve_hjb(
grid: &Grid,
m: &[Vec<f64>], // Population distribution m(t, x)
config: &MFGConfig,
hamiltonian: &HamiltonianType,
) -> Vec<Vec<f64>> {
let (nx, nt) = (config.nx, config.nt);
let mut u = vec![vec![0.0; nx]; nt]; // Value function
// Terminal condition: u(T, x) = g(x, m_T)
for i in 0..nx {
u[nt - 1][i] = terminal_cost(grid.x[i], &m[nt - 1]);
}
// Backward sweep with rayon parallelism (requires `use rayon::prelude::*;`).
// split_at_mut lets us mutate u[n] while reading u[n + 1] without
// tripping the borrow checker.
for n in (0..nt - 1).rev() {
let (head, tail) = u.split_at_mut(n + 1);
let u_next = &tail[0];
head[n].par_iter_mut().enumerate().for_each(|(i, u_val)| {
// Upwind finite differences for ∇u
let du_dx = upwind_gradient(u_next, i, grid.dx);
// Hamiltonian evaluation
let h = hamiltonian.evaluate(du_dx);
// Coupling term f(x, m)
let f = running_cost(grid.x[i], &m[n]);
// Time step: explicit Euler
*u_val = u_next[i] + grid.dt * (
config.viscosity * laplacian(u_next, i, grid.dx)
- h + f
);
});
}
u
}
5.4 The Forward Fokker-Planck Solver
The FP equation propagates the distribution \(m\) forward, using the optimal control \(\alpha^* = -\nabla u\) computed by the HJB solver:
/// Solve Fokker-Planck forward: ∂ₜm - ν∆m - div(m·Hₚ) = 0
pub fn solve_fokker_planck(
grid: &Grid,
u: &[Vec<f64>], // Value function from HJB
m0: &[f64], // Initial distribution m(0, x)
config: &MFGConfig,
hamiltonian: &HamiltonianType,
) -> Vec<Vec<f64>> {
let (nx, nt) = (config.nx, config.nt);
let mut m = vec![vec![0.0; nx]; nt];
m[0] = m0.to_vec();
for n in 0..nt - 1 {
// Forward sweep: conservative upwind scheme
for i in 1..nx - 1 {
let du_dx = central_gradient(&u[n], i, grid.dx);
let velocity = hamiltonian.gradient(du_dx); // Hₚ(∇u)
// Conservative upwind for div(m · v)
let flux = if velocity > 0.0 {
velocity * m[n][i - 1] // upwind from left
} else {
velocity * m[n][i + 1] // upwind from right
};
m[n + 1][i] = m[n][i] + grid.dt * (
config.viscosity * laplacian_m(&m[n], i, grid.dx)
+ flux_divergence(flux, i, grid.dx)
);
}
// Mass normalization: ∫m dx = 1
let total: f64 = m[n + 1].iter().sum::<f64>() * grid.dx;
if total > 0.0 {
m[n + 1].iter_mut().for_each(|v| *v /= total);
}
}
m
}
5.5 Forward-Backward Fixed-Point Iteration
Both solvers are combined in a fixed-point loop with a relaxation parameter \(\omega\) to ensure convergence:
/// Forward-backward fixed-point iteration for MFG system
pub fn forward_backward_fixed_point(
config: &MFGConfig,
m0: &[f64],
hamiltonian: &HamiltonianType,
) -> MFGSolution {
let grid = Grid::new(config);
let mut m = initialize_distribution(&grid, m0);
for iter in 0..config.max_iterations {
// Step 1: Solve HJB backward (given m)
let u = solve_hjb(&grid, &m, config, hamiltonian);
// Step 2: Solve FP forward (given ∇u)
let m_new = solve_fokker_planck(&grid, &u, m0, config, hamiltonian);
// Step 3: Relaxation m ← ω·m_new + (1-ω)·m_old
let error = l2_distance(&m_new, &m, &grid);
for n in 0..config.nt {
for i in 0..config.nx {
m[n][i] = config.relaxation * m_new[n][i]
+ (1.0 - config.relaxation) * m[n][i];
}
}
// Step 4: Check convergence
if error < config.tolerance {
return MFGSolution {
value_function: u,
distribution: m,
converged: true,
iterations: iter + 1,
};
}
}
MFGSolution { /* ... max_iter reached ... */ }
}
5.6 Python Bindings via PyO3
The MFG solver is exposed to Python with automatic NumPy ↔ Rust conversion via PyO3:
import numpy as np
from optimiz_rs import MFGConfigPy, solve_mfg_1d_rust_py
# Configure the MFG solver
config = MFGConfigPy(
nx=200, # 200 spatial grid points
nt=100, # 100 time steps
x_min=-5.0, # Domain [-5, 5]
x_max=5.0,
time_horizon=1.0, # T = 1 year
viscosity=0.1, # ν = 0.1 (moderate diffusion)
tolerance=1e-6,
max_iterations=500,
relaxation=0.5, # ω = 0.5 for stable convergence
)
# Initial distribution: Gaussian centered at x=0
x = np.linspace(-5.0, 5.0, 200)
m0 = np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)
# Running cost: quadratic tracking of the mean
f = np.zeros((100, 200)) # f(t, x) — can encode market impact
# Solve MFG system — runs entirely in Rust with rayon parallelism
u, m, iterations = solve_mfg_1d_rust_py(config, m0, f)
print(f"Converged in {iterations} iterations")
print(f"Value function shape: {u.shape}") # (100, 200)
print(f"Distribution shape: {m.shape}") # (100, 200)
print(f"Final mass: {np.trapz(m[-1], x):.6f}") # ≈ 1.0
5.7 Differential Evolution for Portfolio Optimization
Complementing the MFG solver, optimiz-rs provides a Differential Evolution (DE) optimizer ideal for portfolio optimization — a high-dimensional non-convex problem with many local minima:
from optimiz_rs import differential_evolution
# Objective: minimize negative Sharpe ratio
def neg_sharpe(weights):
returns = portfolio_returns(weights, asset_data)
return -sharpe_ratio(returns)
# Constraints: weights sum to 1, long-only
bounds = [(0.0, 1.0)] * n_assets
result = differential_evolution(
func=neg_sharpe,
bounds=bounds,
strategy="best1bin", # DE/best/1/bin
maxiter=1000,
popsize=50,
mutation=(0.5, 1.0), # Adaptive F ∈ [0.5, 1.0]
recombination=0.7, # Crossover rate CR
tol=1e-8,
seed=42,
use_rust_parallel=True, # Parallel objective via Rust + rayon
)
print(f"Optimal Sharpe: {-result.fun:.4f}")
print(f"Weights: {result.x}")
print(f"Evaluations: {result.nfev}")
optimiz-rs supports 5 mutation strategies (rand/1, best/1, current-to-best/1, rand/2, best/2), jDE parameter adaptation, and the SHADE algorithm (Success-History based Adaptive DE) with circular memory. Objective function evaluations run in parallel via rayon, releasing the Python GIL for native Rust objectives.
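For intuition on the rand/1/bin strategy, here is a minimal pure-NumPy DE loop (a toy sketch, unrelated to the Rust implementation; `de_rand_1_bin` and its defaults are our own illustrative names):

```python
import numpy as np

def de_rand_1_bin(f, bounds, popsize=30, maxiter=200, F=0.7, CR=0.9, seed=42):
    """Minimal DE/rand/1/bin: mutate with a scaled difference of two random
    members, binomially cross over, keep the better of trial and parent."""
    rng = np.random.default_rng(seed)
    lo, hi = np.array(bounds, dtype=float).T
    d = len(bounds)
    pop = rng.uniform(lo, hi, size=(popsize, d))
    cost = np.array([f(x) for x in pop])
    for _ in range(maxiter):
        for i in range(popsize):
            others = [j for j in range(popsize) if j != i]
            r1, r2, r3 = rng.choice(others, 3, replace=False)
            mutant = np.clip(pop[r1] + F * (pop[r2] - pop[r3]), lo, hi)
            cross = rng.random(d) < CR
            cross[rng.integers(d)] = True   # at least one gene from the mutant
            trial = np.where(cross, mutant, pop[i])
            c = f(trial)
            if c <= cost[i]:                # greedy selection
                pop[i], cost[i] = trial, c
    best = int(cost.argmin())
    return pop[best], cost[best]

# Sphere function: global minimum 0 at the origin
x_best, f_best = de_rand_1_bin(lambda x: float(np.sum(x**2)), [(-5, 5)] * 4)
```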
In our benchmarks, optimiz-rs DE with parallel Rust evaluations converges up to 80× faster than scipy.optimize.differential_evolution, thanks to the elimination of Python overhead and native parallelization.
6. Integration: Polarway, optimiz-rs & HFThot
6.1 The Complete Pipeline
MFG portfolio optimization in HFThot is organized in three layers, each implemented in the optimal language for its task:
- MFG solver: HJB backward · FP forward · Fixed-point iteration
- Differential Evolution: 5 strategies · Adaptive F, CR · Parallel evaluation
- Risk metrics: Sharpe · VaR · Hurst · Bootstrap
- HMM: Regime detection · Baum-Welch · Metropolis-Hastings
6.2 Polarway: The Data Pipeline
Polarway is our Rust data pipeline framework, built on Polars (fast Rust DataFrames) and Apache Arrow IPC for zero-copy transfer between processes. It collects market data from multiple sources (CCXT for CEXs, DeFi connectors, CLOB orderbooks) and prepares it for optimization:
import polars as pl
from polarway import Pipeline, ArrowBridge
# Polarway collects & normalizes multi-venue data
pipeline = Pipeline.from_config("crypto_portfolio.yaml")
df: pl.DataFrame = pipeline.run() # Returns Polars DataFrame
# Zero-copy transfer to optimiz-rs via Arrow IPC
bridge = ArrowBridge(df)
returns_matrix = bridge.to_rust_matrix("returns") # No copy!
# Feed into MFG solver: initial distribution from empirical data
m0 = np.histogram(returns_matrix[:, 0], bins=200, density=True)[0]
# Or feed into DE optimizer: risk-adjusted portfolio weights
result = differential_evolution(
func=lambda w: -risk_adjusted_return(w, returns_matrix),
bounds=[(0.0, 1.0)] * df.width,
use_rust_parallel=True,
)
6.3 Why Rust?
The choice of Rust for optimiz-rs and Polarway is not a trend accident. For MFG computations and portfolio optimization, three properties are critical:
| Property | Rust | Python (NumPy) | MFG Impact |
|---|---|---|---|
| Parallelism | rayon (data-parallel, lock-free) | GIL (single thread) | HJB on a 200×100 grid: 12 cores utilized |
| Memory | Ownership (no GC) | GC + object overhead | FP grids reused without per-step allocation |
| Safety | Compile-time verified | Runtime errors | No segfaults in 500-iteration loops |
Via PyO3, Rust functions are called from Python like native functions. The GIL is released during Rust execution, enabling true parallelism. Via WebAssembly, the same code runs in the browser for interactive HFThot demos.
6.4 Synthesis: MFG + DE for an Optimal Portfolio
The combination of MFG + Differential Evolution is particularly powerful for portfolio optimization:
- MFG solves the mean-field Nash equilibrium → provides the equilibrium distribution of positions
- DE/SHADE optimizes the portfolio in the landscape constrained by the MFG equilibrium → finds the optimal weights
- HMM detects market regime changes → adapts parameters of the MFG and DE
- Risk Metrics validates the solution → Sharpe, VaR, drawdown, Hurst exponent
# Complete MFG + DE portfolio optimization pipeline
from optimiz_rs import (
MFGConfigPy, solve_mfg_1d_rust_py,
differential_evolution, compute_risk_metrics
)
import numpy as np
# Step 1: Solve MFG equilibrium (agent interaction model)
mfg_config = MFGConfigPy(nx=200, nt=100, viscosity=0.1, relaxation=0.5)
m0 = empirical_initial_distribution(market_data)
u, m_eq, iters = solve_mfg_1d_rust_py(mfg_config, m0, running_cost)
# Step 2: Extract equilibrium constraints
eq_allocation = -np.gradient(u[0], dx)  # α* = -∇u at t = 0 (u[0] is the initial-time slice)
# Step 3: DE optimization with MFG constraints
def mfg_constrained_objective(weights):
port_return = weights @ expected_returns
# Penalize deviation from MFG equilibrium
mfg_penalty = np.sum((weights - eq_allocation[:len(weights)])**2)
risk = compute_risk_metrics(weights, returns_matrix)
return -(port_return - 0.5 * risk.volatility) + 0.1 * mfg_penalty
result = differential_evolution(
func=mfg_constrained_objective,
bounds=[(0, 1)] * n_assets,
strategy="shade", # SHADE adaptive algorithm
use_rust_parallel=True,
)
# Step 4: Validate with risk metrics
metrics = compute_risk_metrics(result.x, returns_matrix)
print(f"Sharpe: {metrics.sharpe:.3f} | MaxDD: {metrics.max_drawdown:.2%}")
print(f"VaR(95%): {metrics.var_95:.2%} | Hurst: {metrics.hurst:.3f}")
7. References
- Lasry, J.-M. & Lions, P.-L. (2006). Jeux à champ moyen. I – Le cas stationnaire. Comptes Rendus Mathématique, 343(9), 619–625.
- Huang, M., Malhamé, R.P. & Caines, P.E. (2006). Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle. Communications in Information and Systems, 6(3), 221–252.
- Carmona, R. & Delarue, F. (2018). Probabilistic Theory of Mean Field Games with Applications, Vols. I–II. Springer.
- Achdou, Y. & Capuzzo-Dolcetta, I. (2010). Mean field games: numerical methods. SIAM Journal on Numerical Analysis, 48(3), 1136–1162.
- Cardaliaguet, P. (2013). Notes on mean field games. Lecture notes, Université Paris-Dauphine.
- Pagès, G. (1998). A space quantization method for numerical integration. Journal of Computational and Applied Mathematics, 89(1), 1–38.
- Bally, V. & Pagès, G. (2003). A quantization algorithm for solving multi-dimensional optimal stopping problems. Bernoulli, 9(6), 1003–1049.
- Sznitman, A.-S. (1991). Topics in propagation of chaos. École d'Été de Probabilités de Saint-Flour, 165–251. Springer.
- Almgren, R. & Chriss, N. (2001). Optimal execution of portfolio transactions. Journal of Risk, 3, 5–40.
- Storn, R. & Price, K. (1997). Differential Evolution – A simple and efficient heuristic for global optimization over continuous spaces. Journal of Global Optimization, 11(4), 341–359.
- Tanabe, R. & Fukunaga, A.S. (2014). Improving the search performance of SHADE using linear population size reduction. IEEE CEC 2014, 1658–1665.
🔬 Explore in HFThot Lab
The full MFG Portfolio Lab: interactive HJB-FP solver, real-time DE/SHADE optimization, Nash equilibrium visualization, advanced risk metrics...
White paper: Mean Field Allocation Under Market Impact
We provide a research note for Guest and Hobbyist visitors who share their email. It ties together the HJB-FP equations, numerical resolution, DE/SHADE heuristics, and the Polarway/optimiz-rs orchestration into an actionable portfolio framework.
- HJB-Fokker-Planck fixed-point loop, stability constraints, and convergence checks
- Differential Evolution parameterization for constrained portfolio search and regime shifts
- Zero-copy data path: market state ingestion, Arrow memory layout, and DuckDB-ready research outputs
A free account (quick signup) is required to access the document. No prior approval needed.