SIMBA

SIMBA - 2025

2025Activity reportProject-TeamSIMBA

RNSR: 202424506M

Research‌ center Inria Centre at Université de Lorraine
In‌ partnership with:Université de Lorraine, CNRS
Team name:‌ Statistical Inference and Modeling for Biological Applications
In‌ collaboration with:Institut Elie Cartan de Lorraine (IECL)‌

Creation of the Project-Team: 2024 January 01

Each‌ year, Inria research teams publish an Activity Report‌ presenting their work and results over the reporting‌ period. These reports follow a common structure, with‌ some optional sections depending on the specific team.‌ They typically begin by outlining the overall objectives‌ and research programme, including the main research themes,‌ goals, and methodological approaches. They also describe the‌ application domains targeted by the team, highlighting the‌ scientific or societal contexts in which their work‌ is situated.

The reports then present the highlights‌ of the year, covering major scientific achievements, software‌ developments, or teaching contributions. When relevant, they include‌ sections on software, platforms, and open data, detailing‌ the tools developed and how they are shared.‌ A substantial part is dedicated to new results,‌ where scientific contributions are described in detail, often‌ with subsections specifying participants and associated keywords.

Finally,‌ the Activity Report addresses funding, contracts, partnerships, and‌ collaborations at various levels, from industrial agreements to‌ international cooperations. It also covers dissemination and teaching‌ activities, such as participation in scientific events, outreach,‌ and supervision. The document concludes with a presentation‌ of scientific production, including major publications and those‌ produced during the year.

Keywords

Computer Science and‌ Digital Science

A3.1. Data
A3.2. Knowledge
A3.2.3. Inference‌
A3.3. Data and knowledge analysis
A3.3.1. On-line analytical‌ processing
A3.3.2. Data mining
A3.3.3. Big data analysis‌
A6.1. Methods in mathematical modeling
A6.1.1. Continuous Modeling‌ (PDE, ODE)
A6.1.2. Stochastic Modeling
A6.1.3. Discrete Modeling‌ (multi-agent, people centered)
A6.1.4. Multiscale modeling
A6.2. Scientific‌ computing, Numerical Analysis & Optimization
A6.2.2. Numerical probability‌
A6.2.3. Probabilistic methods
A6.2.4. Statistical methods
A9.2.1. Supervised‌ learning
A9.2.2. Unsupervised learning
A9.2.5. Bayesian methods
A9.2.7.‌ Kernel methods

1 Team‌ members, visitors, external collaborators

Research Scientists

Nicolas Champagnat‌ [Team leader, INRIA, Senior Researcher‌, HDR]
Coralie Fritsch [INRIA,‌ Researcher, HDR]
Ulysse Herbach [INRIA‌, Researcher]
Edouard Strickler [CNRS,‌ Researcher]

Faculty Members

Koléhè Coulibaly Pasquier [‌UL, Associate Professor Delegation, from Sep 2025]
Sandie Ferrigno‌ [UL, Associate‌ Professor]
Anne Gégout‌‌ Petit [UL, Professor, HDR]‌
Jean-Marie Monnez [UL‌, Emeritus, HDR‌‌]
Pierre Vallois [UL, Professor,‌ HDR]
Denis Villemonais‌ [Univ. Strasbourg,‌‌ Professor, HDR]
Sophie Wantz-Mézières [UL‌, Associate Professor]‌

Post-Doctoral Fellow

Elie Cerf‌‌ [INRIA, Post-Doctoral Fellow]

PhD Students‌

Sophie Baland [UL‌]
Virgile Brodu [‌‌UL, ATER, from Sep 2025]‌
Virgile Brodu [UL‌, until Aug 2025‌‌]
Mathilde Gaillard [INRIA]
Vincent Kagan‌ [UL]
Juan‌ Mardomingo Sanz [UL‌‌, from Oct 2025]
Anouk Rago [‌UL, ATER,‌ until Oct 2025]‌‌
Vidhi Vidhi [INRIA]

Interns and Apprentices‌

Lorenzo Boussion [ENS‌ Paris-Saclay, Intern,‌‌ from Apr 2025 until Jul 2025]
Roxane‌ Cellier [INRIA,‌ Intern, from Apr‌‌ 2025 until Sep 2025]
Juan Mardomingo Sanz‌ [UL, Intern‌, from Apr 2025‌‌ until Sep 2025]
Thibaut Pannet [INRIA‌, Intern, from‌ May 2025 until Aug‌‌ 2025]
Mattheo Rapenne [UL, Intern‌, from Apr 2025‌ until Aug 2025]‌‌

Administrative Assistants

Emmanuelle Deschamps [INRIA, until‌ Mar 2025]
Ouiza‌ Herbi [INRIA,‌‌ from Mar 2025]

2 Overall objectives

SIMBA‌ is a joint team‌ of Inria, CNRS and‌‌ University of Lorraine, within the Institut Élie Cartan‌ de Lorraine (IECL), UMR‌ 7502 CNRS-UL, laboratory in‌‌ mathematics, of which Inria is a strong partner.‌ The team is composed‌ of applied mathematicians whose‌‌ research interests mainly concern probability and statistics with‌ applications in biology and‌ medicine.

Many fundamental questions‌‌ and applications in medicine and biology critically rely‌ on our capacity to‌ construct, estimate, analyse and‌‌ simulate complex mathematical models. They can aim at‌ building predictions and decision‌ making processes upon heterogeneous,‌‌ noisy, incomplete or inconsistent data, at improving the‌ understanding of complex phenomena‌ involving several interacting subsystems‌‌ that can only be calibrated separately, or at‌ supplying a priori information‌ on phenomena that cannot‌‌ be reproduced in laboratory experiments or for which‌ data are too costly‌ to collect. In the‌‌ past years, these models have been gradually refined,‌ taking into account more‌ interactions or dependencies between‌‌ different components, more heterogeneity between subsystems and a‌ wider range of time‌ or space scales. Along‌‌ with this gradual complexification, the importance of stochasticity‌ has been recognized as‌ fundamental in many biological‌‌ or medical studies, either to take into account‌ intrinsic randomness in biological‌ processes, to evaluate confidence‌‌ in a model’s parameters or predictions, or to‌ take into account randomness‌ or uncertainties in environmental‌‌ conditions. In parallel, the specificities of bio-medical data,‌ which are typically high‌ dimensional, heterogeneous, correlated and‌‌ with few observations, and their gradual increase pose‌ new statistical challenges in‌ terms of classification, prediction,‌‌ variable selection or streaming‌ data analysis.

Our expertise gathers a large spectrum‌ of mathematical domains, ranging from statistics to stochastic‌ modeling and analysis. We share a common experience‌ and dedication to interactions with other sciences (biology,‌ medicine) and interested parties (physicians, clinical researchers in‌ the medical domain, start-ups). Our application domains are‌ medicine, epidemiology, systems biology, ecology and evolution. The‌ specificity of our group is our capacity to‌ use a broad range of tools to answer‌ practical statistical, modeling and analytical questions posed by‌ collaborators from biology or medicine.

We believe that‌ an interdisciplinary approach is crucial to answer questions‌ posed by biologists or physicians, but can also‌ bring original mathematical questions requiring the development of‌ new theoretical tools which may apply to a‌ broader range of application domains. Our ambition is‌ therefore both to follow a bottom-up research program,‌ where we tackle practical modeling or statistical questions‌ posed by practitioners, and a top-down approach where‌ we develop new mathematical tools to study general‌ models and questions from biology and medicine. Part‌ of our work is purely mathematical, but always‌ motivated by biological applications.

Statistical and stochastic modeling‌ are central to our project. The range of‌ mathematical models we study is large, but they‌ all share common features: we are mainly interested‌ in dynamical models of biological populations with interactions‌ (in a general sense). In our models, populations‌ can be composed of individuals (in ecology, evolution‌ or epidemiology), cells (e.g. bacteria in ecology or‌ tumor cells in medicine), genes or proteins (in‌ systems biology), or populations in the statistical sense‌ (e.g. patients in epidemiology). Interactions or coupling can‌ occur within a population or between species (e.g.‌ in evolution), between cells (e.g. in oncology), between‌ genes or proteins (intracellular networks), with an environment‌ (e.g. in tumor growth, where environmental conditions are‌ linked to medical treatment, or in ecology) or‌ between high-dimensional statistical variables (e.g. clustering and variable‌ selection in epidemiology). In biology and medicine, models‌ were primarily developed to understand phenomena at fixed‌ time and space scales. Today, more and more‌ models aim at studying phenomena at large scales‌ resulting from delicate coupling and interactions between scales.‌ Descriptions at small scales are typically high dimensional‌ and often involve stochastic features, or a combination‌ of stochastic and deterministic features (hybrid models). Our‌ research project aims at studying such complex systems‌ using analytical tools to construct: 1. coarse-grained representations‌ of small scale features (through averaging, ergodic limits,‌ homogenization, mean field models...); 2. specific numerical methods,‌ possibly based on coarse-grained representations, to efficiently bridge‌ the gap between different time and space scales;‌ 3. appropriate statistical inference or learning tools often‌ based on limited or partial observation, in order‌ to make predictions.

3 Research program

The research‌ challenges we present in this section are mainly‌ theoretical or methodological. All of them are motivated‌ by biological or medical applications and provide a‌ wide methodological toolbox that can be combined to answer biological or medical‌ questions.

3.1 Stochastic modeling‌ for health

The modeling‌‌ issues we address in medicine aim at understanding‌ fundamental mechanisms of cancer‌ development, understanding how cells‌‌ make decisions through gene expression and bringing new‌ insights on the evolution‌ of telomere length distribution‌‌ with age and across a population. Telomeres are‌ non-coding regions of repetitive‌ nucleotide sequences located at‌‌ each end of chromosomes. In human, they shorten‌ at each cell division‌ and it is known‌‌ that short telomere lengths are statistically linked to‌ age related diseases. These‌ transversal applications are described‌‌ in Section 4.

Population dynamics of tumor‌ cells have been modeled‌ in numerous works, either‌‌ by deterministic models (ordinary differential equations or‌ ODEs and partial differential‌ equations or PDEs), or‌‌ stochastic ones, either discrete (birth and death‌ or branching processes,‌ individual-based models in infinite‌‌ dimension) or continuous (stochastic differential equations or‌ SDEs). Branching processes or‌ individual-based models are also‌‌ fundamental tools to study telomere length dynamics. Concerning‌ gene networks, in addition‌ to classical models (e.g.‌‌ Gaussian graphical models or deterministic systems with external‌ noise), we develop new‌ models (see Section 3.2.3‌‌) based on PDMPs (piecewise-deterministic Markov processes‌). Our team gathers‌ experts of all these‌‌ classes of models, both from the analytical, statistical‌ and numerical simulation points‌ of view. In most‌‌ applications we have in mind, we need to‌ combine within-cell dynamics (such‌ as telomere shortening or‌‌ gene expression) with cell population dynamics, leading to‌ multiscale and/or multicomponent hybrid‌ models. Multicomponent models are‌‌ also ubiquitous in medicine when one takes into‌ account latent variables such‌ as the genealogical tree‌‌ of mutations within a tumor when only observations‌ of clonal population sizes,‌ that is sizes of‌‌ populations of cells sharing the same genetic material,‌ are available.

3.1.1 Links‌ between macroscopic models and‌‌ stochastic microscopic processes

For multiscale models, it is‌ often relevant to distinguish‌ what could be called‌‌ the microscopic level (i.e. the level of single‌ individuals), the macroscopic level‌ (i.e. the level of‌‌ large population densities) or the intermediate mesoscopic level‌ (i.e. a level of‌ population densities, but where‌‌ demographic stochasticity cannot be neglected). Microscopic models can‌ be for example stochastic,‌ individual-based models, mesoscopic models‌‌ can be SDEs and macroscopic models are usually‌ models of population densities,‌ such as ODEs, PDEs‌‌ or PDMPs. Many biological questions are stated at‌ the macroscopic level, and‌ the main modeling issue‌‌ lies in the appropriate way to incorporate meso-‌ or microscopic features in‌ the description of the‌‌ macroscopic scale.

Typically, individual-based models are models where‌ the population state at‌ time $t$ is described‌‌ as a counting measure $ν_{t} = {\sum‌}_{i = 1}^{{N‌}_{t}} δ_{x_{i‌‌} (t)}$ where $N_{t}$ is the‌ number of individuals alive‌ at time $t$ ,‌‌ and $x_{i} ( t)$ is the‌ characteristic of the $i‌$ -th individual (e.g. phenotype,‌‌ mass, size, length of‌ telomeres, age...), belonging to some set $𝒳$ .‌ The dynamics is strongly dependent on the precise‌ phenomenon to be modeled, but in the simplest‌ cases, ${(ν_{t})}_{t \geq 0‌}$ is a Markov jump process on point measures‌ on $𝒳$ whose infinitesimal generator has the following‌ form:

L ϕ (﻿​﻿﻿ ν) = \underset{𝒳^{2}}{\int​‌﻿﻿ \int} (​​﻿﻿ ϕ (ν +​​​‌ δ_{y}) -﻿​﻿﻿ ϕ (ν)​‌﻿﻿) b (x​​﻿﻿, ν; d​​​‌ y) ν (﻿​﻿﻿ d x) +​‌﻿﻿ \int_{𝒳} (ϕ​​﻿﻿ (ν - {δ​​​‌}_{x}) - ϕ﻿​﻿﻿ (ν))​‌﻿﻿ d (x,​​﻿﻿ ν) ν (​​​‌ d x),﻿​﻿﻿

where $b (‌ x, ν; d y)$ is‌ the infinitesimal birth rate of an individual $y‌$ from an individual $x$ in the population $ν‌$ and $d (x, ν)$ is‌ the death rate of an individual $x$ in‌ the population $ν$ .

We have developed a‌ strong expertise in the derivation of simplified macroscopic‌ models from complex microscopic effects when there is‌ a strong separation between time and/or space scales.‌ Mathematically, this requires us to encode the scale‌ separation through scaling parameters and to solve an‌ asymptotic analysis problem, which can be averaging (slow-fast‌ dynamics, singular perturbation) 57, 2, homogeneization‌ 48, 65, concentration limits 52,‌ 55, 56 or more generally parameter scaling‌ problems where the scaling parameter has a biological‌ meaning 54, 49. For individual-based models,‌ the simplest parameter scaling that can be considered‌ is a large population asymptotic encoded with a‌ parameter $K$ , where the state of the‌ population is modified as $ν_{t}^{K} =‌ \frac{1}{K} \sum_{i = 1}^{N_{t‌}} δ_{x_{i} ( t)}$ and the‌ generator (1) as

L^{K} ϕ​‌﻿﻿ (ν^{K})​​﻿﻿ = K \underset{𝒳^{2}}{\int \int​​​‌} [ϕ (ν﻿​﻿﻿ + \frac{1}{K} {δ​‌﻿﻿}_{y}) - ϕ (​​﻿﻿ ν)] b (​​​‌ x, ν^{K﻿​﻿﻿}; d y)​‌﻿﻿ ν^{K} (d​​﻿﻿ x) + K​​​‌ \int_{𝒳} [ϕ (ν﻿​﻿﻿ - \frac{1}{K} {δ​‌﻿﻿}_{x}) - ϕ (​​﻿﻿ ν)] d (​​​‌ x, ν^{K﻿​﻿﻿}) ν^{K} (​‌﻿﻿ d x),​​﻿﻿

leading to mean-field‌ macroscopic models when $K \to \infty$ 54.‌ More complex scalings can involve rare or small‌ mutations combined with long-time scalings, complex local interactions‌ as in 48 or multiscale phenomena (e.g. within‌ cell dynamics combined with the dynamics of populations‌ of cells, like for telomeres). Each problem usually‌ requires new methodological development.

3.1.2 Numerical analysis

Sometimes,‌ it is not possible to construct a simplified‌ macroscopic description of complex, multiscale biological phenomena. In‌ such cases, we need to rely on numerical‌ simulations. More generally, numerical simulations are very helpful‌ to supply a priori information on phenomena that are difficult to reproduce‌ in laboratory experiments or‌ on large-scale models that‌‌ involve several interacting elements.

Despite increasing computing facilities,‌ existing numerical schemes are‌ rarely adapted to individual-based‌‌ models. Better numerical approaches based on a deeper‌ understanding of their multiscale‌ features would significantly reduce‌‌ computational costs and yield reliable error estimates. This‌ is one of our‌ motivations in studying model‌‌ reduction through asymptotic analysis as described above. We‌ propose to design numerical‌ schemes taking into account‌‌ local extinction or local deterministic approximation, or develop‌ hybrid methods relying on‌ duality methods based e.g.‌‌ on the Poisson representation of birth and death‌ processes, which allows to‌ switch between microscopic and‌‌ mesoscopic models when the population size crosses a‌ threshold.

3.2 Analysis of‌ biological and medical data‌‌

Modern medicine is turning to highly personalized approaches,‌ and a major challenge‌ is to design and‌‌ develop a new generation of techniques to assist‌ prevention, diagnosis, prognosis and‌ therapy. A major difficulty‌‌ is the integration and exploitation of data which‌ are often high-dimensional, heterogeneous,‌ incomplete or inconsistent, to‌‌ build predictions and decision making processes. The main‌ part of our research‌ in statistical learning aims‌‌ to develop operational tools for the analysis of‌ data from our collaborations‌ with biologists and physicians.‌‌ Another part of our project builds upon the‌ sophisticated models described in‌ Section 3.1, for‌‌ which specific inference tools need to be designed.‌

3.2.1 Statistical learning, regression‌

We want to develop‌‌ methodologies that take into account the specificities of‌ biological data: they are‌ often high dimensional and‌‌ correlated (for instance multiomics data, such as genome,‌ proteome or transcriptome) but‌ with few observations (patients)‌‌ and with missing data. Variable selection is of‌ particular interest to study‌ the link between an‌‌ outcome (the occurence of an illness for instance)‌ with covariates or to‌ infer partial correlations between‌‌ several variables for instance to study quantitative microbiome‌ data. For instance, in‌ variable selection, we propose‌‌ to study the theoretical guarantees of our methods‌ 42, 72 and‌ to extend them to‌‌ other models of dependencies such as mixtures of‌ quantitative or qualitative covariates‌ with dependencies between the‌‌ covariates. In regression, we wish to tackle the‌ challenge not only to‌ select the covariates related‌‌ to an event (illness, death...), but also to‌ understand which configuration of‌ these covariates triggers its‌‌ occurrence.

We also develop goodness-of-fit tests to assess‌ the different assumptions of‌ a (possibly heteroscedastic) regression‌‌ model. Most of them are “directional” in that‌ they detect departures from‌ a given assumption of‌‌ the model. Other tests are “omnibus” in that‌ they assess whether a‌ model fits a dataset‌‌ on all its assumptions. We focus on the‌ task of choosing the‌ structural part of the‌‌ regression function because it contains easily interpretable information‌ about the studied relationship.‌ Among the large number‌‌ of existing tests, we consider nonparametric tests which‌ are all based on‌ generalizations of the Cramér-von‌‌ Mises statistic. To perform‌ these goodness-of-fit tests, we develop the R package‌ cvmgof41, 1, an easy-to-use tool‌ which allows practitioners to compare the implemented tests.‌ In the future, we plan to enrich the‌ cvmgof package with tests concerning the other assumptions‌ of a regression model such as the functional‌ form of the variance or the additivity of‌ the random error term, using directional tests such‌ as 67. Another perspective is to develop‌ a similar tool for other statistical models widely‌ used in biostatistics such as generalized linear models.‌

3.2.2 Signal or online data analysis

We develop‌ tools for the analysis of online data, which‌ are now very frequent in the health domain‌ (recording of cardiac signals, physiological measurements via connected‌ objects...). The purpose is either the update of‌ estimation parameters for models with sequential arrival of‌ new data, or the online detection of change-points‌ in temporal signals. For the first purpose, stochastic‌ algorithms are essential tools 50, which allow‌ one to approximate eigenvectors in a stepwise manner‌ 83, 82, 86. We focus‌ on several incremental procedures for regression and data‌ analysis like PCA (Principal Component Analysis) 84 and‌ linear and logistic regressions on standardised data in‌ order to avoid numerical explosion 78. Our‌ aim is to apply these results to other‌ methods related to PCA, such as multiple factorial‌ analyses, partial PCA, and to learning (classification, regression,‌ event scores).

Change-point detection is of particular interest‌ in e-health, for example for detecting changes in‌ the health status of the elderly or people‌ at risk of a disease. We plan to‌ develop tools for online change-point detection using score-based‌ CUSUM statistics. Our aim is to use the‌ beginning of the signal to build simulation-based thresholds‌ that have to be crossed for the detection‌ of the change-point. We also want to propose‌ new stopping rules adapted to the characteristics of‌ the signal. These questions may be relevant for‌ a large part of the applications we shall‌ develop with physicians.

3.2.3 Network inference

Inferring networks‌ from data is often a crucial step for‌ understanding biological interactions, such as regulatory links between‌ genes within individual cells 73, 47 or‌ communication relationships between bacteria within a population.

Concerning‌ gene regulatory networks, we are interested in so-called‌ single-cell data, where mRNA levels are measured individually‌ in many cells rather than being population-averaged, revealing‌ the intrinsically stochastic “transcriptional bursting” phenomenon. In previous‌ work 73, we described a promising strategy‌ in which the network inference problem is seen‌ as a calibration procedure for a PDMP model‌ that is able to fit real single-cell data‌ 10. In the simplest version of this‌ model, the state of each gene $i \in‌ {1, ..., n}$ at‌ time $t$ is described by its promoter state‌ $E_{i} (t) \in {0‌, 1}$ , the quantity of transcribed RNA $Y_{i} (‌ t) \in {ℝ‌}_{+}$ and of translated‌‌ proteins $X_{i} ( t) \in {ℝ‌}_{+}$ , where

\begin{matrix} {E﻿﻿﻿‌}_{i} (t)﻿‌​‌ jumps from 0 to﻿​​﻿ 1 at rate {k​​​‌}_{on, i} (﻿﻿﻿‌ X (t)﻿‌​‌) and from 1﻿​​﻿ to 0 at rate​​​‌ k_{off, i﻿﻿﻿‌}, \\ {\dot{Y}}_{i﻿‌​‌} (t) =﻿​​﻿ s_{0, i​​​‌} E_{i} (t﻿﻿﻿‌) - d_{0﻿‌​‌, i} Y_{i﻿​​﻿} (t),​​​‌ {\dot{X}}_{i} (﻿﻿﻿‌ t) = {s﻿‌​‌}_{1, i} {Y﻿​​﻿}_{i} (t)​​​‌ - d_{1,﻿﻿﻿‌ i} X_{i} (﻿‌​‌ t) . \end{matrix}

3.2.3

Interactions between genes in‌ the network are encoded‌ in functions $k_{on‌‌, i} (x) ≪ k_{off‌, i}$ which correspond‌ to burst frequencies. We‌‌ plan to develop dynamical models that fully exploit‌ the particular time structure‌ of the data: as‌‌ cells have to be killed for measurements, such‌ data do not consist‌ of cell trajectories but‌‌ rather independent samples of time-varying multivariate distributions. As‌ the number of genes‌ can be large (‌‌ $\approx 2 \times {10}^{5}$ in humans), we‌ will also investigate variational‌ methods, which generalize the‌‌ usual expectation-maximization (EM) algorithm, as a relevant way‌ to make the inference‌ procedure both robust and‌‌ scalable.

We also develop methods to infer gene‌ networks from dynamic gene‌ expression data within a‌‌ collaboration with CHRU Strasbourg. The goal here is‌ to design new models‌ and inference methods adapted‌‌ to the prediction of the outcome of biological‌ intervention experiments such as‌ “gene knock-down”, in order‌‌ to identify therapeutic targets that could be experimentally‌ studied.

3.2.4 Inference for‌ stochastic processes

In our‌‌ collaborations with practitioners, we are led to infer‌ specific quantities of interest‌ for stochastic dynamical models‌‌ of various classes, such as the speed of‌ telomere shortening using multitype‌ branching processes; the growth‌‌ rate of clonal populations and their phylogenetic tree‌ using heterogeneous population growth‌ models in cancer; links‌‌ between gene expressions using PDMPs or the speed‌ of propagation of fungi‌ using stochastic models with‌‌ latent variables given by the solution of a‌ partial differential equation (see‌ our main applications in‌‌ Section 4). For all these sophisticated stochastic‌ processes, statistical inference raises‌ many open, difficult questions.‌‌ In particular, we plan to develop inference tools‌ for general classes of‌ models such as PDMP,‌‌ bifurcating Markov processes or branching processes. More generally,‌ the inference of dynamical‌ models relies strongly on‌‌ their long-time behavior, for which the team has‌ a strong expertise (see‌ Section 3.3). Most‌‌ often in biology, models rely on latent variables,‌ that may follow complex‌ dynamics as in the‌‌ examples above. The inference of this kind of‌ models requires us to‌ develop efficient Bayesian algorithms,‌‌ EM like algorithms and variational methods.

3.3 Stochastic‌ modeling for ecology and‌ evolution

In ecology, we‌‌ are specifically interested in‌ theoretical challenges in conservation biology (the branch of‌ biology dealing with extinction and survival of species)‌ and in the response of ecosystems to environmental‌ perturbations such as climate change, anthropization or niche‌ construction. In evolution, we have a strong expertise‌ in the study of the long term evolution‌ of biological populations using approximate models based on‌ various biological assumptions. Although this application domain is‌ different from the one of Section 3.1,‌ the questions we address are close from the‌ mathematical point of view.

3.3.1 Links between macroscopic‌ and microscopic eco-evolutionary models

We develop here similar‌ ideas as in Section 3.1.1, but the‌ mathematical questions are of different nature for models‌ of tumor growth in medicine than for population‌ models in ecology or evolution, because in the‌ first case one wants to capture transitory behavior‌ (e.g., in growing populations for tumor growth), whereas‌ in the second case, one usually models long‌ term evolution assuming that the ecological dynamics is‌ in a stationary (quasi-equilibrium) state and that evolution‌ acts slowly.

In addition to the various classes‌ of models described in Section 3.1.1, which‌ are also relevant for ecology and evolution, some‌ other types of models are well-developed in these‌ domains, such as Dawson–Watanabe processes 62, Fleming–Viot‌ processes 68 or a particular class of PDMPs‌ called switched dynamical systems used to model abruptly‌ changing environments. The general goal of designing macroscopic‌ models from complex multiscale models through parameter scalings‌ extends to these new classes of models. For‌ example, Fleming-Viot processes appear as the fast dynamics‌ in the long term evolution of biological populations‌ under assumptions of small mutations and large population‌ 2.

In ecology, our motivations are to‌ highlight the biological assumptions underlying different classes of‌ macroscopic models, or to take into account at‌ the macroscopic scale complex local interactions between individuals.‌ In evolutionary biology, the time scales involved are‌ so long that it is hard to observe‌ experimentally evolutionary phenomena such as diversification. Mathematical analysis‌ of models is therefore of great importance, e.g.‌ to construct approximate models allowing to predict long‌ term evolution of biological populations.

3.3.2 Adaptive dynamics:‌ concentration limits

Asymptotic analysis is particularly useful in‌ the branch of evolutionary biology called adaptive dynamics‌. This biological theory which studies the interplay‌ between ecological interactions and long term evolution was‌ developed in the mid 90's 80, 63‌ and provides theoretical ecologists with useful tools to‌ deduce evolutionary patterns from ecological parameters (directional evolution‌ through canonical equations and diversification through evolutionary branching‌).

So far, two mathematical approaches to justify,‌ study and improve these tools have been developed:‌ a deterministic one, based on PDE models 64‌, 88, 79 and a probabilistic one,‌ based on individual-based models 52, 56.‌ Both approaches are concentration limits, aiming to construct‌ approximate models where population densities are replaced by‌ Dirac masses representing coexisting sub-populations. They both can be seen as particular‌ parameter scalings on individual-based‌ models of the form‌‌ (2), combining large population scaling with‌ scalings of rare mutations‌ and/or small mutations.

However,‌‌ the adaptive dynamics toolbox and the existing mathematical‌ approaches are criticized because‌ they are based on‌‌ unrealistic biological assumptions on the scales involved in‌ the process 92,‌ 89. Mathematical analysis‌‌ is needed both to quantify the underlying assumptions‌ on scales and to‌ propose alternative models based‌‌ on more realistic assumptions. For example, we plan‌ to design PDE models‌ of adaptive dynamics allowing‌‌ for local extinctions of populations 74, 81‌. Similarly, the actual‌ occurrence of evolutionary branching‌‌ in sexual populations is debated 92 and our‌ goal is to shed‌ light on these questions‌‌ using asymptotic analysis starting from individual-based models.

3.3.3‌ Population dynamics with absorption‌ and quasi-stationary distributions

In‌‌ conservation biology, it is fundamental to quantify the‌ chances of survival of‌ species in a given‌‌ environment. In addition, the observed biological populations are‌ intrinsically conditioned to be‌ non-extinct, which introduces an‌‌ observation bias that is rarely taken into account.‌ For a given model‌ of population dynamics, it‌‌ is therefore important to develop tools allowing to‌ study the population size‌ before extinction and to‌‌ quantify its extinction probability in a given time‌ window. When the population‌ size is stable during‌‌ long time intervals before extinction, it can be‌ described by a quasi-stationary‌ distribution (QSD), defined as‌‌ a stationary distribution conditionally on non-extinction. The QSD‌ also allows to quantify‌ the population extinction rate.‌‌

Our research program on this topic builds on‌ recent works of the‌ team 59, 4‌‌, where we developed probabilistic criteria for the‌ large time convergence of‌ conditional distributions of stochastic‌‌ population processes, that proved to apply to a‌ wide range of stochastic‌ processes. These works open‌‌ many perspectives. We focus here on the methodological‌ ones and will mention‌ some numerical issues in‌‌ Section 3.3.4 below. A first question is to‌ obtain criteria for the‌ convergence of conditional distributions‌‌ for weaker distances than in 59, 4‌: instead of the‌ total variation distance, we‌‌ study convergence in Wasserstein distances. This is particularly‌ relevant for PDMPs or‌ for infinite dimensional processes‌‌ such as individual-based models, where coupling properties are‌ not strong enough to‌ expect convergence in total‌‌ variation. We also want to study other questions‌ related to QSDs: Can‌ we characterize the speed‌‌ of convergence of conditional distributions to a QSD?‌ Is it possible to‌ study the path to‌‌ extinction of a stochastic population process, in order‌ to characterize the parameters‌ improving their survival (e.g.‌‌ for protected species) or their extinction (e.g. for‌ pests in agronomy)? What‌ can be said about‌‌ the genealogy of populations before extinction?

3.3.4 Numerical‌ analysis

The challenges detailed‌ here are related to‌‌ those described in Section 3.1.2. Ecological or‌ evolutionary models pose specific‌ numerical problems that we‌‌ want to tackle. For‌ example, in the numerical simulation of individual-based models‌ like (1), when the population size‌ is small, randomness cannot be neglected and exact‌ algorithms have to be used; when the population‌ size is large, such algorithms become too costly‌ and one would like to take advantage of‌ deterministic approximations like those we developed in 53‌, 70. The error analysis of such‌ hybrid numerical schemes is difficult 40, 76‌ and the case of spatially or trait-structured individual-based‌ models is still largely open.

We are also‌ interested in numerical methods to approximate QSDs, among‌ which the most developed are particle methods 91‌, 58 and stochastic algorithms such as self-interacting‌ processes 44, 43. The analysis of‌ these algorithms relies on long-time analysis of particle‌ or nonlinear systems and our research project on‌ QSDs detailed above will be of great help.‌ In particular, Wasserstein distances are known to be‌ well adapted to the study of convergence of‌ particle systems thanks to their tensorization properties. We‌ also plan to develop new numerical approaches based‌ on the stationary distribution of approximating processes, such‌ as those obtained by central limit theorems for‌ stochastic processes 60.

4 Application domains

We‌ have described in the last section our theoretical‌ expertise and several mathematical challenges. However, our main‌ motivation comes from our interactions with biologists, physicians‌ or clinical researchers. Most often, these interactions involve‌ several of the methodological tools developed above, that‌ need to be combined for precise biological goals.‌ Our strategy is to establish collaborations with few‌ groups of biologists and physicians, that allow us‌ to tackle ambitious, long term projects. In this‌ section, we illustrate the different domains of application‌ mentioned in the previous section by describing several‌ ongoing pluridisciplinary projects that involve large subgroups of‌ the team.

4.1 Tumor growth and heterogeneity

4.1.1‌ Reconstruction of tumor heterogeneity

Targeted therapies represent a‌ real advance in the treatment of patients with‌ cancer. Most of these therapies are kinase inhibitors‌ and require precise analysis of tumor DNA mutations‌ to ensure the absence of primary resistance. Indeed,‌ tumors are often genetically heterogeneous with the presence‌ of many subclones, but they release “circulating” cell-free‌ DNA (cfDNA) that can be directly extracted from‌ basic blood samples: as measurement sensitivity improves, such‌ liquid biopsies increasingly appear as a mirror of‌ tumor heterogeneity. In this context, we are taking‌ a promising statistical approach to analyze longitudinal cfDNA‌ data, with the purpose of gaining a deeper‌ understanding of the mechanism by which resistance develops‌ in individual patients. While addressing the standard problem‌ of reconstructing the associated phylogenetic tree, this approach‌ also describes the production of cfDNA from the‌ temporal dynamics of cells, in order to best‌ exploit the longitudinal structure of the data. This‌ is a project in collaboration with physicians Jean-Louis‌ Merlin (Institut de Cancérologie de Lorraine), Alexandre Harlé‌ (Institut de Cancérologie de Lorraine) and Erwan Pencreac'h (CHRU Strasbourg). We currently‌ also have another ongoing‌ project on a tumor‌‌ heterogeneity reconstruction for chronic lymphocytic leukemia in collaboration‌ with Laurent Vallat (CHRU‌ Strasbourg).

4.1.2 Evolution of‌‌ low-grade gliomas

We have an ongoing collaboration with‌ the Centre de Recherche‌ en Automatique de Nancy‌‌ CRAN (Jean-Marie Moureaux) and neuro-oncologist and surgeon from‌ CHRU Nancy (Luc Taillandier,‌ Fabien Rech) about diffuse‌‌ low-grade gliomas (DLGG). These are slow-growing tumors that‌ are often asymptomatic for‌ a long period of‌‌ time. They progress to a higher grade, resulting‌ in the patient's death.‌ The current treatment strategy‌‌ aims to surgically reduce tumor volume as soon‌ as possible. As DLGGs‌ infiltrate functional areas, surgery‌‌ is performed in an awake state, with active‌ patient participation, while electrical‌ brain stimulations are done,‌‌ to identify functional structures. At CHRU Nancy, surgeries‌ are filmed, but the‌ patient's responses are recorded‌‌ manually. We aim to develop an automatic tool‌ to detect, analyse and‌ register the patients responses.‌‌ We use deep learning algorithms for motion and‌ speech detection. In perspective,‌ the fine anomalies that‌‌ can be identified are likely to be correlated‌ with the patient's short-‌ and long-term cognitive outcome.‌‌

4.2 Telomeres

Telomeres are non-coding regions of repetitive‌ nucleotide sequences located at‌ each end of a‌‌ chromosome. They protect the end of the chromosome‌ from deterioration or from‌ fusion with neighboring chromosomes,‌‌ ensuring the integrity of genetic material over cell‌ divisions. At each cell‌ division, telomeres loose a‌‌ short fragment, a phenomenon often called the `end‌ replication problem'. When its‌ telomeres are too short,‌‌ the cell stops dividing and enters a senescence‌ phase. In human, it‌ is known that short‌‌ telomere lengths are statistically linked to age related‌ diseases 46.

In‌ an ongoing collaboration with‌‌ Athanase Benetos (CHRU of Nancy) and Simon Toupance‌ (CHRU of Nancy), we‌ study the telomere length‌‌ distribution in a human body, its relation with‌ the patient's phenotype, its‌ evolution with age and‌‌ across generations. Our contribution to the project is‌ to bring competences ranging‌ from theoretical probability to‌‌ applied statistics, including modelling and numerical simulation of‌ stochastic models and analysis‌ of distributional data of‌‌ telomeres length. A first goal is to provide‌ physicians with additional medical‌ statistics to better understand‌‌ the health state of a patient using its‌ telomere length distribution, based‌ on our observation that‌‌ the shape of the telomere length distribution is‌ stable accross ages of‌ an individual, leading to‌‌ the concept of telomere signature 9. We‌ plan to study larger‌ cohorts of individuals with‌‌ medical records and parental relationships to construct an‌ equivalence relationship between shapes‌ of distributions and to‌‌ develop health scores for patients allowing to assess‌ the risk of particular‌ diseases. A second goal‌‌ is to bring new insights for the description‌ and modelling of the‌ evolution of telomere length‌‌ distribution with age and across a population. For‌ this, we study a‌ stochastic branching model of‌‌ the telomere length into‌ a given tissue of the form of (‌1) where ${x}_{i} (t)‌$ is the vector of lengths of telomeres of‌ cell $i$ . We also plan to study‌ models for the evolution of the telomere length‌ accross a population of individuals and on evolutionary‌ time scales, where ${x}_{i} (t)‌$ is the telomere lengths in gametes of individual‌ $i$ . This project requires advanced mathematical tools‌ related to the theory of branching processes and‌ of non-conservative semi-groups. We also plan to tackle‌ parameter estimation questions for these models.

In another‌ ongoing project with Marie-Noëlle Simon (CRCM, Aix-Marseille Université),‌ we study modeling and inference of the different‌ mechanisms of telomere shortening or elongation in survivor‌ cells of yeast Saccharomyces cerevisiae for which telomerase‌ is inactivated. Telomerase is an enzyme that is‌ active in normal yeasts and compensates telomere shortening‌ due to the end replication problem. When telomerase‌ in inactivated, most yeasts undergo replicative senescence, except‌ a few ones called survivor cells, which are‌ able to develop alternative telomere elongation mechanisms. The‌ goal of the project is to develop a‌ full model of the evolution of survivor cells,‌ in particular by estimating the rates and sizes‌ of abrupt telomere shortening or lengthening in survivor‌ cells.

4.3 Gene networks and single-cell data

4.3.1‌ Modeling gene expression at single-cell level

Gene expression‌ in cells has long been only observable through‌ averaged quantities over cell populations. The development of‌ single-cell transcriptomics has enabled gene expression to be‌ measured in individual cells: it turns out that‌ even for an isogenic population located in a‌ homogeneous medium, molecular variability can be large. An‌ average description is therefore not sufficient to account‌ for fundamental phenomena such as cell differentiation. Recently,‌ a view emerged that the dynamics governing the‌ switching of cells from one differentiation state to‌ another could be characterized by a peak in‌ gene expression variability at the point of fate‌ commitment 90. We are continuing on this‌ path, working on the link between PDMP models‌ and notions of entropy and epigenetic landscape.

4.3.2‌ Transcriptional bursting in regulatory networks

Working in the‌ active field of single-cell dynamics and gene regulatory‌ networks provides opportunities to interact with biologists such‌ as Olivier Gandrillon from ENS Lyon 90,‌ 77, and potentially also physicians such as‌ Erwan Pencreac'h in CHRU Strasbourg. The biological literature‌ increasingly highlights gene regulatory networks as playing an‌ important role (independent of genetic mutations) in the‌ acquisition of resistance to cancer treatments, hence this‌ topic might become soon also relevant to the‌ application area of oncology.

4.3.3 Prediction and identification‌ of therapeutic targets for chronic lymphocytic leukemia

In‌ an ongoing collaboration with Laurent Vallat (CHRU Strasbourg),‌ we develop new models and inference methods for‌ gene regulation networks allowing to make prediction of‌ biological intervention experiments (such as gene knock-down). Inference‌ is performed on gene expression data from cells of patients suffering from‌ different forms of chronic‌ lymphocytic leukemia. The goal‌‌ is to use prediction to identify therapeutic targets‌ which could be knocked-down‌ to reduce the cells'‌‌ proliferation. Biological experiments will be performed by Laurent‌ Vallat and his group‌ to assess the therapeutic‌‌ potential of the new targets.

4.4 Chalara

The‌ Chalara project 69 is‌ a team project with‌‌ Benoît Marçais (INRAE Champenoux) and Marie Grosdidier (INRAE‌ Avignon). Chalara is an‌ ash disease that arrived‌‌ in France 12 years ago through Grand Est‌ and has been spreading‌ throughout France ever since.‌‌ The disease spreads by means of fungus spores‌ that are deposited on‌ the leaves of trees‌‌ during summer, fall at the foot of the‌ trees during fall and‌ give rise to fungi‌‌ that release new spores that spread the following‌ summer. Affected trees show‌ signs of decline (defoliation,‌‌ canker...) that can lead to their death.

The‌ objective is to model‌ the spread of chalara‌‌ and to study and quantify the potential underlying‌ environmental effects, such as‌ humidity or high temperatures.‌‌ We use a hybrid model where spores spread‌ is based on reaction-diffusion‌ PDEs and other steps‌‌ of the disease cycle are stochastic. The project‌ involves modeling, Bayesian statistics‌ and intensive simulation.

5‌‌ Highlights of the year

5.1 Awards

The K.P.‌ Hadeler prize for best‌ paper in the Journal‌‌ of Mathematical Biology in 2024 goes to Michel‌ Benaim, Claude Lobry, Tewﬁk‌ Sari and Edouard Strickler‌‌ for their paper entitled “When can a‌ population spreading across sink‌ habitats persist?” 45‌‌.

6 Latest software developments, platforms, open data‌

6.1 Latest software developments‌

6.1.1 cvmgof

Keywords:
Regression,‌‌ Test, Estimators
Scientific Description:
Many goodness-of-fit tests have‌ been developed to assess‌ the different assumptions of‌‌ a (possibly heteroscedastic) regression model. Most of them‌ are "directional" in that‌ they detect departures from‌‌ a given assumption of the model. Other tests‌ are "global" (or "omnibus")‌ in that they assess‌‌ whether a model fits a dataset on all‌ its assumptions. cvmgof focuses‌ on the task of‌‌ choosing the structural part of the regression function‌ because it contains easily‌ interpretable information about the‌‌ studied relationship. It implements 2 nonparametric "directional" tests‌ and one nonparametric "global"‌ test, all based on‌‌ generalizations of the Cramer-von Mises statistic.
Functional Description:‌
cvmgof is an R‌ library devoted to Cramer-von‌‌ Mises goodness-of-fit tests. It implements three nonparametric statistical‌ methods based on Cramer-von‌ Mises statistics to estimate‌‌ and test a regression model.
URL:
https://cran.r-project.org/web/packages/cvmgof/index.html
Publication:‌
hal-03101612
Contact:
Romain Azais‌
Participants:
Sandie Ferrigno, Marie-Jose‌‌ Martinez, Romain Azais

6.1.2 Harissa

Name:
Hartree approximation‌ for inference along with‌ a stochastic simulation algorithm‌‌
Keywords:
Gene regulatory networks, Reverse engineering, Molecular simulation‌
Functional Description:
Harissa is‌ a Python package for‌‌ both inference and simulation of gene regulatory networks,‌ based on stochastic gene‌ expression with transcriptional bursting.‌‌ It was implemented in the context of a‌ mechanistic approach to gene‌ regulatory network inference from‌‌ single-cell data.
URL:
https://github.com/ulysseherbach/harissa‌
Publications:
hal-04208601, hal-04071033, hal-01646910
Contact:
Ulysse‌ Herbach

6.1.3 MultiRNAflow

Name:
An R package for‌ the analysis of RNAseq raw counts with multiple‌ biological conditions and time points
Keywords:
RNA-seq, Gene‌ regulatory networks, Integrated data analysis, Complex experimental design,‌ Multiple temporal and biological conditions, Differential expression
Functional‌ Description:
The R package MultiRNAflow provides an easy‌ to use unified framework allowing to make both‌ unsupervised and supervised analysis (differential expression analysis) for‌ RNAseq datasets with an arbitrary number of biological‌ conditions and time points. In particular, this package‌ makes a deep downstream analysis of differential expression‌ information, e.g. identifying temporal patterns across biological conditions‌ and differentially expresses genes which are specific to‌ a biological condition for each time.
Release Contributions:‌
First version
URL:
https://bioconductor.org/packages/release/bioc/html/MultiRNAflow.html
Contact:
Nicolas Champagnat
Participants:‌
Rodolphe Loubaton, Nicolas Champagnat, Pierre Vallois, Laurent Vallat‌
Partner:
CHRU de Strasbourg

6.1.4 quantCurves

Keyword:
Statistical‌ modeling
Functional Description:
Non-parametric methods as local normal‌ regression, polynomial local regression and penalized cubic B-splines‌ regression are used to estimate quantiles curves.
URL:‌
https://cran.r-project.org/web/packages/quantCurves/quantCurves.pdf
Contact:
Sandie Ferrigno

6.1.5 PEOC

Name:
Parameters‌ Estimation Of Chalara model
Keywords:
Statistical inference, Iterative‌ algorithm, Monte Carlo estimation, Python, Statistical modeling
Functional‌ Description:
PEOC is Python software which aims to‌ simulate and estimate the parameters of the chalara‌ model of the article 'Mechanistic-statistical model for the‌ expansion of ash dieback' by Coralie Fritsch, Marie‌ Grosdidier, Anne Gégout-Petit and Benoît Marçais. It allows‌ to reproduce the simulations made in this article,‌ which is available at the following url https://hal.science/hal-04690647.‌
URL:
https://github.com/coraliefritsch/peoc
Contact:
Coralie Fritsch

7 New results‌

7.1 Stochastic modeling for health, ecology and evolution‌

7.1.1 Gene regulatory networks

Stochastic modeling and simulation‌ of gene regulatory networks at single-cell level

Participants:‌ Mathilde Gaillard, Ulysse Herbach.

Single-cell data‌ reveal the presence of biological stochasticity between cells‌ of identical genome and environment, in particular highlighting‌ the transcriptional bursting phenomenon. To account for this‌ property, gene expression may be modeled as a‌ continuous-time Markov chain where biochemical species are described‌ in a discrete way, leading to Gillespie's stochastic‌ simulation algorithm (SSA) which turns out to be‌ computationally expensive for realistic mRNA and protein copy‌ numbers. Alternatively, hybrid models based on piecewise-deterministic Markov‌ processes (PDMPs) offer an effective compromise for capturing‌ cell-to-cell variability 73, but their simulation remains‌ limited to specialized mathematical communities.

With a view‌ to making them more accessible, we introduced a‌ simple simulation method that is reminiscent of SSA,‌ while allowing for much lower computational cost 23‌. We detailed the algorithm for a bursty‌ PDMP describing an arbitrary number of interacting genes,‌ and proved that it simulates exact trajectories of‌ the model. As an illustration, we used the‌ algorithm to simulate a two-gene toggle switch: this‌ example highlights the fact that bimodal distributions as‌ observed in real data are not explained by‌ transcriptional bursting per se, but rather by distinct‌ burst frequencies that may emerge from interactions between genes.

Cell trajectory inference‌ based on Schrödinger problem‌ and a mechanistic model‌‌ of stochastic gene expression

Participants: Ulysse Herbach.‌

External collaborators: Université‌ Paris Cité, Aix Marseille‌‌ Université, Université Lyon 1, ENS Lyon.

Cellular differentiation‌ is the biological process‌ that leads a cell‌‌ to opt for a particular cellular identity. Recently,‌ single-cell RNA-sequencing has enabled‌ the simultaneous measurement of‌‌ gene expression levels at specific times for a‌ large number of individual‌ cells and a large‌‌ number of genes. Repeating such measurements at different‌ time points gives access‌ to the temporal variation,‌‌ or transport, of a distribution over gene expression‌ space. The whole temporal‌ trajectory of distributions thus‌‌ characterizes the differentiation process at population level, but‌ trajectories of individual cells‌ are still out of‌‌ reach since most measurement techniques are destructive.

The‌ optimal transport theory that‌ has been used so‌‌ far to infer cellular differentiation trajectories from time-stamped‌ single-cell RNA-seq data involves‌ solving the so-called Schrödinger‌‌ problem in its most common version. This implies‌ assuming that cells move,‌ in the gene expression‌‌ space, by diffusion. Yet, real gene dynamics are‌ much more complex. In‌ 31, we assume‌‌ that mRNA dynamics are characterized by brief and‌ important production of RNA,‌ with long periods of‌‌ inactivity in between, and consider a bursty PDMP‌ model of gene dynamics‌ 23. We use‌‌ this model to define a reference process for‌ the Schrödinger problem. By‌ comparing the solutions of‌‌ the Schrödinger problems with diffusive and bursty reference‌ processes, under different conditions,‌ we show that the‌‌ bursty model provides a better approximation of the‌ underlying gene dynamics when‌ inferring cell trajectories.

Prediction‌‌ of silencing experiments on gene networks for chronic‌ lymphocytic leukemia

Participants: Nicolas‌ Champagnat, Anne Gégout-Petit‌‌, Anouk Rago, Pierre Vallois.

External‌ collaborators: CHRU Strasbourg‌

In this collaboration with‌‌ the group of Laurent Vallat in CHRU Strasbourg,‌ we work on the‌ inference of dynamical gene‌‌ networks from RNA-seq data of chronic lymphocytic leukemia.‌ The goal is to‌ infer a model of‌‌ gene expression allowing to predict gene expression in‌ cells where the expression‌ of specific genes is‌‌ knocked-down (e.g. using siRNA, i.e. small interfering ribonucleic‌ acid), in order to‌ select the knock-down experiments‌‌ which are more likely to reduce cell proliferation.‌ We expect the selected‌ genes to provide new‌‌ therapeutic targets for the treatment of chronic lymphocytic‌ leukemia. This year, we‌ developed in 27 two‌‌ general mathematical frameworks do model knock-down experiments on‌ gene regulatory networks using‌ gene expression data of‌‌ cells without knock-down: the first one is based‌ on conditional probabilities and‌ applies to any quantitative‌‌ model and the second assumes that the model‌ is mechanistic.

7.1.2 Parameter‌ scalings in population dynamics‌‌

Adaptive dynamics in biological populations with small mutations‌

Participants: Nicolas Champagnat.‌

External collaborators: Vincent‌‌ Hass (Université de Franche-Comté)

In 15 we studied‌ general individual-based models of‌ biological populations with mutation‌‌ and selection, under assumptions‌ of large population and small mutations. We recover‌ variants of the canonical equation of adaptive dynamics,‌ which describes the long time evolution of the‌ dominant phenotype in the population, under less stringent‌ biological assumptions than in previous works such as‌ 56.

Averaging principle for models with slow-fast‌ components

Participants: Vincent Kagan, Edouard Strickler,‌ Denis Villemonais.

In 33, we consider‌ a slow-fast stochastic process where the slow component‌ is a jump process on a measurable index‌ set whose transition rates depend on the position‌ of the fast component. Between the jumps, the‌ fast component evolves according to an ergodic dynamic‌ in a state space determined by the index‌ process. We prove that, when the ergodic dynamics‌ are accelerated, the slow index process converges to‌ an autonomous pure jump process on the index‌ set. These results can be used to legitimate‌ classical toy-models long used in applications, providing a‌ justification for their simplifying assumptions. In particular, we‌ apply our results to prove the convergence of‌ a typed branching process toward a continuous-time Galton-Watson‌ process, and of an epidemic model with fast‌ viral loads dynamics to a standard contact process.‌

Multiscale dynamics and condensation phenomenon in coagulation-fragmentation processes‌

Participants: Elie Cerf, Coralie Fritsch, Denis‌ Villemonais.

External collaborator: Alex Watson (University‌ College London)

This collaboration is an ongoing work‌ aimed to study the condensation phenomenon in the‌ context of a large population of particles randomly‌ interacting through fragmentation and coagulation. More precisely, we‌ want to exhibit the possible coexistence of two‌ phases respectively made of microscopic and macroscopic aggregates.‌ In the first part of this project, we‌ extend the results of 49 on the chemostat‌ model to study a stochastic growth-fragmentation-coagulation model in‌ which a population of particles determined by their‌ masses deterministically consume a substrate to grow and‌ randomly interact, affecting the size of the population.‌ Using usual tools of individual-based models (IBM), we‌ establish the convergence of a process modeling the‌ population to the solution of an integro-differential equation‌ when the size of the population tends to‌ infinity, for a general class of random fragmentation‌ laws. In a second part, we aim to‌ use our understanding of the previous IBM to‌ study the stochastic Becker-Döring model of aggregation and‌ fragmentation. In the deterministic case, it is well‌ known that even though the Becker-Döring system conserves‌ its total mass in finite times there are‌ conditions under which, as the time goes to‌ infinity, a positive fraction of the mass gets‌ trapped in an ever-growing macroscopic aggregate. In particular,‌ Niethammer 87 showed that under a precise normalization,‌ the dynamic of the infinite cluster is given‌ by a Lifshitz-Slyozov-Wagner coarsening model. Our goal is‌ to obtain similar results in the stochastic framework.‌

7.1.3 Individual-based modeling in medicine, ecology and evolution‌

Telomeres

Participants: Sophie Baland, Nicolas Champagnat,‌ Coralie Fritsch, Juan Mardomingo Sanz, Denis Villemonais.

External collaborators‌: CHRU Nancy, CRCM‌ Aix-Marseille Université

In most‌‌ cells, at each cell division, telomeres shorten due‌ to the so-called end‌ replication problem, which can‌‌ lead to replicative senescence and a variety of‌ age-related diseases. In certain‌ cells, the presence of‌‌ the enzyme telomerase can lead to the lengthening‌ of telomeres, which may‌ delay or prevent the‌‌ onset of such diseases but can also increase‌ the risk of cancer.‌ In 14, C.‌‌ Fritsch and D. Villemonais developed, in collaboration with‌ reserachers of CHRU Nancy,‌ a stochastic representation of‌‌ this biological model, which takes into account multiple‌ chromosomes per cell, the‌ effect of telomerase, different‌‌ cell types and the dependence of the distribution‌ of telomere length on‌ the dynamics of the‌‌ process. We study theoretical properties of this model,‌ including its long-term behavior.‌ In addition, we investigate‌‌ numerically the impact of the model parameters on‌ biologically relevant quantities, such‌ as the Hayflick limit‌‌ and the Malthusian parameter of the population of‌ cells. Similar models including‌ mechanisms of hematopoiesis (the‌‌ physiological process of blood cell production) are also‌ currently being developed by‌ S. Baland and D.‌‌ Villemonais.

In another ongoing project with Marie-Noëlle Simon‌ (CRCM, Aix-Marseille Université), N.‌ Champagnat, C. Fritsch, J.‌‌ Mardomingo Sanz and D. Villemonais started to study‌ modeling and inference of‌ the different mechanisms of‌‌ telomere shortening or elongation in survivor cells of‌ yeast Saccharomyces cerevisiae for‌ which telomerase is inactivated.‌‌ The telomerase enzyme is active in normal yeasts‌ and compensates telomere shortening‌ due to the end‌‌ replication problem. When telomerase is inactivated, most yeasts‌ undergo replicative senescence, except‌ a few ones called‌‌ survivor cells, which are able to develop alternative‌ telomere elongation mechanisms. We‌ are currently working on‌‌ methods to estimate the shortening of telomeres at‌ cell division for survivor‌ cells using the full‌‌ distribution of telomeres and on population models with‌ rare events of appearance‌ of survivors able to‌‌ account for data of yeasts cultures with daily‌ dilution.

Allometric relationships in‌ Ecology

Participants: Virgile Brodu‌‌, Nicolas Champagnat, Coralie Fritsch.

External‌ collaborator: Sylvain Billiard‌ (Université de Lille)

In‌‌ 26, we designed a stochastic individual-based model‌ structured in energy, for‌ single species consuming an‌‌ external resource, where populations are characterized by a‌ fixed positive energy at‌ birth. The resource is‌‌ maintained at a fixed amount, so we benefit‌ from a branching property‌ at the population level.‌‌ We focus on individual trajectories, constructed as a‌ PDMP with random jumps‌ modelling births and deaths‌‌ in the population and a continuous and deterministic‌ evolution of energy between‌ jumps. We are interested‌‌ in the case where metabolic (i.e. energy loss‌ for maintenance), growth, birth‌ and death rates depend‌‌ on the individual energy over time, and follow‌ allometric scalings (i.e. power‌ laws). Our goal is‌‌ to determine in a bottom-up approach what are‌ the possible allometric coefficients‌ (i.e. exponents of these‌‌ power laws) under elementary‌ and ecologically relevant constraints, for our model to‌ be valid for the whole spectrum of possible‌ body sizes. We show in particular that assuming‌ an allometric coefficient for metabolism strongly constrains the‌ range of possible values for the allometric coefficients‌ for birth, death and growth rates.

Niche construction‌

Participants: Nicolas Champagnat, Coralie Fritsch, Edouard‌ Strickler.

External collaborators: Universidad de Valparaiso,‌ Pontificia Universidad Catolica de Chile, Santa Fe Institute,‌ Universidad de Santiago de Chile, INRAE Montpellier

In‌ collaboration with Rolando Rebolledo, Pablo Marquet, Leonardo Videla,‌ Cristobal Quininao and Nicolas Zalduendo-Vidal, we are working‌ on the modeling of the eco-evolutionary process of‌ niche construction, by which a species or an‌ ecological community is able to modify its environment‌ in such a way that the induced adaptation‌ enhances survival of the species or the community.‌

In 28, we study the properties of‌ extinction and survival of population models with sublinear‌ growth rate parameterized by an exponent $θ$ .‌ We point out that this family of models,‌ recently proposed to fit various population growths, may‌ have important flaws depending on the value of‌ the parameter $θ$ , which make them unrealistic‌ for ecological modeling. We both study birth-death processes‌ and diffusion processes.

We also work on a‌ general modeling approach for niche construction based on‌ birth-death processes of $d$ interacting (sub)species immersed in‌ an environment which is influenced by the population‌ state (the so-called niche construction) and which evolves‌ on a slower time-scale. Under the above hypotheses,‌ extinction and/or re-emergence of negligible species on long‌ time scales can be observed. We prove that‌ the joint dynamics of the logarithm of the‌ species sizes and the environment undergo a piecewise‌ deterministic Markov process, which can be approximated by‌ an explicit dynamical system in the limit of‌ large populations. We apply this framework to study‌ the long term co-existence of two specialist species‌ consuming two resources, with a “joint”' niche construction‌ where each species constructs the niche of the‌ other while depleting its resources. We also study‌ an example of immune escape in cancer, where‌ the environmental variable is associated to the state‌ of the immune system and the species are‌ associated to three types of tumor cells. An‌ article is currently being written.

7.1.4 Quasi-stationary distributions‌

Participants: Nicolas Champagnat, Edouard Strickler, Denis‌ Villemonais.

External collaborators: Michel Benaïm (Univ.‌ Neuchâtel), Alex Cox (Univ. Bath), Emma Horton (Univ.‌ Warwick).

Denis Villemonais obtained in 34 a new‌ criterion for existence and convergence to a quasi-stationary‌ distribution (QSD) based on the spectral theory of‌ positive operators on Banach lattices, building on domination‌ properties for compactness of this type of operators‌ as exposed in 39.

One central question‌ in QSD theory is to construct and prove‌ convergence of numerical schemes for their approximation. In‌ 30, D. Villemonais studied a binary branching‌ model with Moran type interactions introduced in 61. In this interacting‌ particle system, particles evolve,‌ reproduce and die independently‌‌ and, with a probability that may depend on‌ the configuration of the‌ whole system, the death‌‌ of a particle may trigger the reproduction of‌ another particle, while a‌ branching event may trigger‌‌ the death of another particle. We prove optimal‌ bounds for the distance‌ between the empirical distribution‌‌ of the particle system and the quasi-stationray distribution‌ of the associated mean‌ semi-group.

In 16,‌‌ N. Champagnat, E. Strickler and D. Villemonais studied‌ the convergence of general‌ penalized Markov processes with‌‌ soft killing in (Monge-Kantorovich) $L^{1}$ Wasserstein distance.‌ We propose a simple‌ criterion ensuring uniform convergence‌‌ of conditional distributions to a unique QSD. We‌ give several examples of‌ application where our criterion‌‌ can be checked, including Bernoulli convolutions and piecewise‌ deterministic Markov processes, for‌ which convergence in total‌‌ variation is not possible.

In 29, N.‌ Champagnat and D. Villemonais‌ characterized large classes of‌‌ non-irreducible absorbed Markov processes with polynomial speed of‌ convergence to a QSD.‌ They applied their criteria‌‌ to prove the existence of a QSD for‌ general processes in denumerable‌ state spaces, assuming only‌‌ aperiodicity, the existence of a Lyapunov function and‌ the existence of a‌ point in the state‌‌ space from which the return time is finite‌ with positive probability.

N.‌ Champagnat and D. Villemonais‌‌ obtained in 11 general criteria ensuring existence, uniqueness‌ and/or exponential convergence properties‌ for QSD. The criteria‌‌ were specifically designed to apply to degenerate processes‌ such as hypoelliptic diffusions‌ and allow to improve‌‌ existing results in this domain.

7.1.5 Multi-type bisexual‌ and weighted branching process‌

Participants: Coralie Fritsch,‌‌ Denis Villemonais.

External collaborators: Nicolas Zalduendo-Vidal‌ (INRAE Montpellier).

The quasi-stationary‌ behavior of asexual subcritical‌‌ multi-type Galton-Watson branching processes is well-known. However, this‌ question was open for‌ bisexual processes, even in‌‌ the single-type case. C. Fritsch and D. Villemonais‌ studied in 17 the‌ quasi-stationary behavior of the‌‌ multi-type bisexual branching processes in the subcritical case.‌ We establish the existence‌ of an infinite number‌‌ of QSDs for the process. Among these distributions,‌ only a finite number‌ has good integrability properties.‌‌ In addition, when the process is irreducible, its‌ conditonal distributions converge towards‌ a unique QSD. This‌‌ work is based on our previous work 71‌, which was the‌ first one studying the‌‌ criticality of general multi-type bisexual branching processes.

It‌ is well-known that super-critical‌ multi-type classical branching processes‌‌ follow the strong law of large numbers at‌ large times. In 35‌, D. Villemonais studied‌‌ the long-term behavior of weighted multi-type branching processes,‌ focusing on extending classical‌ laws of large numbers‌‌ and martingale convergence to settings with infinitely many‌ weighted particles, arbitrary type‌ spaces and non-geometric rescaling.‌‌ We developed applications to Galton-Watson trees indexed by‌ random weights and by‌ random kernels, convergence in‌‌ Wasserstein distance of the underlying mean semi-group, and‌ convergence of ergodic averages‌ along lineages.

7.1.6 ODE‌‌ with semi-Markov switching

Participants:‌ Edouard Strickler.

External collaborators: Tobias Hurth‌ (Université de Neuchâtel, Suisse).

In classical PDMP, time‌ durations between random events are distributed as exponential‌ variables. This is made in order to ensure‌ the Markov property of the process but can‌ be debated from a modeling perspective. Indeed, the‌ exponential law allows for arbitrarily fast switches, which‌ is not realistic in most biological or physical‌ models, where a minimum time is required between‌ two environmental changes. Motivated by this question, in‌ collaboration with Tobias Hurth, we developed in 32‌ a framework for PDMP where the times between‌ jumps of the process can follow any law.‌ We show that if the law of jumping‌ time is sufficently regular, then certain conditions on‌ the deterministic dynamics leading to ergodicity of the‌ process in the case of exponentially distributed jump‌ times can still be used in this general‌ context, even though times between jumps are not‌ arbitrarily short—which was a property of the exponential‌ distributions used crucially in previous references.

7.1.7 Dispersal‌ Induced Growth

Participants: Edouard Strickler.

External collaborators‌: Michel Benaïm (Université de Neuchâtel, Suisse), Claude‌ Lobry (Université de Nice) and Tewfik Sari (Inrae‌ Montpellier).

In this collaboration, we exhaustively studied the‌ phenomenon of Dispersal Induced Growth (DIG), a term‌ coined by Katriel in 75. In a‌ population spreading across a finite number of patches,‌ individuals in a patch may undergo a time-varying‌ growth rate which is such that, in the‌ absence of migration between the patches, the population‌ will eventually become extinct in each patch. There‌ is Dispersal Induced Growth when adding migration between‌ such patches leads the whole population to grow‌ and survive. In the paper 45 we developed‌ several tools in order to understand when the‌ DIG phenomenon can happen. In case where the‌ graph of migration is connected at each time,‌ then DIG happens if and only if the‌ mean growth rate of an idealized habitat which‌ would have the maximal growth rate at each‌ time is positive. In 13, we consider‌ the case where migration is time dependent and‌ the graph of migration is globally connected but‌ not necessarily connected at each time. In this‌ situation, we exhibit example where there is no‌ DIG. We also give an example of Dispersal‌ Induced Decay, meaning that, in the absence of‌ migration, the population will survive in each patch,‌ while adding migration may lead the whole population‌ to extinction. The study is performed via the‌ sign of a Lyapunov exponent, leading to necessary‌ and sufficient conditions for DIG.

In the recent‌ paper 51, P. Carmona gives an asymptotic‌ formula for the top Lyapunov exponent of a‌ linear $T$ -periodic cooperative differential equation, in the‌ limit $T \to + \infty$ . The short‌ note 12 discusses and extends this result.

7.1.8‌ Modeling of chronic obstructive pulmonary disease

Participants: Pierre‌ Vallois.

External collaborators: Isabelle Dupin (Univ. Bordeaux), Élise Maurat (Univ.‌ Bordeaux).

Most of the‌ direct respiratory effects of‌‌ air pollution result from the disruption of the‌ lungs' natural defense mechanisms.‌ Among the most essential‌‌ ones is mucociliary clearance, which describes the coordinated‌ effort between mucus produced‌ in the respiratory tract‌‌ to trap microorganisms and particles and the unidirectional‌ movement of cilia, that‌ propels trapped particles towards‌‌ the throat. The perturbation of mucociliary clearance can‌ lead to the formation‌ of mucus plugs, abnormally‌‌ thick mucus accumulations that obstruct the airways. several‌ fundamental aspects of mucus‌ plugs remain poorly understood,‌‌ particularly how they form, where exactly they localize‌ within the bronchial tree,‌ and how they evolve‌‌ over time. This year, we have been working‌ on using mathematical modeling‌ to simulate the entire‌‌ bronchial network and explore plug formation, distribution, and‌ temporal behavior in ways‌ that are currently inaccessible‌‌ experimentally. A paper is currently being written.

7.1.9‌ Drawdowns of diffusions

Participants:‌ Pierre Vallois.

External‌‌ collaborators: Paavo Salminen (Åbo Akademi University, Finland).‌

In 22, using‌ the excursion theory of‌‌ diffusion processes, we provide new proofs of, firstly,‌ Lehoczky's formula for the‌ joint distribution of the‌‌ first drawdown time and the maximum before this‌ time, and, secondly, of‌ Malyutin's formula for the‌‌ joint distribution of the first hitting time and‌ the maximum drawdown before‌ this time. It is‌‌ remarkable that the excursion approach which we developed‌ first for Lehoczky's formula‌ also provides a proof‌‌ for Malyutin's formula. Moreover, we analyze the pure‌ jump process describing the‌ maximum before the first‌‌ drawdown time when the size of the drawdown‌ is varying.

7.2 Analysis‌ of biological and medical‌‌ data

7.2.1 Quantifying and predicting the evolution of‌ clonal heterogeneity in chronic‌ lymphocytic leukemia

Participants: Nicolas‌‌ Champagnat, Coralie Fritsch, Ulysse Herbach,‌ Pierre Vallois, Vidhi‌ Vidhi.

External collaborators‌‌: CHRU Strasbourg

The development of targeted therapies‌ has allowed considerable progress‌ in the treatment of‌‌ many cancers, but their efficacy is dependent on‌ intra-tumor heterogeneity. In lymphomas‌ and leukemias, the identification‌‌ of gene alterations by high-throughput sequencing allows the‌ characterization of this heterogeneity.‌ In these hemopathies, the‌‌ initial leukemic clone has a unique immune repertoire‌ corresponding to specific human‌ immunoglobulin genes called VDJ‌‌ genes encoding the antigen receptor. The occurrence of‌ additional mutations in VDJ‌ genes may be responsible‌‌ for the emergence of subclones with increased antigen‌ receptor reactivity further complicating‌ the clonal heterogeneity of‌‌ these hemopathies. However, this second level of clonal‌ heterogeneity and its evolution‌ remain poorly characterized and‌‌ is not considered in the management of these‌ cancers.

In collaboration with‌ the group of Laurent‌‌ Vallat in CHRU Strasbourg, we aim to develop‌ a mathematical model for‌ the evolution of the‌‌ two levels of clonal heterogeneity in leukemia, allowing‌ to characterize their evolution‌ from longitudinal bulk sequencing‌‌ data of VDJ and cancer genes mutations using‌ a Bayesian approach. This‌ year, we developed a‌‌ model of the phylogenetic‌ tree of squences of VDJ genes based on‌ the weighted uniform distribution over trees, with weights‌ related to the Hamming distance which counts the‌ number of mutations between sequences. This model allows‌ us to represent with a graph the most‌ probable phylogenetic trees and the uncertainties of certain‌ parts of the tree. We are currently working‌ on the development of variational methods of inference‌ of the tree in the context of unobserved‌ VDJ sequences.

7.2.2 Promoting physical activity and limiting‌ sedentary behaviors to manage pain in endometriosis: analysis‌ of psychosocial variables

Participants: Ulysse Herbach.

External‌ collaborators: Université de Haute-Alsace, Université de Nîmes‌

Endometriosis is negatively linked to physical activity (PA),‌ as its symptoms and psychosocial barriers may hinder‌ participation in PA. This study compared PA, sedentary‌ behavior (SED), and psychosocial variables in women with‌ and without endometriosis 19. Women with endometriosis‌ reported lower specific SED, poorer quality of life‌ (QoL), greater score of kinesiophobia, more health-related barriers‌ to PA, lower physical self-concepts, and negative PA-related‌ stereotypes (beliefs or perceptions). The amount of PA‌ was most related to previous PA behavior and‌ intention in both groups. In endometriosis, PA and‌ SED are correlated to barriers related to health,‌ motivational variables, beliefs about the risks of PA,‌ physical self-concepts and QoL. Findings highlight the importance‌ of motivational variables and self-concepts, which could be‌ targeted to support engagement in PA to improve‌ symptom management and QoL in women with endometriosis.‌ This study is a first step towards a‌ finer multivariate analysis including a specific treatment of‌ the ordinal nature of the data, for example‌ using latent Gaussian variables.

7.2.3 Uncovering candidate Nanog-Helper‌ genes in early mouse embryo differentiation using differential‌ entropy and network inference

Participants: Ulysse Herbach.‌

External collaborators: Université libre de Bruxelles, ENS‌ Lyon, Université Clermont Auvergne

In the preimplantation mammalian‌ embryo, stochastic cell-to-cell expression heterogeneity is followed by‌ signal reinforcement to initiate the specification of Inner‌ Cell Mass (ICM) cells into Epiblast (Epi). The‌ expression of NANOG, the key transcription factor for‌ the Epi fate, is necessary but not sufficient:‌ coincident expression of other factors is required. To‌ identify possible Nanog-helper genes, we analyzed gene expression‌ variability in five time-stamped single-cell transcriptomic datasets using‌ differential entropy, a quantitative measure of cell-to-cell heterogeneity‌ 21. The entropy of Nanog displays a‌ peak-shaped temporal pattern from the 16-cell to the‌ 64-cell stage, consistent with its key role in‌ Epi specification. By estimating the entropy profiles of‌ the 21 genes common to all five datasets,‌ we identified three genes - Pecam1, Sox2, and‌ Hnf4a - whose variability in expression patterns mirrors‌ that of Nanog.

We further performed gene regulatory‌ network inference using CARDAMOM 10, an algorithm‌ that exploits temporal dynamics and transcriptional bursting. The‌ results revealed that these three genes exhibit reciprocal‌ activation with Nanog at the 32-cell stage. This‌ regulatory motif reinforces fate-switching decisions and co-expression states. This analysis of single-cell‌ transcriptomic data thus uncovers‌ a likely role for‌‌ Pecam1, Sox2, and Hnf4a as key genes that,‌ when coincidentally expressed with‌ Nanog, initiate ICM differentiation.‌‌

7.2.4 Harnessing ecological niche modeling of Listeria monocytogenes‌ for biopreservation system engineering‌

Participants: Sandie Ferrigno.‌‌

External collaborators: LIBio - Laboratoire d'Ingénierie des‌ Biomolécules (

In this‌ collaboration with the LIBio‌‌ - Laboratoire d'Ingénierie des Biomolécules of Lorraine University,‌ we work on the‌ presence of pathogens in‌‌ food. To reduce this presence, hurdle technology, which‌ is based on the‌ use of a combination‌‌ of several preservative methods, is used by food‌ business operators. Among the‌ multiple available hurdles, biopreservation‌‌ consists of using microorganisms as protective cultures and/or‌ their metabolites to improve‌ the microbial quality of‌‌ food. This study explores the potential of ecological‌ niche modeling to guide‌ the selection of biopreservation‌‌ candidates. A luminescent strain of Listeria monocytogenes was‌ utilized in a multivariate‌ high-throughput competition assay. The‌‌ resulting data were analyzed using two parallel methods:‌ k-means clustering and Response‌ Surface Modeling. An article‌‌ 18 has been published on this work.

7.2.5‌ Nonparametric estimation

Participants: Sandie‌ Ferrigno.

External collaborators‌‌: R. Azais (MOSAIC INRIA Team, ENS Lyon),‌ M.-J. Martinez (LJK-Grenoble University)‌

Many goodness-of-fit tests have‌‌ been developed to assess the different assumptions of‌ a (possibly heteroscedastic) regression‌ model. Most of them‌‌ are `directional' in that they detect departures from‌ a given assumption of‌ the model. Other tests‌‌ are `global' (or `omnibus') in that they assess‌ whether a model fits‌ a dataset on all‌‌ its assumptions. We focus on the task of‌ choosing the structural part‌ of the regression and‌‌ the variance functions because they contain easily interpretable‌ information about the studied‌ relationship. We consider two‌‌ nonparametric `directional' tests and one nonparametric `global' test,‌ all based on generalizations‌ of the Cramér-von Mises‌‌ statistic. To perform these goodness-of-fit tests, we have‌ developed the R package‌ cvmgof, an easy-to-use tool‌‌ for practitioners, available from the Comprehensive R Archive‌ Network (CRAN). The package‌ was updated in 2022‌‌ (this is its third version)(6.1.1). This‌ latest version currently allows‌ testing the regression function‌‌ in the model. Since 2025, we worked to‌ enrich the package by‌ allowing the user to‌‌ test the homoscedasticity/heteroscedasticity of the model. This new‌ version will be submitted‌ to CRAN in 2026‌‌ and an associated article is currently being written.‌ In parallel, with the‌ aim of obtaining better‌‌ estimators in the various tests we are studying,‌ we have been working‌ on bandwidth selection choice,‌‌ a crucial parameter in nonparametric estimation. In particular,‌ we have relied on‌ cross-validation methods and have‌‌ also proposed new ones. We plan to integrate‌ these methods into the‌ cvmgof package in 2026‌‌ to offer users choices that will improve the‌ quality of the estimators‌ used. We presented these‌‌ results during the CMStatistics congress in London in‌ December 2025 36.‌

7.2.6 Online big data‌‌ analysis and online learning‌

Participants: Jean-Marie Monnez.

A tool for analyzing‌ streaming data is stochastic approximation introduced by Robbins‌ and Monro in 1951, that can be used‌ for example to estimate online parameters of a‌ regression function 66 or centers of clusters in‌ unsupervised classification 50. Another type of stochastic‌ approximation process, for estimating eigenvectors and eigenvalues of‌ the unknown $Q$ -symmetric expectation $B$ of a‌ random matrix $A$ using independent observations of $A‌$ , was introduced by Benzécri in 1969 and‌ studied by several authors, like the Oja process‌ (1985). In this type of process, independent observations‌ of the random matrix are observed and one‌ or a mini-batch of observations per step are‌ taken into account. We defined an extended Oja‌ process where there is a correlation model between‌ the random matrices introduced at each step and‌ where all the observations up to the current‌ step can be taken into account without storing‌ them 85. Firstly, this extends the scope‌ of application of these processes, secondly previous experiments‌ we have conducted show that processes using all‌ the data up to the current step generally‌ converge faster than those using a mini-batch of‌ data.

Canonical components of the canonical analysis of‌ two random vectors are collinear with principal components‌ of a PCA of the muldimensional linear regression‌ function of one vector with respect to the‌ other or projected PCA. In the context of‌ streaming data, we define in 20 processes for‌ estimating online in parallel this regression function and‌ canonical components, possibly taking into account at each‌ step all the data up to this step‌ without storing them and using an extended Oja‌ process 85. We deal with the cases‌ of canonical correlation analysis, factorial correspondence analysis and‌ factorial discriminant analysis.

We also considered the canonical‌ analysis of two random vectors in the context‌ of streaming data or big data. In the‌ work described above, specific stochastic approximation processes of‌ canonical components and of couples of canonical components‌ were defined separately for canonical correlation analysis (CCA),‌ factorial correspondence analysis (FCA) and factorial discriminant analysis‌ (FDA). Here, using an extension of the Oja‌ process, we define general stochastic approximation processes of‌ canonical components and of couples of canonical components‌ of the canonical analysis of two random vectors‌ that can be directly applied to CCA, FCA‌ and FDA. A paper is currently being written.‌

We are also currently working on an extension‌ of the Oja process to estimate the general‌ and canonical components of a generalized canonical analysis.‌ It can be directly used in mixed data‌ canonical analysis, generalized canonical correlation analysis, multiple correspondence‌ analysis and in the estimation of couples of‌ canonical components in canonical analysis of two random‌ vectors.

Given a non-decreasing sequence of closed convex‌ subsets $(K_{n})$ in a separable‌ real Hilbert space $H$ , we are also‌ currently studying the convergence of a stochastic approximation process in $H$ with‌ correlated observations, projected at‌ step $n$ on ${K‌‌}_{n}$ , that extends processes of the Robbins-Monro‌ type. We established theorems‌ of almost sure convergence‌‌ and in quadratic mean and applied them to‌ streaming multidimensional linear regression,‌ using at each step‌‌ a mini-batch of data or all the data‌ up to this step,‌ and to dynamic generalized‌‌ linear models when the parameter varies with time.‌

7.2.7 Diffuse low-grade gliomas‌

Participants: Sophie Wantz-Mézières.‌‌

External collaborators: CRAN BioSIS, CHRU Nancy.

The‌ therapeutic management of patients‌ with diffuse low-grade gliomas‌‌ (DLGG) is based on monitoring progress through regular‌ MRIs, usually through the‌ reconstructed volume of the‌‌ tumor (after semiautomatic delineation). But this seems not‌ sufficient. Up to now,‌ the diffuse nature of‌‌ this kind of tumors has been observed but‌ is not well measured.‌ We designed a new‌‌ MRI-based variable, the ESVR (ExtraSphere Volume Ratio) to‌ quantify the DLGG brain‌ infiltration and discriminate patterns‌‌ of patients. A machine learning approach allows us‌ to detect that patients'‌ age and ESVR at‌‌ diagnosis seem to play an important role, as‌ well as the well-known‌ anatomopathology results. This result‌‌ has been submitted to Biomedical Signal Processing and‌ Control.

A thesis is‌ currently underway to develop‌‌ a tool to assist in awake surgery. This‌ tool should make it‌ possible to better identify‌‌ subtle disorders during surgery that are impossible or‌ difficult for humans to‌ assess, in order to‌‌ better organize the procedure (detection of possible “tipping‌ points” beyond which the‌ patient is no longer‌‌ performing) or to stop it (detection of alterations‌ that are generally minor‌ but which, in the‌‌ patient's socio-professional context will have deleterious consequences on‌ their quality of life).‌ Significant progress has been‌‌ made up to date. The foundations of this‌ tool have been laid,‌ based on two complementary‌‌ processes: the detection of motor states through pose‌ recognition, and the transcription‌ of oral responses via‌‌ speech recognition. In order to identify the most‌ suitable technical configuration for‌ the operating room context,‌‌ work is currently being carried out on simulated‌ data obtained from volunteers:‌ comparison of voice transcription‌‌ models on audio recordings, and implementation of a‌ comparative study of pose‌ recognition and motor state‌‌ classification models, based on videos mimicking the exercises‌ performed in the operating‌ room. Finally, a tool‌‌ for anonymizing videos from the operating room has‌ been developed, allowing secure‌ access to them without‌‌ compromising patient identity. This work has been partly‌ presented in congress 37‌, and a communication‌‌ has been submitted in ICASSP 2026.

8 Partnerships‌ and cooperations

Participants: Sophie‌ Baland, Virgile Brodu‌‌, Nicolas Champagnat, Elie Cerf, Coralie‌ Fritsch, Mathilde Gaillard‌, Anne Gégout-Petit,‌‌ Ulysse Herbach, Juan Mardomingo Sanz, Anouk‌ Rago, Édouard Strickler‌, Pierre Vallois,‌‌ Vidhi Vidhi, Denis Villemonais, Sophie Wantz-Mézières‌.

8.1 International initiatives‌

8.1.1 Associate Teams in‌‌ the framework of an‌ Inria International Lab or in the framework of‌ an Inria International Program

aStoNiche

Title
Towards a‌ stochastic theory of niche construction
Duration
2024-2026
Coordinators‌
Nicolas Champagnat and Rolando Rebolledo
Partners
- Inria team‌ SIMBA (N. Champagnat, C. Fritsch, E. Strickler)
- Universidad‌ de Valparaiso (R. Rebolledo, N. Rivera)
- Pontificia Universidad‌ Catolica de Chile (P. Marquet, C. Quininao)
- Universidad‌ de Santiago de Chile (L. Videla, N. Zalduendo-Vidal)‌
Objectives
We aim to provide a general stochastic‌ formulation of the niche construction process. In particular,‌ we want to take into account the feedbacks‌ of species on their environment, and the evolutionary‌ aspects that follow. This requires to deal with‌ different time-scales (ecological, niche construction, evolutionary…) and to‌ keep track of non-extinct traits that may be‌ positively selected after niche construction. We plan to‌ use mean-field stochastic individual-based model and branching processes‌ and consider appropriate parameter scalings.

8.1.2 Visits of‌ international scientists

Other international visits to the team‌

Leonardo Videla

Status
Assistant professor
Institution of origin:‌
University of Santiago de Chile
Country:
Chile
Dates:‌
September 18 to 24
Context of the visit:‌
Collaboration within the associate team aStoNiche
Mobility program/type‌ of mobility:
Research stay

Cristobal Quininao

Status
Assistant‌ professor
Institution of origin:
University of Santiago de‌ Chile
Country:
Chile
Dates:
November 3 to 7‌
Context of the visit:
Collaboration within the associate‌ team aStoNiche
Mobility program/type of mobility:
Research stay‌

Nicolas Zalduendo-Vidal

Status
Assistant professor
Institution of origin:‌
University of Santiago de Chile
Country:
Chile
Dates:‌
December 1 to 5
Context of the visit:‌
Collaboration within the associate team aStoNiche
Mobility program/type‌ of mobility:
Research stay

8.1.3 Visits to international‌ teams

Research stays abroad

Elie Cerf

Visited institution:‌
University College London
Country:
United Kingdom
Dates:
June‌ 30 - July 11
Context of the visit:‌
Collaboration with Alex Watson on multiscale dynamics in‌ fragmentation-coagulation processes.
Mobility program/type of mobility:
Research stay‌

Nicolas Champagnat

Visited institution:
Pontificia Universidad Católica de‌ Chile
Country:
Chile
Dates:
March 8 to 20‌
Context of the visit:
Collaboration within the associate‌ team aStoNiche
Mobility program/type of mobility:
Research stay‌

8.2 European initiatives

8.2.1 H2020 projects

N. Champagnat‌ is scientific collaborator of the ERC SINGER (AdG‌ 101054787) on Stochastic dynamics of sINgle cells, coordinated‌ by S. Méléard (Ecole Polytechnique). He is involved‌ in the research axes “From stochastic processes to‌ singular Hamilton-Jacobi equations” and “Lineages and time reversed‌ trajectories” of this project.

8.3 National initiatives

Projects‌ coordinated by the team:

INSERM funding
Project Predi-CLL‌, ITMO Physics, Mathematics applied to Cancer (from‌ October 2023): “Quantifying and predicting the evolution of‌ clonal heterogeneity in chronic lymphocytic leukemia”. Funding organisms:‌ ITMO Cancer, ITMO Technologies pour la santé de‌ l'alliance nationale pour les sciences de la vie‌ et de la santé (AVIESAN), INCa. Partners: Inria‌ and IECL (Institut Élie Cartan de Lorraine) and‌ CHRU Strasbourg. Leader: N. Champagnat. Participants: C. Fritsch,‌ U. Herbach, P. Vallois, V. Vidhi, D. Villemonais.‌
France 2030 funding
PEPR Exploratoire Maths-VivES (from July 2024), target project DyLT‌ (Dynamics of Telomere Length)‌ on “Influence of telomere‌‌ length dynamics and environmental conditions on biological and‌ clinical aspects of aging”'.‌ Funding organisms: ANR. Partners:‌‌ Inria Nancy and Saclay, Institut Élie Cartan de‌ Lorraine (Nancy), CHRU Nancy,‌ Centre de Recherche en‌‌ Cancérologie de Marseille and Institut de recherche sur‌ le cancer et le‌ vieillissement (Nice). Coordinators: N.‌‌ Champagnat and A. Benetos (CHRU Nancy). Participants: C.‌ Fritsch, A. Gégout-Petit, J.‌ Mardomingo Sanz, D. Villemonais,‌‌ S. Baland.

Projects in which the team participates:‌

France 2030 funding
PEPR‌ Santé Numérique (from July‌‌ 2023), project AI4scMed (Multiscale AI for single-cell-based precision‌ medicine), team involved in‌ WP3: “Regulatory network inference:‌‌ from dynamical models to logical models”. Funding organisms:‌ ANR. Partners: Inria, Inserm,‌ CNRS. Coordinator: F. Picard‌‌ (CNRS, ENS Lyon). Participants: M. Gaillard, U. Herbach.‌
VEOLIA funding
Chair “Modélisation‌ Mathématique et Biodiversité” between‌‌ VEOLIA, Ecole Polytechnique, Museum National d'Histoire Naturelle and‌ Fondation X. Coordinator: S.‌ Méléard. Participants: V. Brodu,‌‌ N. Champagnat, C. Fritsch, D. Villemonais.
ANR funding‌
ANR JCJC project CRESCENDO‌ (inCRease physical Exercise and‌‌ Sport to Combat ENDOmetriosis, AAPG 2022). Coordinator: G.‌ Escriva-Boulley (LISEC, Université de‌ Haute-Alsace). Participant: U. Herbach.‌‌
CNRS funding
GDR 720 IASIS. Leader: C.‌ Richard. Participant: S. Wantz-Mézières.‌
CNRS funding
GDR Réseau‌‌ Thématique MathSAV. Leader: F. Crauste. Participants: N.‌ Champagnat, C. Fritsch, U.‌ Herbach, A. Rago.

8.4‌‌ Regional initiatives

A. Gégout-Petit is one the two‌ PIs of the interdisciplinary‌ program “Life Travel” of‌‌ the I-Site “Lorraine Université d'Excellence” on life trajectories‌ and longevity (launched on‌ January 1, 2026.).
S.‌‌ Wantz-Mézières received a grant from Interdisciplinary Pilot AAP‌ Life Travel 2025, with‌ Sébastien Hergalant (NGERE).

9‌‌ Dissemination

Participants: Sophie Baland, Virgile Brodu,‌ Nicolas Champagnat, Elie‌ Cerf, Sandie Ferrigno‌‌, Coralie Fritsch, Mathilde Gaillard, Anne‌ Gégout-Petit, Ulysse Herbach‌, Vincent Kagan,‌‌ Juan Mardomingo Sanz, Jean-Marie Monnez, Anouk‌ Rago, Édouard Strickler‌, Pierre Vallois,‌‌ Vidhi Vidhi, Denis Villemonais, Sophie Wantz-Mézières‌.

9.1 Promoting scientific‌ activities

9.1.1 Scientific events:‌‌ organisation

Member of the organizing committees

Nicolas Champagnat‌ was member of the‌ organizing committee of ReaDiNet2025:‌‌ A ReaDiNet workshop on deterministic and stochastic PDEs‌, held in Obernai‌ from November 24 to‌‌ 27.

9.1.2 Scientific events: selection

Member of the‌ conference program committees

Nicolas‌ Champagnat was member of‌‌ the scientific committee of ReaDiNet2025: A ReaDiNet workshop‌ on deterministic and stochastic‌ PDEs, held in‌‌ Obernai from November 24 to 27.

9.1.3 Journal‌

Member of the editorial‌ boards

N. Champagnat is‌‌ associate editor for ESAIM: Probability & Statistics and‌ Stochastic Models
D. Villemonais‌ is associate editor for‌‌ Applied Probability Trust

9.1.4 Invited talks

N. Champagnat‌ has been invited to‌ give talks at the‌‌ Kick-off Programme de recherche Mathématiques en interaction (PEPR‌ Maths Vives) in Montpellier‌ in March, at the‌‌ 44th Conference on Stochastic Processes and their Applications‌ SPA 2025 in Wroclaw,‌ Poland in July and‌‌ at the Fall Conference‌ “Model design, optimization & control” of the thematic‌ semester on Population Dynamics in Nice in October.‌ He also gave a seminar talk at “Séminaire‌ du Pôle Probabilités” of CMAP (Ecole Polytechnique) in‌ Palaiseau in April.
N. Champagnat, E. Strickler and‌ D. Villemonais have been invited to give talks‌ at the Workshop on quasi-stationnary distributions held at‌ CERMICS, Marne-la-Vallée, May 21 to 23.
C. Fritsch‌ has been invited to give a talk in‌ a minisymposium at the 56ème Journées de Statistiques‌ in Marseille in June. She also gave a‌ talk during the scientific meeting between the Pôle‌ AM2I and the Chamber of agriculture in Nancy‌ in June.
U. Herbach has been invited to‌ give a talk (hands-on tutorial session) at the‌ Advanced Lecture Course on Computational Systems Biology (‌CompSysBio 2025) in Aussois in October.
V.‌ Vidhi gave poster presentation at the European Mathematical‌ Genetics Meeting organized by University of Brest in‌ April and invited talks at International Conference on‌ Mathematical Methods and Model in Biosciences and School‌ for Young Scientists in Bulgaria in June and‌ at InterDoctoral Conference of Lorraine and Luxembourg at‌ IECL, University of Lorraine in November.
D. Villemonais‌ gave invited talks at the “Séminaire de probabilité‌ et mathématiques financières” of Laboratoire de Mathématiques et‌ Modélisation d'Évry in June, at the Congrès SMF‌ in Dijon in June, at the 13th Aging‌ Research and Drug Discovery Meeting in Copenhagen in‌ August, at the 12th International Conference on Stochastic‌ Analysis and its Applications in Bucarest in September,‌ and at the “Chilean Probability Seminar” of Santiago‌ (online) in October.

9.1.5 Scientific expertise

N. Champagnat‌ has been a member of the Committee for‌ junior permanent research positions of Centre Inria de‌ Lille.
N. Champagnat and C. Fritsch were members‌ of the Committee for career advancement of Inria‌ personnel.
C. Fritsch has been a member of‌ the Committee for junior permanent research positions of‌ Centre Inria de l'Université de Bordeaux and Centre‌ Inria de Paris.
A. Gégout-Petit has been member‌ of four recruitment committes (Nancy (MCF), Montpellier University‌ (PR), Troyes Technological University (PR) and Institut Henri‌ Poincaré (Director)).
D. Villemonais has been a member‌ of a recruitment comittee (Strasbourg, PU).

9.1.6 Research‌ administration

V. Brodu was an elected representative of‌ doctoral students at the doctoral school committee (local‌ scale), and also at the doctoral college committee‌ (regional scale). He was then replaced by M.‌ Gaillard.
N. Champagnat is elected member of the‌ Commission d'Evaluation of Inria, member of the COMIPERS‌ (hiring committee for non-permanent positions) of Centre Inria‌ de Nancy, substitute member of the Comité de‌ Centre of Centre Inria de Nancy and local‌ researcher (correspondant local) representing the COERLE (Inria's Ethic‌ Committee) at Centre Inria de Nancy.
C. Fritsch‌ is elected member of the Commission d'Evaluation of‌ Inria.
A. Gégout-Petit is director of the research‌ unit IECL (Institut Elie Cartan de Lorraine), Mathematics‌ laboratory of Univ. Lorraine (200 members).
S. Wantz-Mézières is substitute member of‌ the CNU, section 26,‌ college B.
D. Villemonais‌‌ is head of the Probability Team in Strasbourg,‌ member of the MSII‌ doctoral school board in‌‌ University of Strasbourg, and elected member of the‌ “Conseil de l'UFR de‌ mathématiques et informatique” in‌‌ University of Strasbourg.

9.2 Teaching - Supervision -‌ Juries - Educational and‌ pedagogical outreach

9.2.1 Teaching‌‌

S. Ferrigno is in charge of the “DU‌ Big Data & Data‌ Science” in ENSMN, Univ.‌‌ Lorraine.
D. Villemonais is head of L3 DUAS‌ (actuarial science) in Univ.‌ Strasbourg.
Master: V. Brodu,‌‌ Distribution theory and PDEs, 40h, M1, second year‌ of ENSEM, Univ. Lorraine.‌
Master: V. Brodu, Monte-Carlo‌‌ methods, 20h, M1, second year of ENSMN, Univ.‌ Lorraine.
Master: N. Champagnat,‌ Introduction to Quantitative Finance,‌‌ 12h, M1, second year of ENSMN, Univ. Lorraine.‌
Master: N. Champagnat, Introduction‌ to Quantitative Finance, 9h,‌‌ M2, third year of ENSMN, Univ. Lorraine.
Master:‌ S. Ferrigno, Experimental designs,‌ 6h, M1, fourth year‌‌ of EEIGM, Univ. Lorraine.
Master: S. Ferrigno, Data‌ analyzing and mining, 36h,‌ M1, second year of‌‌ ENSMN, Univ. Lorraine.
Master: S. Ferrigno, Modeling and‌ forecasting, 32h, M1, second‌ year of ENSMN, Univ.‌‌ Lorraine.
Master: S. Ferrigno, Training projects, 18h, M1/M2,‌ second and third year‌ of ENSMN, Univ. Lorraine.‌‌
Master: C. Fritsch, Inverse problem, 18h, M1, second‌ year of ENSMN, Univ.‌ Lorraine.
Master: A. Gégout-Petit,‌‌ Inferential statistics, 20h, M1 IMSD, Univ. Lorraine.
Master:‌ A. Gégout-Petit, Complex data‌ modelling, 30h, M2 IMSD,‌‌ Univ. Lorraine.
Master: D. Villemonais, Probability M1 first‌ and second semester, 56h‌
Master: D. Villemonais, Tutoring‌‌ of student in actuarial science (stage d'alternance)
Agregation‌ of mathematics training: Denis‌ Villemonais, Modelling lesson
Master:‌‌ S. Wantz-Mézières, Unsupervised Learning, 25h, M2 IMSD, Univ.‌ Lorraine
Master: S. Wantz-Mézières,‌ Advanced image analysis and‌‌ digital optimization, M2 IS-SNIM, 36h, Univ. Lorraine.
Licence:‌ S. Baland, Applied probabilities,‌ 20h, L2 Informatique, Univ.‌‌ Lorraine.
Licence: S. Baland, Mathematical tools for biology,‌ 22h, L1 Sciences de‌ la Vie, Univ. Lorraine.‌‌
Licence: S. Baland, Probabilities, 20h, L2 Informatique, Univ.‌ Lorraine.
Licence: V. Brodu,‌ Probability theory and Statistics,‌‌ 60h, L3, first year of ENSEM, Univ. Lorraine.‌
Licence: V. Brodu, Operational‌ Research, 30h, L3, first‌‌ year of ENSMN, Univ. Lorraine.
Licence: V. Brodu,‌ Mathematical tools for engineers,‌ 20h, L3, first year‌‌ of ENSEM, Univ. Lorraine.
Licence: E. Cerf, Complementary‌ Analysis, 70h, L1 Mathématiques,‌ Univ. Lorraine.
Licence: E.‌‌ Cerf, Mathematics, 12h, L1 Professorat des Ecoles, Univ.‌ Lorraine.
Licence: E. Cerf,‌ Mathematical tools for biology,‌‌ 22h, L1 Sciences de la Vie, Univ. Lorraine.‌
Licence: S. Ferrigno, Descriptive‌ and inferential statistics, 60h,‌‌ L2, second year of EEIGM, Univ. Lorraine.
Licence:‌ S. Ferrigno, Statistical modeling,‌ 60h, L2, second year‌‌ of EEIGM, Univ. Lorraine.
Licence: S. Ferrigno, Mathematical‌ and computational tools, 20h,‌ L3, third year of‌‌ EEIGM, Univ. Lorraine.
Licence: S. Ferrigno, Training projects,‌ 40h, L1/L3, first, second‌ and third year of‌‌ EEIGM, Univ. Lorraine.
Licence: M. Gaillard, Inférence statistique,‌ 40h, L3, first year‌ of ENSEM, Univ. Lorraine.‌‌
Licence: V. Kagan, Probability‌ theory tutorial, 40h, L3, first year of ENSMN,‌ Univ. Lorraine.
Licence: V. Kagan, Numerical Analysis tutorial,‌ 20h, L3, first year of ENSMN, Univ. Lorraine.‌
Licence: J. Mardomingo Sanz, Analyse numérique et optimisation,‌ 40h, L3, first year of ENSMN, Univ. Lorraine.‌
Licence: S. Wantz-Mézières, Applied Mathematics: Probability, 48h, L3,‌ first year of TELECOM-NANCY, Univ. Lorraine.
Licence: S.‌ Wantz-Mézières, Mathematical tools for management and Finance, 180h,‌ L1/L2 first and second year of BUT GEA,‌ IUT Nancy-Charlemagne, Univ. Lorraine.

9.2.2 Supervision

PhD

PhD:‌ Virgile Brodu, “Stochastic individual-based models with allometric dynamics:‌ branching, convergence, numerical simulations”, grant ENS Lyon. Advisors:‌ S. Billiard (Univ. Lille), N. Champagnat, C. Fritsch.‌ Defense on August, 27 24.
PhD: Anouk‌ Rago, “Inférence de réseaux de gènes dynamiques et‌ prédiction d'expériences d'interventions biologiques dans des cellules cancéreuses”,‌ grant Région Grand-Est and Inria. Advisors: N. Champagnat,‌ A. Gégout-Petit. Defense on June, 30.
PhD in‌ progress: Sophie Baland, “Telomere length dynamics : modelisation,‌ estimation and application to diagnostic support systems”, funding‌ LUE, since September 2023. Advisors: S. Toupance (Univ.‌ Lorraine) and D. Villemonais.
PhD in progress: Mathilde‌ Gaillard, “Processus de Markov déterministes par morceaux et‌ inférence bayésienne de réseaux de gènes”, grant PEPR‌ Santé Numérique, since October 2023. Advisors: A. Gégout-Petit,‌ U. Herbach.
PhD in progress: Anouar Jeddi, “Convergence‌ of individual-based population models to Hamilton-Jacobi equations”, grant‌ ERC SINGER (Ecole Polytechnique), since September 2023. Advisors:‌ S. Méléard (Ecole Polytechnique) and N. Champagnat.
PhD‌ in progress: Vincent Kagan, “Asymptotic behavior of epidemiological‌ models with individual viral load”, funding Université de‌ Lorraine, since September 2023. Advisors: E. Strickler (Univ.‌ Lorraine) and D. Villemonais.
PhD in progress: Juan‌ Mardomingo Sanz, “Stochastic modeling and estimation of the‌ distribution of elongation and abrupt shortening of ALT‌ cells in yeast”, funding PEPR Math-VivES, since October‌ 2025. Advisors: N. Champagnat, C. Fritsch, D. Villemonais.‌
PhD in progress: Vidhi Vidhi, “Stochastic modeling and‌ statistics for quantifying the evolution of tumor heterogeneity‌ in chronic lymphocytic leukemia”, funding ITMO Cancer, since‌ October 2024. Advisors: N. Champagnat, C. Fritsch, U.‌ Herbach.
PhD in progress: Léo Gérard, “Development of‌ a tool to assist awake brain tumor surgery‌ based on statistical learning methods”, funding CRAN BioSIS,‌ since october 2024. Advisors: J.M. Moureaux (CRAN), F.‌ Rech (CHRU- CRAN), S. Wantz-Mézières.

HDR

HDR: Coralie‌ Fritsch, “Asymptotic behavior of probabilistic models for biology:‌ criticality of processes, limit theorems, quasi-stationary behavior, model‌ approximation”. Defense on December, 15 25.

Other‌

ENSMN third year (M2) internship: M. Ammar, “Machine‌ learning tasks in the context of air freight‌ through the Cargostack application”. Advisors: M. Bleu (Wiremind‌ cargo), S. Ferrigno (ENSMN).
ENSMN third year (M2)‌ internship: A. Blanchard, “Development of a rating model‌ specific to badminton players”. Advisors: E.Hollville (Fédération française‌ de badminton), S. Ferrigno (ENSMN).
M2 internship: Roxane‌ Cellier (Sorbonne Univ.), “Inference of kinetics for large‌ multi-scale chemical reaction networks”. Advisors: U. Herbach and‌ J. Unterberger (IECL).
M2 internship: Juan Mardomingo Sanz‌ (M2 Mathématiques pour les Sciences du Vivant of Paris-Saclay Univ.), “Stochastic modeling‌ and estimation of the‌ distribution of elongation of‌‌ ALT cells in yeast”, funded by the interdisciplinary‌ program “Life Travel” of‌ the I-Site “Lorraine Université‌‌ d'Excellence”, from April to September 2025. Advisors: N.‌ Champagnat, C. Fritsch, D.‌ Villemonais.
M2 internship: Mattheo‌‌ Rapenne (Univ. Lorraine), “Diagnosis of heart failure in‌ the emergency service”. Advisors:‌ A. Gégout-Petit (IECL), N.‌‌ Girerd (CHRU Nancy).
M1 internship: Lorenzo Boussion (ENS‌ Paris-Saclay), “Necessary condition for‌ dispersal induced growth on‌‌ time periodic networks”. Advisor: E. Strickler.
M1 internship:‌ Thibaut Pannet (ENSTA Paris),‌ “Variational Bayesian inference for‌‌ single-cell transcriptomic data”. Advisor: U. Herbach.
TELECOM-NANCY PIDR‌ second-year project: N. Chatonnier,‌ R. Samba, N. Bermond,‌‌ “Comparison of deep learning models for automatic speech‌ recognition: application to awake‌ brain surgery”. Advisors: S.‌‌ Wantz-Mézières, with J.M. Moureaux and L. Gérard (CRAN).‌
ENSMN second year (M1)‌ Department project: Y. Souidi,‌‌ Y. Laribi, “Multiple linear regression and regularization methods”.‌ Advisor: S. Ferrigno.

9.2.3‌ Juries

N. Champagnat was‌‌ referee for the PhD thesis of Nathanaël Boutillon‌ (Aix-Marseille Univ., 11/06/2025).
N.‌ Champagnat, C. Fritsch and‌‌ A. Gégout-Petit were examiners for the PhD thesis‌ of Virgile Brodu (Univ.‌ Lorraine, 27/08/2025).
N. Champagnat‌‌ and A. Gégout-Petit were examiners for the PhD‌ thesis of Anouk Rago‌ (Univ. Lorraine, 30/06/2025).
N.‌‌ Champagnat and A. Gégout-Petit were examiners for the‌ HDR jury of Coralie‌ Fritsch (Univ. Lorraine, 15/12/2025).‌‌
A. Gégout-Petit was president the PhD jury of‌ Trinh Duong (Lorraine University,‌ 06/2025).
A. Gégout-Petit was‌‌ referee for the PhD jury of Cristina Chavez‌ (Nanterre University, 04/2025).
A.‌ Gégout-Petit was referee for‌‌ the PhD jury of Julie Cartier (PSL University,‌ 11/2025).
A. Gégout-Petit was‌ referee for the PhD‌‌ jury of Valentin Portmann (Univ. Bordeaux, 11/2025)
S.‌ Wantz-Mézières was examiner for‌ the PhD thesis of‌‌ Ghislain Fievet (NGERE, Univ. Lorraine, 20/11/2025).
U. Herbach‌ was examiner for the‌ PhD thesis of Emrys‌‌ Reginato (Univ. Grenoble Alpes, 24/11/2025).
U. Herbach was‌ examiner for the PhD‌ thesis of Gustavo Magaña‌‌ Loópez (Univ. Bordeaux, 17/12/2025).
E. Strickler was referee‌ for the PhD thesis‌ of Jérémy Colombo (Univ.‌‌ Neuchâtel, 20/11/25).
D. Villemonais was referee for the‌ PhD thesis of Jules‌ Olayé (Ecole Polytechnique, Palaiseau,‌‌ 04/07/2025)

9.3 Popularization

J.-M. Monnez wrote lecture notes‌ 38 on the interpretation‌ of canonical analysis of‌‌ two random vectors as a projected principal component‌ analysis (PCA), an extended‌ Oja process for estimating‌‌ eigenvectors and stochastic approximation for streaming canonical correlation,‌ factorial correspondence and factorial‌ discriminant analyses.

9.3.1 Specific‌‌ official responsibilities in science outreach structures

S. Ferrigno:‌ Advisor of groups of‌ EEIGM students in the‌‌ context of “La main à la Pâte” projects‌ and “CGénial” projects, at‌ middle schools Paul Verlaine‌‌ in Malzéville and La Craffe in Nancy, at‌ Institut médico-éducatif (IME) in‌ Commercy and in elementary‌‌ schools in Nancy.
S. Ferrigno: Advisor of a‌ group of EEIGM students,‌ “Ateliers expérimentaux : Mécanique‌‌ et Statistique” project, in various high schools in‌ Nancy.

9.3.2 Participation in‌ Live events

J. Mardomingo‌‌ Sanz and V. Brodu‌ volunteered in the popularization event “Fête de la‌ Science” in Univ. Lorraine in October.

10 Scientific‌ production

10.1 Major publications

1 articleR.Romain‌ Azaïs, S.Sandie Ferrigno and M.-J.Marie-José‌ Martinez. cvmgof: an R package for Cramér-von‌ Mises goodness-of-fit tests in regression models.Journal‌ of Statistical Computation and Simulation9262022‌, 1246-1266HAL DOIback to text
2‌ articleN.Nicolas Champagnat and V.Vincent Hass‌. Convergence of population processes with small and‌ frequent mutations to the canonical equation of adaptive‌ dynamics.The Annals of Applied Probability35‌1February 2025, 1-63HAL DOI back‌ to text back to text
3 inproceedingsN.‌Nicolas Champagnat, S.Sylvie Méléard and V.‌ C.Viet Chi Tran. Multiscale eco-evolutionary models:‌ from individuals to populations.International Congress of‌ Mathematicians, ICM 202271fully virtually, Russia‌EMS PressDecember 2023, 5656-5678HAL DOI‌
4 articleN.Nicolas Champagnat and D.Denis‌ Villemonais. General criteria for the study of‌ quasi-stationarity.Electronic Journal of Probability2023.‌ In press. HAL DOIback to text back‌ to text
5 articleE.Edmée Eyraud,‌ E.Elise Maurat, J.-M.Jean-Marc Sac-Epee,‌ P.Pauline Henrot, M.Maeva Zysman,‌ P.Pauline Esteves, T.Thomas Trian,‌ H.Hugues Bégueret, P.-O.Pierre-Oliver Girodet,‌ M.Matthieu Thumerel, R.Romain Hustache-Castaing,‌ R.Roger Marthan, F.Florian Levet,‌ P.Pierre Vallois, C.Cécile Contin-Bordes,‌ P.Patrick Berger and I.Isabelle Dupin.‌ Short-range interactions between fibrocytes and CD8+ T cells‌ in COPD bronchial inflammatory response.eLifeOctober‌ 2022HAL DOI
6 miscC.Coralie Fritsch‌, M.Marie Grosdidier, A.Anne Gégout-Petit‌ and B.Benoit Marçais. Mechanistic-statistical model for‌ the expansion of ash dieback.September 2024‌HAL
7 articleC.Coralie Fritsch, D.‌Denis Villemonais and N.Nicolás Zalduendo. The‌ Multi-type Bisexual Galton-Watson Branching Process.Annales de‌ l'Institut Henri Poincaré (B) Probabilités et Statistiques2024‌, 36p.In press. HAL DOI
8 article‌J.-M.Jean-Marie Monnez. Stochastic approximation of eigenvectors‌ and eigenvalues of the Q-symmetric expectation of a‌ random matrix.Communications in Statistics - Theory‌ and Methods5352024, 1669-1683HAL‌DOI
9 articleS.Simon Toupance, D.‌Denis Villemonais, D.Daphné Germain, A.‌Anne Gégout-Petit, E.Eliane Albuisson and A.‌Athanase Benetos. The individual’s signature of telomere‌ length distribution.Scientific Reports91January‌ 2019, 1-8HALDOI back to text‌
10 articleE.Elias Ventre, U.Ulysse‌ Herbach, T.Thibault Espinasse, G.Gérard‌ Benoit and O.Olivier Gandrillon. One model‌ fits all: Combining inference and simulation of gene‌ regulatory networks.PLoS Computational Biology193‌March 2023, e1010962HAL DOI back to‌ text back to text

10.2 Publications of the year

International journals

11‌ articleM.Michel Benaïm‌, N.Nicolas Champagnat‌‌, W.William Oçafrain and D.Denis Villemonais‌. Degenerate processes killed‌ at the boundary of‌‌ a domain.The Annals of Probability53‌2March 2025,‌ 720-752HAL DOI back‌‌ to text
12 articleM.Michel Benaïm,‌ C.Claude Lobry,‌ T.Tewfik Sari and‌‌ É.Édouard Strickler. A note on the‌ top Lyapunov exponent of‌ linear cooperative systems.‌‌Annales de la Faculté des Sciences de Toulouse.‌ Mathématiques.341June‌ 2025, 225-241HAL‌‌DOI back to text
13 articleM.Michel‌ Benaim, C.Claude‌ Lobry, T.Tewfik‌‌ Sari and E.Edouard Strickler. Dispersal-induced growth‌ or decay in a‌ time-periodic environment. The case‌‌ of reducible migration matrices.Journal of Mathematical‌ Biology9132025‌, 26HAL DOI‌‌back to text
14 articleA.Athanase Benetos‌, C.Coralie Fritsch‌, E.Emma Horton‌‌, L.Lionel Lenotre, S.Simon Toupance‌ and D.Denis Villemonais‌. Stochastic branching models‌‌ for the telomeres dynamics in a model including‌ telomerase activity.Journal‌ of Mathematical Biology91‌‌8June 2025HALDOI back to text‌
15 articleN.Nicolas‌ Champagnat and V.Vincent‌‌ Hass. Convergence of population processes with small‌ and frequent mutations to‌ the canonical equation of‌‌ adaptive dynamics.The Annals of Applied Probability‌351February 2025‌, 1-63HAL DOI‌‌back to text
16 articleN.Nicolas Champagnat‌, É.Édouard Strickler‌ and D.Denis Villemonais‌‌. Uniform Wasserstein convergence of penalized Markov processes‌.Probability Theory and‌ Related Fields1923-4‌‌2025, 1031-1069HALDOI back to text‌
17 articleC.Coralie‌ Fritsch, D.Denis‌‌ Villemonais and N.Nicolás Zalduendo. Quasi-limiting behaviour‌ of the sub-critical Multitype‌ Bisexual Galton-Watson Branching Process‌‌.Bernoulli321February 2026, 798‌ -- 822HAL DOI‌back to text
18‌‌ articleC.Cécile Mangavel, C.Chloé Gapp‌, M.Magda Corgneau‌, A.Alexis Dijamentiuk‌‌, X.Xincheng Liu, A.Annelore Elfassy‌, L.Laurent Guillier‌, G. B.Ghaya‌‌ Ben Hmidene, S.Sandie Ferrigno, M.‌Mickaël Desvaux, A.-M.‌Anne-Marie Revol-Junelles and F.‌‌Frédéric Borges. Harnessing ecological niche modeling of‌ Listeria monocytogenes for biopreservation‌ system engineering.Food‌‌ Microbiology133January 2026, 104865HAL DOI‌back to text
19‌ articleT.Tracy Milane‌‌, U.Ulysse Herbach, M.-A.Marie-Anne Jean‌ and G.Géraldine Escriva-Boulley‌. Promoting physical activity‌‌ and limiting sedentary behaviors to manage pain in‌ endometriosis: What are the‌ psychosocial variables to take‌‌ into account?Women's Reproductive HealthNovember 2025,‌ 1-21HAL DOI back‌ to text
20 article‌‌J.-M.Jean-Marie Monnez. An extended Oja process‌ for streaming canonical analysis‌.Communications in Statistics‌‌ - Theory and Methods2025HAL DOI back‌ to text
21 article‌F.Francisco Prista von‌‌ Bonhorst, O.Olivier‌ Gandrillon, U.Ulysse Herbach, C.Corentin‌ Robert, C.Claire Chazaud, Y.Yannick‌ de Decker, D.Didier Gonze and G.‌Geneviève Dupont. Uncovering candidate Nanog-Helper genes in‌ early mouse embryo differentiation using differential entropy and‌ network inference.Scientific Reports151June‌ 2025, 19975HALDOI back to text‌
22 articleP.Paavo Salminen and P.Pierre‌ Vallois. Drawdowns of Diffusions.ESAIM: Probability‌ and Statistics292025, 357-380HAL DOI‌back to text

Conferences without proceedings

23 inproceedings‌M.Mathilde Gaillard and U.Ulysse Herbach.‌ Efficient Stochastic Simulation of Gene Regulatory Networks Using‌ Hybrid Models of Transcriptional Bursting.Computational Methods‌ in Systems Biology15959Lecture Notes in Computer‌ ScienceLyon, FranceSpringer Nature SwitzerlandAugust 2025‌, 109-125HAL DOIback to text back‌ to text

Doctoral dissertations and habilitation theses

24‌ thesisV.Virgile Brodu. Individual-based stochastic models‌ with allometric dynamics : branching, convergence, numerical simulations‌.Université de LorraineAugust 2025HAL back‌ to text
25 thesisC.Coralie Fritsch.‌ Comportement asymptotique de modèles probabilistes pour la biologie‌ : criticité de processus, théorèmes limites, comportement quasi-stationnaire,‌ approximation de modèles.Université de LorraineDecember‌ 2025HAL back to text

Reports & preprints‌

26 miscS.Sylvain Billiard, V.Virgile‌ Brodu, N.Nicolas Champagnat and C.Coralie‌ Fritsch. An individual-based stochastic model reveals strong‌ constraints on allometric relationships with minimal metabolic and‌ ecological assumptions.January 2025HAL back to‌ text
27 miscN.Nicolas Champagnat, R.‌Rodolphe Loubaton, L.Laurent Vallat and P.‌Pierre Vallois. Modelling the effects of biological‌ intervention in a dynamical gene network.May‌ 2025HAL back to text
28 miscN.‌Nicolas Champagnat, P.Pablo Marquet, C.‌Cristóbal Quiñinao, R.Rolando Rebolledo, M.‌Mauricio Tejo, L.Leonardo Videla and N.‌Nicolás Zalduendo Vidal. On ecological models with‌ sublinear growth.July 2025, 22 p.‌HAL back to text
29 miscN.Nicolas‌ Champagnat and D.Denis Villemonais. Quasi-stationary distributions‌ in reducible state spaces.October 2025HAL‌back to text
30 miscA. M.A‌ M G Cox, E.E Horton and‌ D.Denis Villemonais. Linear and uniform in‌ time bound for the binary branching model with‌ Moran type interactions.January 2025HAL back‌ to text
31 miscC.Clémence Fournié,‌ E.Elias Ventre, U.Ulysse Herbach,‌ A.Aymeric Baradat, O.Olivier Gandrillon and‌ F.Fabien Crauste. Cell Trajectory Inference based‌ on Schrödinger Problem and a Mechanistic Model of‌ Stochastic Gene Expression.November 2025HAL back‌ to text
32 miscT.Tobias Hurth and‌ E.Edouard Strickler. Ergodicity and regularity properties‌ of ODEs with semi-Markov switching.September 2025‌HAL back to text
33 miscV.Vincent‌ Kagan, E.Edouard Strickler and D.Denis Villemonais. Averaging principle‌ for jump processes depending‌ on fast ergodic dynamics‌‌.October 2025HALback to text
34‌ miscD.Denis Villemonais‌. Quasi-compactness for dominated‌‌ kernels with application to quasi-stationary distribution theory.‌October 2025HAL back‌ to text
35 misc‌‌D.Denis Villemonais and N.Nicolas Zalduendo.‌ Convergence of weighted branching‌ processes.December 2025‌‌HAL back to text

Other scientific publications

36‌ inproceedingsS.Sandie Ferrigno‌ and M.-J.Marie-José Martinez‌‌. A cross-validation bandwidth choice for nonparametric tests‌ in regression models.‌CMStatistics 2025London, United‌‌ KingdomDecember 2025HALback to text
37‌ inproceedingsL.Léo Gerard‌, L.Laetitia Bau‌‌, S.Sophie Wantz-Mézières, J.-M.Jean-Marie Moureaux‌ and F.Fabien Rech‌. AN ORIGINAL AUTOMATIC‌‌ SPEECH RECOGNITION DECISION AID TOOL TO FACILITATE AWAKE‌ BRAIN SURGERY.14e‌ Forum du Cancéropôle Est‌‌Besançon (France), FranceNovember 2025HAL back to‌ text

Educational activities

38‌ unpublishedJ.-M.Jean-Marie Monnez‌‌. Streaming data analysis Canonical analysis of two‌ random vectors.March‌ 2025, DoctoralFrance‌‌HAL back to text

10.3 Cited publications

39‌ bookC. D.Charalambos‌ D Aliprantis and O.‌‌Owen Burkinshaw. Positive operators.119Springer‌ Science & Business Media‌2006back to text‌‌
40 articleD. F.David F. Anderson,‌ A.Arnab Ganguly and‌ T. G.Thomas G.‌‌ Kurtz. Error analysis of tau-leap simulation methods‌.Ann. Appl. Probab.‌2162011,‌‌ 2226--2262URL: https://doi.org/10.1214/10-AAP756DOIback to text
41‌ unpublishedR.Romain Aza\"is‌, S.Sandie Ferrigno‌‌ and M.-J.Marie-José Martinez. cvmgof: Cramer-von Mises‌ goodness-of-fit tests.November‌ 2018, An R-package,‌‌ available on the CRANHAL Software Heritage back‌ to text
42 unpublished‌B.Bérangère Bastien,‌‌ T.Taha Boukhobza, H.Hélène Dumond,‌ A.Anne Gégout-Petit,‌ A.Aurélie Muller-Gueudin and‌‌ C.Charlène Thiébaut. A statistical methodology to‌ select covariates in high-dimensional‌ data under dependence. Application‌‌ to the classification of genetic profiles in oncology‌.September 2019,‌ https://arxiv.org/abs/1909.05481 - working paper‌‌ or preprintHAL back to text
43 article‌M.Michel Bena\"im,‌ N.Nicolas Champagnat and‌‌ D.Denis Villemonais. Stochastic approximation of quasi-stationary‌ distributions for diffusion processes‌ in a bounded domain‌‌.Annales de l'Institut Henri Poincaré, Probabilités et‌ Statistiques5722021‌, 726 -- 739‌‌URL: https://doi.org/10.1214/20-AIHP1093DOI back to text
44 article‌M.Michel Benaim,‌ B.Bertrand Cloez and‌‌ F.Fabien Panloup. Stochastic approximation of quasi-stationary‌ distributions on compact spaces‌ and applications.Ann.‌‌ Appl. Probab.2842018, 2370--2416URL:‌ https://doi.org/10.1214/17-AAP1360DOI back to‌ text
45 articleM.‌‌Michel Benaim, C.Claude Lobry, T.‌Tewfik Sari and É.‌Édouard Strickler. When‌‌ can a population spreading across sink habitats persist?‌Journal of Mathematical Biology‌8819January 2024‌‌, 1-56HAL DOIback to text back‌ to text
46 article‌A.Athanase Benetos and‌‌ others. Short leukocyte‌ telomere length precedes clinical expression of atherosclerosis: The‌ blood-and-muscle model.Circulation research1224Feb‌ 2018, 616--623DOIback to text
47‌ articleA.Arnaud Bonnaffoux, U.Ulysse Herbach‌, A.Angélique Richard, A.Anissa Guillemin‌, S.Sandrine Gonin-Giraud, P.-A.Pierre-Alexis Gros‌ and O.Olivier Gandrillon. WASABI: a dynamic‌ iterative framework for gene regulatory network inference.‌BMC Bioinformatics2012019, 1--19back‌ to text
48 articleF.Fabien Campillo and‌ N.Nicolas Champagnat. Simulation and analysis of‌ an individual-based model for graph-structured plant dynamics.‌Ecological Modelling2342012, 93--105back to‌ text back to text
49 articleF.Fabien‌ Campillo and C.Coralie Fritsch. Weak convergence‌ of a mass-structured individual-based model.Appl. Math.‌ Optim.7212015, 37--73URL: https://doi.org/10.1007/s00245-014-9271-3‌DOI back to textback to text
50‌ articleH.Hervé Cardot, P.Peggy Cénac‌ and J.-M.Jean-Marie Monnez. A fast and‌ recursive algorithm for clustering large datasets with k-medians‌.Computational Statistics & Data Analysis566‌2012, 1434--1449back to text back to‌ text
51 inproceedingsP.Philippe Carmona. Asymptotic‌ of the largest Floquet multiplier for cooperative matrices‌.Annales de la Faculté des sciences de‌ Toulouse: Mathématiques3142022, 1213--1221back‌ to text
52 articleN.Nicolas Champagnat.‌ A microscopic interpretation for adaptive dynamics trait substitution‌ sequence models.Stoch. Process. Appl.1168‌2006, 1127--1160back to text back to‌ text
53 articleN.Nicolas Champagnat, R.‌Régis Ferrière and S.Sylvie Méléard. From‌ individual stochastic processes to macroscopic models in adaptive‌ evolution.Stoch. Models24suppl. 12008‌, 2--44URL: http://dx.doi.org/10.1080/15326340802437710DOI back to text‌
54 articleN.Nicolas Champagnat, R.Régis‌ Ferrière and S.Sylvie Méléard. Unifying evolutionary‌ dynamics: From individual stochastic processes to macroscopic evolution‌.Theor. Pop. Biol.692006, 297--321‌back to text back to text
55 article‌N.Nicolas Champagnat and P.-E.Pierre-Emmanuel Jabin.‌ The evolutionary limit for models of populations interacting‌ competitively via several resources.J. Differential Equations‌25112011, 176--195URL: https://doi.org/10.1016/j.jde.2011.03.007DOI‌back to text
56 articleN.Nicolas Champagnat‌ and S.Sylvie Méléard. Polymorphic evolution sequence‌ and evolutionary branching.Probab. Theory Related Fields‌1511-22011, 45--94URL: https://doi.org/10.1007/s00440-010-0292-9DOI‌back to text back to text back to‌ text
57 articleN.Nicolas Champagnat, S.‌Sylvie Méléard and V. C.Viet Chi Tran‌. Stochastic analysis of emergence of evolutionary cyclic‌ behavior in population dynamics with transfer.The‌ Annals of Applied Probability3142021,‌ 1820--1867back to text
58 articleN.Nicolas‌ Champagnat and D.Denis Villemonais. Convergence of‌ the Fleming-Viot process toward the minimal quasi-stationary distribution‌.ALEA - Latin American Journal of Probability‌ and Mathematical Statisticsto appear2019back to text
59 articleN.‌Nicolas Champagnat and D.‌Denis Villemonais. Exponential‌‌ convergence to quasi-stationary distribution and $Q$ -process.‌Probab. Theory Related Fields‌1641-22016,‌‌ 243--283URL: https://doi.org/10.1007/s00440-014-0611-7DOIback to text back‌ to text
60 article‌B.Bertrand Cloez and‌‌ C.Coralie Fritsch. Gaussian approximations for chemostat‌ models in finite and‌ infinite dimensions.J.‌‌ Math. Biol.7542017, 805--843URL:‌ https://doi.org/10.1007/s00285-017-1097-6DOI back to‌ text
61 inproceedingsA.‌‌ M.Alexander MG Cox, E.Emma Horton‌ and D.Denis Villemonais‌. Binary branching processes‌‌ with Moran type interactions.Annales de l'Institut‌ Henri Poincare (B) Probabilites‌ et statistiques612‌‌Institut Henri Poincaré2025, 917--952back to‌ text
62 incollectionD.‌ A.Donald A. Dawson‌‌. Measure-valued Markov processes.École d'Été de‌ Probabilités de Saint-Flour XXI---1991‌1541Lecture Notes in‌‌ Math.BerlinSpringer1993, 1--260back to‌ text
63 articleU.‌Ulf Dieckmann and R.‌‌Richard Law. The dynamical theory of coevolution:‌ a derivation from stochastic‌ ecological processes.J.‌‌ Math. Biol.345-61996, 579--612back‌ to text
64 article‌O.Odo Diekmann,‌‌ P.-E.Pierre-Emanuel Jabin, S.Stéphane Mischler and‌ B.Benoît Perthame.‌ The dynamics of adaptation:‌‌ An illuminating example and a Hamilton-Jacobi approach.‌Theor. Pop. Biol.67‌2005, 257--271back‌‌ to text
65 articleW.Weiwei Ding and‌ T.Thomas Giletti.‌ Admissible speeds in spatially‌‌ periodic bistable reaction-diffusion equations.Advances in Mathematics‌3892021, 107889‌back to text
66‌‌ articleK.Kévin Duarte, J.-M.Jean-Marie Monnez‌ and E.Eliane Albuisson‌. Sequential linear regression‌‌ with online standardized data.Plos one13‌12018, e0191186‌back to text
67‌‌ articleG. R.Gilles R. Ducharme and S.‌Sandie Ferrigno. An‌ omnibus test of goodness-of-fit‌‌ for conditional distributions with applications to regression models‌.J. Statist. Plann.‌ Inference142102012‌‌, 2748--2761URL: https://doi.org/10.1016/j.jspi.2012.04.008DOI back to text‌
68 articleS. N.‌S. N. Ethier and‌‌ T. G.Thomas G. Kurtz. Fleming-Viot processes‌ in population genetics.‌SIAM J. Control Optim.‌‌3121993, 345--386back to text‌
69 unpublishedC.Coralie‌ Fritsch, M.Marie‌‌ Grosdidier, A.Anne Gégout-Petit and B.Benoit‌ Marçais. Mechanistic-statistical model‌ for the expansion of‌‌ ash dieback.September 2024, working paper‌ or preprintHAL back‌ to text
70 article‌‌C.Coralie Fritsch, J.Jérôme Harmand and‌ F.Fabien Campillo.‌ A modeling approach of‌‌ the chemostat.Ecological Modelling2014back to‌ text
71 articleC.‌Coralie Fritsch, D.‌‌Denis Villemonais and N.Nicolás Zalduendo. The‌ Multi-type Bisexual Galton-Watson Branching‌ Process.Annales de‌‌ l'Institut Henri Poincaré (B) Probabilités et Statistiques60‌42024, 2975-3008‌HAL DOI back to‌‌ text
72 articleA.Anne Gégout-Petit, A.‌Aurélie Gueudin-Muller and C.‌Clémence Karmann. The‌‌ revisited knockoffs method for‌ variable selection in L 1-penalized regressions.Communications‌ in Statistics - Simulation and Computation00‌2020, 1-14URL: https://doi.org/10.1080/03610918.2020.1775850DOI back to‌ text
73 articleU.Ulysse Herbach, A.‌Arnaud Bonnaffoux, T.Thibault Espinasse and O.‌Olivier Gandrillon. Inferring gene regulatory networks from‌ single-cell data: a mechanistic approach.BMC Syst.‌ Biol.1112017back to text back‌ to text back to text
74 articleP.-E.‌Pierre-Emmanuel Jabin. Small populations corrections for selection-mutation‌ models.Netw. Heterog. Media742012‌, 805--836URL: https://doi.org/10.3934/nhm.2012.7.805DOI back to text‌
75 articleG.Guy Katriel. Dispersal-induced growth‌ in a time-periodic environment.Journal of Mathematical‌ Biology8532022, 24back to‌ text
76 articleM.Michael Klann, A.‌Arnab Ganguly and H.Heinz Koeppl. Hybrid‌ spatial Gillespie and particle tracking simulation.Bioinformatics‌28182012, i549--i555back to text‌
77 articleA.Alexey Koshkin, U.Ulysse‌ Herbach, M. R.Mar\'ia Rodr\'iguez Mart\'inez,‌ O.Olivier Gandrillon and F.Fabien Crauste.‌ Stochastic modeling of a gene regulatory network driving‌ B cell development in germinal centers.PLoS‌ ONE193March 2024, e0301022HAL‌DOI back to text
78 articleB.Benoît‌ Lalloué, J.-M.Jean-Marie Monnez and E.Eliane‌ Albuisson. Streaming constrained binary logistic regression with‌ online standardized data.Journal of Applied Statistics‌In press2021HALDOI back to text‌
79 articleA.Alexander Lorz, S.Sepideh‌ Mirrahimi and B.Benoît Perthame. Dirac mass‌ dynamics in multidimensional nonlocal parabolic equations.Comm.‌ Partial Differential Equations3662011, 1071--1098‌URL: http://dx.doi.org/10.1080/03605302.2010.538784DOI back to text
80 incollection‌J. A.J. A. J. Metz, S.‌ A.S. A. H. Geritz, G.G.‌ Meszéna, F. J.F. J. A. Jacobs‌ and J. S.J. S. van Heerwaarden.‌ Adaptive dynamics, a geometrical study of the consequences‌ of nearly faithful reproduction.Stochastic and spatial‌ structures of dynamical systems (Amsterdam, 1995)Konink. Nederl.‌ Akad. Wetensch. Verh. Afd. Natuurk. Eerste Reeks, 45‌AmsterdamNorth-Holland1996, 183--231back to text‌
81 articleS.Sepideh Mirrahimi, G.Guy‌ Barles, B.Benoît Perthame and P. E.‌Panagiotis E. Souganidis. A singular Hamilton-Jacobi equation‌ modeling the tail problem.SIAM J. Math.‌ Anal.4462012, 4297--4319URL: https://doi.org/10.1137/100819527‌DOI back to text
82 articleJ.-M.Jean-Marie‌ Monnez. Approximation stochastique en analyse factorielle multiple‌.Ann. I.S.U.P.5032006, 27--45‌back to text
83 articleJ.-M.Jean-Marie Monnez‌. Convergence d'un processus d'approximation stochastique en analyse‌ factorielle.Publ. Inst. Statist. Univ. Paris38‌11994, 37--55back to text
84‌ unpublishedJ.-M.Jean-Marie Monnez and A.Abderrahman Skiredj‌. Convergence of a normed eigenvector stochastic approximation‌ process and application to online principal component analysis‌ of a data stream.May 2019, working paper or preprint‌HAL back to text‌
85 articleJ.-M.Jean-Marie‌‌ Monnez. Stochastic approximation of eigenvectors and eigenvalues‌ of the Q-symmetric expectation‌ of a random matrix‌‌.Communications in Statistics - Theory and Methods‌53515 pages‌2024, 1669-1683HAL‌‌DOI back to textback to text
86‌ articleJ.-M.Jean-Marie Monnez‌. Stochastic approximation of‌‌ the factors of a generalized canonical correlation analysis‌.Statist. Probab. Lett.‌78142008,‌‌ 2210--2216URL: http://dx.doi.org/10.1016/j.spl.2008.01.088DOIback to text
87‌ articleB.Barbara Niethammer‌. On the evolution‌‌ of large clusters in the Becker-Döring model.‌Journal of Nonlinear Science‌1312003,‌‌ 115--122back to text
88 articleB.Benoît‌ Perthame and G.Guy‌ Barles. Dirac concentrations‌‌ in Lotka-Volterra parabolic PDEs.Indiana Univ. Math.‌ J.5772008‌, 3275--3301URL: http://dx.doi.org/10.1512/iumj.2008.57.3398‌‌DOI back to text
89 articleB.Benoît‌ Perthame and M.Mathias‌ Gauduchon. Survival thresholds‌‌ and mortality rates in adaptive dynamics: conciliating deterministic‌ and stochastic simulations.‌Math. Med. Biol.27‌‌32010, 195--210URL: https://doi.org/10.1093/imammb/dqp018DOI back‌ to text
90 article‌A.Angélique Richard,‌‌ L.Loïs Boullu, U.Ulysse Herbach,‌ A.Arnaud Bonnafoux,‌ V.Valérie Morin,‌‌ E.Elodie Vallin, A.Anissa Guillemin,‌ N.Nan Papili Gao‌, R.Rudiyanto Gunawan‌‌, J.Jérémie Cosette, O.Ophélie Arnaud‌, J.-J.Jean-Jacques Kupiec‌, T.Thibault Espinasse‌‌, S.Sandrine Gonin-Giraud and O.Olivier Gandrillon‌. Single-cell-based analysis highlights‌ a surge in cell-to-cell‌‌ molecular variability preceding irreversible commitment in a differentiation‌ process.PLOS Biology‌14122016back‌‌ to text back to text
91 articleD.‌Denis Villemonais. Interacting‌ particle systems and Yaglom‌‌ limit approximation of diffusions with unbounded drift.‌Electron. J. Probab.16‌2011, no. 61,‌‌ 1663--1692URL: https://doi.org/10.1214/EJP.v16-925DOIback to text
92‌ articleD.D. Waxman‌ and S.S. Gavrilets‌‌. 20 questions on adaptive dynamics.J.‌ Evol. Biol.182005‌, 1139--1154back to‌‌ text back to text

SIMBA - 2025

SIMBA - 2025

2025Activity reportProject-Team​​﻿﻿SIMBA

Keywords

Computer Science and​‌﻿﻿ Digital Science

Other Research﻿​﻿﻿ Topics and Application Domains​‌﻿﻿

1 Team​‌﻿﻿ members, visitors, external collaborators​​﻿﻿

Research Scientists

Faculty Members﻿​﻿﻿

Post-Doctoral Fellow

PhD Students​​​‌

Interns and Apprentices​​​‌

Administrative Assistants

2 Overall objectives

3​​﻿﻿ Research program

3.1 Stochastic modeling﻿﻿﻿‌ for health

3.1.1 Links﻿﻿﻿‌ between macroscopic models and﻿‌​‌ stochastic microscopic processes

3.1.2 Numerical analysis

3.2 Analysis of﻿﻿﻿‌ biological and medical data﻿‌​‌

3.2.1 Statistical learning, regression﻿﻿﻿‌

3.2.2 Signal or online​​﻿﻿ data analysis

3.2.3​​﻿﻿ Network inference

3.2.4 Inference for﻿﻿﻿‌ stochastic processes

3.3 Stochastic​​​‌ modeling for ecology and﻿﻿﻿‌ evolution

3.3.1 Links between macroscopic​‌﻿﻿ and microscopic eco-evolutionary models​​﻿﻿

3.3.2 Adaptive dynamics:​​​‌ concentration limits

3.3.3​​​‌ Population dynamics with absorption﻿﻿﻿‌ and quasi-stationary distributions

3.3.4 Numerical​​​‌ analysis

4 Application domains

4.1 Tumor​​﻿﻿ growth and heterogeneity

4.1.1​​​‌ Reconstruction of tumor heterogeneity﻿​﻿﻿

4.1.2 Evolution of﻿‌​‌ low-grade gliomas

4.2 Telomeres

4.3 Gene networks﻿​﻿﻿ and single-cell data

4.3.1​‌﻿﻿ Modeling gene expression at​​﻿﻿ single-cell level

4.3.2​‌﻿﻿ Transcriptional bursting in regulatory​​﻿﻿ networks

4.3.3 Prediction and identification​​​‌ of therapeutic targets for﻿​﻿﻿ chronic lymphocytic leukemia

4.4 Chalara

5﻿‌​‌ Highlights of the year﻿​​﻿

5.1 Awards

6 Latest software﻿​​﻿ developments, platforms, open data​​​‌

6.1 Latest software developments﻿﻿﻿‌

6.1.1 cvmgof

6.1.2﻿​​﻿ Harissa

6.1.3 MultiRNAflow

6.1.4 quantCurves

6.1.5 PEOC

7 New results​​​‌

7.1 Stochastic modeling for﻿​﻿﻿ health, ecology and evolution​‌﻿﻿

7.1.1 Gene regulatory networks​​﻿﻿

Stochastic modeling and simulation​​​‌ of gene regulatory networks﻿​﻿﻿ at single-cell level

Cell trajectory inference​​​‌ based on Schrödinger problem﻿﻿﻿‌ and a mechanistic model﻿‌​‌ of stochastic gene expression﻿​​﻿

Prediction﻿‌​‌ of silencing experiments on﻿​​﻿ gene networks for chronic​​​‌ lymphocytic leukemia

7.1.2 Parameter﻿﻿﻿‌ scalings in population dynamics﻿‌​‌

Adaptive dynamics in biological﻿​​﻿ populations with small mutations​​​‌

Averaging principle​​﻿﻿ for models with slow-fast​​​‌ components

Multiscale dynamics and condensation﻿​﻿﻿ phenomenon in coagulation-fragmentation processes​‌﻿﻿

7.1.3 Individual-based modeling in​​﻿﻿ medicine, ecology and evolution​​​‌

Telomeres

Allometric relationships in﻿﻿﻿‌ Ecology

Niche construction​​​‌

7.1.4 Quasi-stationary distributions​‌﻿﻿

7.1.5 Multi-type bisexual​​​‌ and weighted branching process﻿﻿﻿‌

7.1.6 ODE﻿‌​‌ with semi-Markov switching

7.1.7 Dispersal​‌﻿﻿ Induced Growth

7.1.8​​​‌ Modeling of chronic obstructive﻿​﻿﻿ pulmonary disease

7.1.9​​​‌ Drawdowns of diffusions

7.2 Analysis﻿﻿﻿‌ of biological and medical﻿‌​‌ data

7.2.1 Quantifying and﻿​​﻿ predicting the evolution of​​​‌ clonal heterogeneity in chronic﻿﻿﻿‌ lymphocytic leukemia

7.2.2 Promoting​​﻿﻿ physical activity and limiting​​​‌ sedentary behaviors to manage﻿​﻿﻿ pain in endometriosis: analysis​‌﻿﻿ of psychosocial variables

7.2.3 Uncovering candidate Nanog-Helper​​​‌ genes in early mouse﻿​﻿﻿ embryo differentiation using differential​‌﻿﻿ entropy and network inference​​﻿﻿

7.2.4 Harnessing ecological niche﻿​​﻿ modeling of Listeria monocytogenes​​​‌ for biopreservation system engineering﻿﻿﻿‌

7.2.5​​​‌ Nonparametric estimation

7.2.6 Online big data﻿‌​‌ analysis and online learning​​​‌

7.2.7 Diffuse low-grade gliomas﻿﻿﻿‌

8 Partnerships​​​‌ and cooperations

8.1 International initiatives﻿﻿﻿‌

8.1.1 Associate Teams in﻿‌​‌ the framework of an​​​‌ Inria International Lab or﻿​﻿﻿ in the framework of​‌﻿﻿ an Inria International Program​​﻿﻿

aStoNiche

8.1.2 Visits of​​​‌ international scientists

2025Activity reportProject-TeamSIMBA

Computer Science and‌ Digital Science

Other Research Topics and Application Domains‌

1 Team‌ members, visitors, external collaborators

Faculty Members

PhD Students‌

Interns and Apprentices‌

3 Research program

3.1 Stochastic modeling‌ for health

3.1.1 Links‌ between macroscopic models and‌‌ stochastic microscopic processes

3.2 Analysis of‌ biological and medical data‌‌

3.2.1 Statistical learning, regression‌

3.2.2 Signal or online data analysis

3.2.3 Network inference

3.2.4 Inference for‌ stochastic processes

3.3 Stochastic‌ modeling for ecology and‌ evolution

3.3.1 Links between macroscopic‌ and microscopic eco-evolutionary models

3.3.2 Adaptive dynamics:‌ concentration limits

3.3.3‌ Population dynamics with absorption‌ and quasi-stationary distributions

3.3.4 Numerical‌ analysis

4.1 Tumor growth and heterogeneity

4.1.1‌ Reconstruction of tumor heterogeneity

4.1.2 Evolution of‌‌ low-grade gliomas

4.3 Gene networks and single-cell data

4.3.1‌ Modeling gene expression at single-cell level

4.3.2‌ Transcriptional bursting in regulatory networks

4.3.3 Prediction and identification‌ of therapeutic targets for chronic lymphocytic leukemia

5‌‌ Highlights of the year

6 Latest software developments, platforms, open data‌

6.1 Latest software developments‌

6.1.2 Harissa

7 New results‌

7.1 Stochastic modeling for health, ecology and evolution‌

7.1.1 Gene regulatory networks

Stochastic modeling and simulation‌ of gene regulatory networks at single-cell level

Cell trajectory inference‌ based on Schrödinger problem‌ and a mechanistic model‌‌ of stochastic gene expression

Prediction‌‌ of silencing experiments on gene networks for chronic‌ lymphocytic leukemia

7.1.2 Parameter‌ scalings in population dynamics‌‌

Adaptive dynamics in biological populations with small mutations‌

Averaging principle for models with slow-fast‌ components

Multiscale dynamics and condensation phenomenon in coagulation-fragmentation processes‌

7.1.3 Individual-based modeling in medicine, ecology and evolution‌

Allometric relationships in‌ Ecology

Niche construction‌

7.1.4 Quasi-stationary distributions‌

7.1.5 Multi-type bisexual‌ and weighted branching process‌

7.1.6 ODE‌‌ with semi-Markov switching

7.1.7 Dispersal‌ Induced Growth

7.1.8‌ Modeling of chronic obstructive pulmonary disease

7.1.9‌ Drawdowns of diffusions

7.2 Analysis‌ of biological and medical‌‌ data

7.2.1 Quantifying and predicting the evolution of‌ clonal heterogeneity in chronic‌ lymphocytic leukemia

7.2.2 Promoting physical activity and limiting‌ sedentary behaviors to manage pain in endometriosis: analysis‌ of psychosocial variables

7.2.3 Uncovering candidate Nanog-Helper‌ genes in early mouse embryo differentiation using differential‌ entropy and network inference

7.2.4 Harnessing ecological niche modeling of Listeria monocytogenes‌ for biopreservation system engineering‌

7.2.5‌ Nonparametric estimation

7.2.6 Online big data‌‌ analysis and online learning‌

7.2.7 Diffuse low-grade gliomas‌

8 Partnerships‌ and cooperations

8.1 International initiatives‌

8.1.1 Associate Teams in‌‌ the framework of an‌ Inria International Lab or in the framework of‌ an Inria International Program

8.1.2 Visits of‌ international scientists

Other international visits to the team‌

8.1.3 Visits to international‌ teams

Research stays abroad

8.2.1 H2020 projects

8.4‌‌ Regional initiatives

9‌‌ Dissemination

9.1 Promoting scientific‌ activities

9.1.1 Scientific events:‌‌ organisation

Member of the organizing committees

9.1.2 Scientific events: selection

Member of the‌ conference program committees

9.1.3 Journal‌

Member of the editorial‌ boards

9.1.4 Invited talks

9.1.5 Scientific expertise

9.1.6 Research‌ administration

9.2 Teaching - Supervision -‌ Juries - Educational and‌ pedagogical outreach

9.2.1 Teaching‌‌