2025Activity reportProject-TeamSIMBA
RNSR: 202424506M- Research center Inria Centre at Université de Lorraine
- In partnership with:Université de Lorraine, CNRS
- Team name: Statistical Inference and Modeling for Biological Applications
- In collaboration with:Institut Elie Cartan de Lorraine (IECL)
Creation of the Project-Team: 2024 January 01
Each year, Inria research teams publish an Activity Report presenting their work and results over the reporting period. These reports follow a common structure, with some optional sections depending on the specific team. They typically begin by outlining the overall objectives and research programme, including the main research themes, goals, and methodological approaches. They also describe the application domains targeted by the team, highlighting the scientific or societal contexts in which their work is situated.
The reports then present the highlights of the year, covering major scientific achievements, software developments, or teaching contributions. When relevant, they include sections on software, platforms, and open data, detailing the tools developed and how they are shared. A substantial part is dedicated to new results, where scientific contributions are described in detail, often with subsections specifying participants and associated keywords.
Finally, the Activity Report addresses funding, contracts, partnerships, and collaborations at various levels, from industrial agreements to international cooperations. It also covers dissemination and teaching activities, such as participation in scientific events, outreach, and supervision. The document concludes with a presentation of scientific production, including major publications and those produced during the year.
Keywords
Computer Science and Digital Science
- A3.1. Data
- A3.2. Knowledge
- A3.2.3. Inference
- A3.3. Data and knowledge analysis
- A3.3.1. On-line analytical processing
- A3.3.2. Data mining
- A3.3.3. Big data analysis
- A6.1. Methods in mathematical modeling
- A6.1.1. Continuous Modeling (PDE, ODE)
- A6.1.2. Stochastic Modeling
- A6.1.3. Discrete Modeling (multi-agent, people centered)
- A6.1.4. Multiscale modeling
- A6.2. Scientific computing, Numerical Analysis & Optimization
- A6.2.2. Numerical probability
- A6.2.3. Probabilistic methods
- A6.2.4. Statistical methods
- A9.2.1. Supervised learning
- A9.2.2. Unsupervised learning
- A9.2.5. Bayesian methods
- A9.2.7. Kernel methods
Other Research Topics and Application Domains
- B1. Life sciences
- B1.1. Biology
- B1.1.2. Molecular and cellular biology
- B1.1.4. Genetics and genomics
- B1.1.6. Evolutionnary biology
- B1.1.8. Mathematical biology
- B1.1.10. Systems and synthetic biology
- B1.1.11. Plant Biology
- B2.2. Physiology and diseases
- B2.2.1. Cardiovascular and respiratory diseases
- B2.2.3. Cancer
- B2.3. Epidemiology
- B2.4. Therapies
- B2.4.2. Drug resistance
1 Team members, visitors, external collaborators
Research Scientists
- Nicolas Champagnat [Team leader, INRIA, Senior Researcher, HDR]
- Coralie Fritsch [INRIA, Researcher, HDR]
- Ulysse Herbach [INRIA, Researcher]
- Edouard Strickler [CNRS, Researcher]
Faculty Members
- Koléhè Coulibaly Pasquier [UL, Associate Professor Delegation, from Sep 2025]
- Sandie Ferrigno [UL, Associate Professor]
- Anne Gégout Petit [UL, Professor, HDR]
- Jean-Marie Monnez [UL, Emeritus, HDR]
- Pierre Vallois [UL, Professor, HDR]
- Denis Villemonais [Univ. Strasbourg, Professor, HDR]
- Sophie Wantz-Mézières [UL, Associate Professor]
Post-Doctoral Fellow
- Elie Cerf [INRIA, Post-Doctoral Fellow]
PhD Students
- Sophie Baland [UL]
- Virgile Brodu [UL, ATER, from Sep 2025]
- Virgile Brodu [UL, until Aug 2025]
- Mathilde Gaillard [INRIA]
- Vincent Kagan [UL]
- Juan Mardomingo Sanz [UL, from Oct 2025]
- Anouk Rago [UL, ATER, until Oct 2025]
- Vidhi Vidhi [INRIA]
Interns and Apprentices
- Lorenzo Boussion [ENS Paris-Saclay, Intern, from Apr 2025 until Jul 2025]
- Roxane Cellier [INRIA, Intern, from Apr 2025 until Sep 2025]
- Juan Mardomingo Sanz [UL, Intern, from Apr 2025 until Sep 2025]
- Thibaut Pannet [INRIA, Intern, from May 2025 until Aug 2025]
- Mattheo Rapenne [UL, Intern, from Apr 2025 until Aug 2025]
Administrative Assistants
- Emmanuelle Deschamps [INRIA, until Mar 2025]
- Ouiza Herbi [INRIA, from Mar 2025]
2 Overall objectives
SIMBA is a joint team of Inria, CNRS and University of Lorraine, within the Institut Élie Cartan de Lorraine (IECL), UMR 7502 CNRS-UL, laboratory in mathematics, of which Inria is a strong partner. The team is composed of applied mathematicians whose research interests mainly concern probability and statistics with applications in biology and medicine.
Many fundamental questions and applications in medicine and biology critically rely on our capacity to construct, estimate, analyse and simulate complex mathematical models. They can aim at building predictions and decision making processes upon heterogeneous, noisy, incomplete or inconsistent data, at improving the understanding of complex phenomena involving several interacting subsystems that can only be calibrated separately, or at supplying a priori information on phenomena that cannot be reproduced in laboratory experiments or for which data are too costly to collect. In the past years, these models have been gradually refined, taking into account more interactions or dependencies between different components, more heterogeneity between subsystems and a wider range of time or space scales. Along with this gradual complexification, the importance of stochasticity has been recognized as fundamental in many biological or medical studies, either to take into account intrinsic randomness in biological processes, to evaluate confidence in a model’s parameters or predictions, or to take into account randomness or uncertainties in environmental conditions. In parallel, the specificities of bio-medical data, which are typically high dimensional, heterogeneous, correlated and with few observations, and their gradual increase pose new statistical challenges in terms of classification, prediction, variable selection or streaming data analysis.
Our expertise gathers a large spectrum of mathematical domains, ranging from statistics to stochastic modeling and analysis. We share a common experience and dedication to interactions with other sciences (biology, medicine) and interested parties (physicians, clinical researchers in the medical domain, start-ups). Our application domains are medicine, epidemiology, systems biology, ecology and evolution. The specificity of our group is our capacity to use a broad range of tools to answer practical statistical, modeling and analytical questions posed by collaborators from biology or medicine.
We believe that an interdisciplinary approach is crucial to answer questions posed by biologists or physicians, but can also bring original mathematical questions requiring the development of new theoretical tools which may apply to a broader range of application domains. Our ambition is therefore both to follow a bottom-up research program, where we tackle practical modeling or statistical questions posed by practitioners, and a top-down approach where we develop new mathematical tools to study general models and questions from biology and medicine. Part of our work is purely mathematical, but always motivated by biological applications.
Statistical and stochastic modeling are central to our project. The range of mathematical models we study is large, but they all share common features: we are mainly interested in dynamical models of biological populations with interactions (in a general sense). In our models, populations can be composed of individuals (in ecology, evolution or epidemiology), cells (e.g. bacteria in ecology or tumor cells in medicine), genes or proteins (in systems biology), or populations in the statistical sense (e.g. patients in epidemiology). Interactions or coupling can occur within a population or between species (e.g. in evolution), between cells (e.g. in oncology), between genes or proteins (intracellular networks), with an environment (e.g. in tumor growth, where environmental conditions are linked to medical treatment, or in ecology) or between high-dimensional statistical variables (e.g. clustering and variable selection in epidemiology). In biology and medicine, models were primarily developed to understand phenomena at fixed time and space scales. Today, more and more models aim at studying phenomena at large scales resulting from delicate coupling and interactions between scales. Descriptions at small scales are typically high dimensional and often involve stochastic features, or a combination of stochastic and deterministic features (hybrid models). Our research project aims at studying such complex systems using analytical tools to construct: 1. coarse-grained representations of small scale features (through averaging, ergodic limits, homogenization, mean field models...); 2. specific numerical methods, possibly based on coarse-grained representations, to efficiently bridge the gap between different time and space scales; 3. appropriate statistical inference or learning tools often based on limited or partial observation, in order to make predictions.
3 Research program
The research challenges we present in this section are mainly theoretical or methodological. All of them are motivated by biological or medical applications and provide a wide methodological toolbox that can be combined to answer biological or medical questions.
3.1 Stochastic modeling for health
The modeling issues we address in medicine aim at understanding fundamental mechanisms of cancer development, understanding how cells make decisions through gene expression and bringing new insights on the evolution of telomere length distribution with age and across a population. Telomeres are non-coding regions of repetitive nucleotide sequences located at each end of chromosomes. In human, they shorten at each cell division and it is known that short telomere lengths are statistically linked to age related diseases. These transversal applications are described in Section 4.
Population dynamics of tumor cells have been modeled in numerous works, either by deterministic models (ordinary differential equations or ODEs and partial differential equations or PDEs), or stochastic ones, either discrete (birth and death or branching processes, individual-based models in infinite dimension) or continuous (stochastic differential equations or SDEs). Branching processes or individual-based models are also fundamental tools to study telomere length dynamics. Concerning gene networks, in addition to classical models (e.g. Gaussian graphical models or deterministic systems with external noise), we develop new models (see Section 3.2.3) based on PDMPs (piecewise-deterministic Markov processes). Our team gathers experts of all these classes of models, both from the analytical, statistical and numerical simulation points of view. In most applications we have in mind, we need to combine within-cell dynamics (such as telomere shortening or gene expression) with cell population dynamics, leading to multiscale and/or multicomponent hybrid models. Multicomponent models are also ubiquitous in medicine when one takes into account latent variables such as the genealogical tree of mutations within a tumor when only observations of clonal population sizes, that is sizes of populations of cells sharing the same genetic material, are available.
3.1.1 Links between macroscopic models and stochastic microscopic processes
For multiscale models, it is often relevant to distinguish what could be called the microscopic level (i.e. the level of single individuals), the macroscopic level (i.e. the level of large population densities) or the intermediate mesoscopic level (i.e. a level of population densities, but where demographic stochasticity cannot be neglected). Microscopic models can be for example stochastic, individual-based models, mesoscopic models can be SDEs and macroscopic models are usually models of population densities, such as ODEs, PDEs or PDMPs. Many biological questions are stated at the macroscopic level, and the main modeling issue lies in the appropriate way to incorporate meso- or microscopic features in the description of the macroscopic scale.
Typically, individual-based models are models where the population state at time is described as a counting measure where is the number of individuals alive at time , and is the characteristic of the -th individual (e.g. phenotype, mass, size, length of telomeres, age...), belonging to some set . The dynamics is strongly dependent on the precise phenomenon to be modeled, but in the simplest cases, is a Markov jump process on point measures on whose infinitesimal generator has the following form:
where is the infinitesimal birth rate of an individual from an individual in the population and is the death rate of an individual in the population .
We have developed a strong expertise in the derivation of simplified macroscopic models from complex microscopic effects when there is a strong separation between time and/or space scales. Mathematically, this requires us to encode the scale separation through scaling parameters and to solve an asymptotic analysis problem, which can be averaging (slow-fast dynamics, singular perturbation) 57, 2, homogeneization 48, 65, concentration limits 52, 55, 56 or more generally parameter scaling problems where the scaling parameter has a biological meaning 54, 49. For individual-based models, the simplest parameter scaling that can be considered is a large population asymptotic encoded with a parameter , where the state of the population is modified as and the generator (1) as
leading to mean-field macroscopic models when 54. More complex scalings can involve rare or small mutations combined with long-time scalings, complex local interactions as in 48 or multiscale phenomena (e.g. within cell dynamics combined with the dynamics of populations of cells, like for telomeres). Each problem usually requires new methodological development.
3.1.2 Numerical analysis
Sometimes, it is not possible to construct a simplified macroscopic description of complex, multiscale biological phenomena. In such cases, we need to rely on numerical simulations. More generally, numerical simulations are very helpful to supply a priori information on phenomena that are difficult to reproduce in laboratory experiments or on large-scale models that involve several interacting elements.
Despite increasing computing facilities, existing numerical schemes are rarely adapted to individual-based models. Better numerical approaches based on a deeper understanding of their multiscale features would significantly reduce computational costs and yield reliable error estimates. This is one of our motivations in studying model reduction through asymptotic analysis as described above. We propose to design numerical schemes taking into account local extinction or local deterministic approximation, or develop hybrid methods relying on duality methods based e.g. on the Poisson representation of birth and death processes, which allows to switch between microscopic and mesoscopic models when the population size crosses a threshold.
3.2 Analysis of biological and medical data
Modern medicine is turning to highly personalized approaches, and a major challenge is to design and develop a new generation of techniques to assist prevention, diagnosis, prognosis and therapy. A major difficulty is the integration and exploitation of data which are often high-dimensional, heterogeneous, incomplete or inconsistent, to build predictions and decision making processes. The main part of our research in statistical learning aims to develop operational tools for the analysis of data from our collaborations with biologists and physicians. Another part of our project builds upon the sophisticated models described in Section 3.1, for which specific inference tools need to be designed.
3.2.1 Statistical learning, regression
We want to develop methodologies that take into account the specificities of biological data: they are often high dimensional and correlated (for instance multiomics data, such as genome, proteome or transcriptome) but with few observations (patients) and with missing data. Variable selection is of particular interest to study the link between an outcome (the occurence of an illness for instance) with covariates or to infer partial correlations between several variables for instance to study quantitative microbiome data. For instance, in variable selection, we propose to study the theoretical guarantees of our methods 42, 72 and to extend them to other models of dependencies such as mixtures of quantitative or qualitative covariates with dependencies between the covariates. In regression, we wish to tackle the challenge not only to select the covariates related to an event (illness, death...), but also to understand which configuration of these covariates triggers its occurrence.
We also develop goodness-of-fit tests to assess the different assumptions of a (possibly heteroscedastic) regression model. Most of them are “directional” in that they detect departures from a given assumption of the model. Other tests are “omnibus” in that they assess whether a model fits a dataset on all its assumptions. We focus on the task of choosing the structural part of the regression function because it contains easily interpretable information about the studied relationship. Among the large number of existing tests, we consider nonparametric tests which are all based on generalizations of the Cramér-von Mises statistic. To perform these goodness-of-fit tests, we develop the R package cvmgof41, 1, an easy-to-use tool which allows practitioners to compare the implemented tests. In the future, we plan to enrich the cvmgof package with tests concerning the other assumptions of a regression model such as the functional form of the variance or the additivity of the random error term, using directional tests such as 67. Another perspective is to develop a similar tool for other statistical models widely used in biostatistics such as generalized linear models.
3.2.2 Signal or online data analysis
We develop tools for the analysis of online data, which are now very frequent in the health domain (recording of cardiac signals, physiological measurements via connected objects...). The purpose is either the update of estimation parameters for models with sequential arrival of new data, or the online detection of change-points in temporal signals. For the first purpose, stochastic algorithms are essential tools 50, which allow one to approximate eigenvectors in a stepwise manner 83, 82, 86. We focus on several incremental procedures for regression and data analysis like PCA (Principal Component Analysis) 84 and linear and logistic regressions on standardised data in order to avoid numerical explosion 78. Our aim is to apply these results to other methods related to PCA, such as multiple factorial analyses, partial PCA, and to learning (classification, regression, event scores).
Change-point detection is of particular interest in e-health, for example for detecting changes in the health status of the elderly or people at risk of a disease. We plan to develop tools for online change-point detection using score-based CUSUM statistics. Our aim is to use the beginning of the signal to build simulation-based thresholds that have to be crossed for the detection of the change-point. We also want to propose new stopping rules adapted to the characteristics of the signal. These questions may be relevant for a large part of the applications we shall develop with physicians.
3.2.3 Network inference
Inferring networks from data is often a crucial step for understanding biological interactions, such as regulatory links between genes within individual cells 73, 47 or communication relationships between bacteria within a population.
Concerning gene regulatory networks, we are interested in so-called single-cell data, where mRNA levels are measured individually in many cells rather than being population-averaged, revealing the intrinsically stochastic “transcriptional bursting” phenomenon. In previous work 73, we described a promising strategy in which the network inference problem is seen as a calibration procedure for a PDMP model that is able to fit real single-cell data 10. In the simplest version of this model, the state of each gene at time is described by its promoter state , the quantity of transcribed RNA and of translated proteins , where
Interactions between genes in the network are encoded in functions which correspond to burst frequencies. We plan to develop dynamical models that fully exploit the particular time structure of the data: as cells have to be killed for measurements, such data do not consist of cell trajectories but rather independent samples of time-varying multivariate distributions. As the number of genes can be large ( in humans), we will also investigate variational methods, which generalize the usual expectation-maximization (EM) algorithm, as a relevant way to make the inference procedure both robust and scalable.
We also develop methods to infer gene networks from dynamic gene expression data within a collaboration with CHRU Strasbourg. The goal here is to design new models and inference methods adapted to the prediction of the outcome of biological intervention experiments such as “gene knock-down”, in order to identify therapeutic targets that could be experimentally studied.
3.2.4 Inference for stochastic processes
In our collaborations with practitioners, we are led to infer specific quantities of interest for stochastic dynamical models of various classes, such as the speed of telomere shortening using multitype branching processes; the growth rate of clonal populations and their phylogenetic tree using heterogeneous population growth models in cancer; links between gene expressions using PDMPs or the speed of propagation of fungi using stochastic models with latent variables given by the solution of a partial differential equation (see our main applications in Section 4). For all these sophisticated stochastic processes, statistical inference raises many open, difficult questions. In particular, we plan to develop inference tools for general classes of models such as PDMP, bifurcating Markov processes or branching processes. More generally, the inference of dynamical models relies strongly on their long-time behavior, for which the team has a strong expertise (see Section 3.3). Most often in biology, models rely on latent variables, that may follow complex dynamics as in the examples above. The inference of this kind of models requires us to develop efficient Bayesian algorithms, EM like algorithms and variational methods.
3.3 Stochastic modeling for ecology and evolution
In ecology, we are specifically interested in theoretical challenges in conservation biology (the branch of biology dealing with extinction and survival of species) and in the response of ecosystems to environmental perturbations such as climate change, anthropization or niche construction. In evolution, we have a strong expertise in the study of the long term evolution of biological populations using approximate models based on various biological assumptions. Although this application domain is different from the one of Section 3.1, the questions we address are close from the mathematical point of view.
3.3.1 Links between macroscopic and microscopic eco-evolutionary models
We develop here similar ideas as in Section 3.1.1, but the mathematical questions are of different nature for models of tumor growth in medicine than for population models in ecology or evolution, because in the first case one wants to capture transitory behavior (e.g., in growing populations for tumor growth), whereas in the second case, one usually models long term evolution assuming that the ecological dynamics is in a stationary (quasi-equilibrium) state and that evolution acts slowly.
In addition to the various classes of models described in Section 3.1.1, which are also relevant for ecology and evolution, some other types of models are well-developed in these domains, such as Dawson–Watanabe processes 62, Fleming–Viot processes 68 or a particular class of PDMPs called switched dynamical systems used to model abruptly changing environments. The general goal of designing macroscopic models from complex multiscale models through parameter scalings extends to these new classes of models. For example, Fleming-Viot processes appear as the fast dynamics in the long term evolution of biological populations under assumptions of small mutations and large population 2.
In ecology, our motivations are to highlight the biological assumptions underlying different classes of macroscopic models, or to take into account at the macroscopic scale complex local interactions between individuals. In evolutionary biology, the time scales involved are so long that it is hard to observe experimentally evolutionary phenomena such as diversification. Mathematical analysis of models is therefore of great importance, e.g. to construct approximate models allowing to predict long term evolution of biological populations.
3.3.2 Adaptive dynamics: concentration limits
Asymptotic analysis is particularly useful in the branch of evolutionary biology called adaptive dynamics. This biological theory which studies the interplay between ecological interactions and long term evolution was developed in the mid 90's 80, 63 and provides theoretical ecologists with useful tools to deduce evolutionary patterns from ecological parameters (directional evolution through canonical equations and diversification through evolutionary branching).
So far, two mathematical approaches to justify, study and improve these tools have been developed: a deterministic one, based on PDE models 64, 88, 79 and a probabilistic one, based on individual-based models 52, 56. Both approaches are concentration limits, aiming to construct approximate models where population densities are replaced by Dirac masses representing coexisting sub-populations. They both can be seen as particular parameter scalings on individual-based models of the form (2), combining large population scaling with scalings of rare mutations and/or small mutations.
However, the adaptive dynamics toolbox and the existing mathematical approaches are criticized because they are based on unrealistic biological assumptions on the scales involved in the process 92, 89. Mathematical analysis is needed both to quantify the underlying assumptions on scales and to propose alternative models based on more realistic assumptions. For example, we plan to design PDE models of adaptive dynamics allowing for local extinctions of populations 74, 81. Similarly, the actual occurrence of evolutionary branching in sexual populations is debated 92 and our goal is to shed light on these questions using asymptotic analysis starting from individual-based models.
3.3.3 Population dynamics with absorption and quasi-stationary distributions
In conservation biology, it is fundamental to quantify the chances of survival of species in a given environment. In addition, the observed biological populations are intrinsically conditioned to be non-extinct, which introduces an observation bias that is rarely taken into account. For a given model of population dynamics, it is therefore important to develop tools allowing to study the population size before extinction and to quantify its extinction probability in a given time window. When the population size is stable during long time intervals before extinction, it can be described by a quasi-stationary distribution (QSD), defined as a stationary distribution conditionally on non-extinction. The QSD also allows to quantify the population extinction rate.
Our research program on this topic builds on recent works of the team 59, 4, where we developed probabilistic criteria for the large time convergence of conditional distributions of stochastic population processes, that proved to apply to a wide range of stochastic processes. These works open many perspectives. We focus here on the methodological ones and will mention some numerical issues in Section 3.3.4 below. A first question is to obtain criteria for the convergence of conditional distributions for weaker distances than in 59, 4: instead of the total variation distance, we study convergence in Wasserstein distances. This is particularly relevant for PDMPs or for infinite dimensional processes such as individual-based models, where coupling properties are not strong enough to expect convergence in total variation. We also want to study other questions related to QSDs: Can we characterize the speed of convergence of conditional distributions to a QSD? Is it possible to study the path to extinction of a stochastic population process, in order to characterize the parameters improving their survival (e.g. for protected species) or their extinction (e.g. for pests in agronomy)? What can be said about the genealogy of populations before extinction?
3.3.4 Numerical analysis
The challenges detailed here are related to those described in Section 3.1.2. Ecological or evolutionary models pose specific numerical problems that we want to tackle. For example, in the numerical simulation of individual-based models like (1), when the population size is small, randomness cannot be neglected and exact algorithms have to be used; when the population size is large, such algorithms become too costly and one would like to take advantage of deterministic approximations like those we developed in 53, 70. The error analysis of such hybrid numerical schemes is difficult 40, 76 and the case of spatially or trait-structured individual-based models is still largely open.
We are also interested in numerical methods to approximate QSDs, among which the most developed are particle methods 91, 58 and stochastic algorithms such as self-interacting processes 44, 43. The analysis of these algorithms relies on long-time analysis of particle or nonlinear systems and our research project on QSDs detailed above will be of great help. In particular, Wasserstein distances are known to be well adapted to the study of convergence of particle systems thanks to their tensorization properties. We also plan to develop new numerical approaches based on the stationary distribution of approximating processes, such as those obtained by central limit theorems for stochastic processes 60.
4 Application domains
We have described in the last section our theoretical expertise and several mathematical challenges. However, our main motivation comes from our interactions with biologists, physicians or clinical researchers. Most often, these interactions involve several of the methodological tools developed above, that need to be combined for precise biological goals. Our strategy is to establish collaborations with few groups of biologists and physicians, that allow us to tackle ambitious, long term projects. In this section, we illustrate the different domains of application mentioned in the previous section by describing several ongoing pluridisciplinary projects that involve large subgroups of the team.
4.1 Tumor growth and heterogeneity
4.1.1 Reconstruction of tumor heterogeneity
Targeted therapies represent a real advance in the treatment of patients with cancer. Most of these therapies are kinase inhibitors and require precise analysis of tumor DNA mutations to ensure the absence of primary resistance. Indeed, tumors are often genetically heterogeneous with the presence of many subclones, but they release “circulating” cell-free DNA (cfDNA) that can be directly extracted from basic blood samples: as measurement sensitivity improves, such liquid biopsies increasingly appear as a mirror of tumor heterogeneity. In this context, we are taking a promising statistical approach to analyze longitudinal cfDNA data, with the purpose of gaining a deeper understanding of the mechanism by which resistance develops in individual patients. While addressing the standard problem of reconstructing the associated phylogenetic tree, this approach also describes the production of cfDNA from the temporal dynamics of cells, in order to best exploit the longitudinal structure of the data. This is a project in collaboration with physicians Jean-Louis Merlin (Institut de Cancérologie de Lorraine), Alexandre Harlé (Institut de Cancérologie de Lorraine) and Erwan Pencreac'h (CHRU Strasbourg). We currently also have another ongoing project on a tumor heterogeneity reconstruction for chronic lymphocytic leukemia in collaboration with Laurent Vallat (CHRU Strasbourg).
4.1.2 Evolution of low-grade gliomas
We have an ongoing collaboration with the Centre de Recherche en Automatique de Nancy CRAN (Jean-Marie Moureaux) and neuro-oncologist and surgeon from CHRU Nancy (Luc Taillandier, Fabien Rech) about diffuse low-grade gliomas (DLGG). These are slow-growing tumors that are often asymptomatic for a long period of time. They progress to a higher grade, resulting in the patient's death. The current treatment strategy aims to surgically reduce tumor volume as soon as possible. As DLGGs infiltrate functional areas, surgery is performed in an awake state, with active patient participation, while electrical brain stimulations are done, to identify functional structures. At CHRU Nancy, surgeries are filmed, but the patient's responses are recorded manually. We aim to develop an automatic tool to detect, analyse and register the patients responses. We use deep learning algorithms for motion and speech detection. In perspective, the fine anomalies that can be identified are likely to be correlated with the patient's short- and long-term cognitive outcome.
4.2 Telomeres
Telomeres are non-coding regions of repetitive nucleotide sequences located at each end of a chromosome. They protect the end of the chromosome from deterioration or from fusion with neighboring chromosomes, ensuring the integrity of genetic material over cell divisions. At each cell division, telomeres loose a short fragment, a phenomenon often called the `end replication problem'. When its telomeres are too short, the cell stops dividing and enters a senescence phase. In human, it is known that short telomere lengths are statistically linked to age related diseases 46.
In an ongoing collaboration with Athanase Benetos (CHRU of Nancy) and Simon Toupance (CHRU of Nancy), we study the telomere length distribution in a human body, its relation with the patient's phenotype, its evolution with age and across generations. Our contribution to the project is to bring competences ranging from theoretical probability to applied statistics, including modelling and numerical simulation of stochastic models and analysis of distributional data of telomeres length. A first goal is to provide physicians with additional medical statistics to better understand the health state of a patient using its telomere length distribution, based on our observation that the shape of the telomere length distribution is stable accross ages of an individual, leading to the concept of telomere signature 9. We plan to study larger cohorts of individuals with medical records and parental relationships to construct an equivalence relationship between shapes of distributions and to develop health scores for patients allowing to assess the risk of particular diseases. A second goal is to bring new insights for the description and modelling of the evolution of telomere length distribution with age and across a population. For this, we study a stochastic branching model of the telomere length into a given tissue of the form of (1) where is the vector of lengths of telomeres of cell . We also plan to study models for the evolution of the telomere length accross a population of individuals and on evolutionary time scales, where is the telomere lengths in gametes of individual . This project requires advanced mathematical tools related to the theory of branching processes and of non-conservative semi-groups. We also plan to tackle parameter estimation questions for these models.
In another ongoing project with Marie-Noëlle Simon (CRCM, Aix-Marseille Université), we study modeling and inference of the different mechanisms of telomere shortening or elongation in survivor cells of yeast Saccharomyces cerevisiae for which telomerase is inactivated. Telomerase is an enzyme that is active in normal yeasts and compensates telomere shortening due to the end replication problem. When telomerase in inactivated, most yeasts undergo replicative senescence, except a few ones called survivor cells, which are able to develop alternative telomere elongation mechanisms. The goal of the project is to develop a full model of the evolution of survivor cells, in particular by estimating the rates and sizes of abrupt telomere shortening or lengthening in survivor cells.
4.3 Gene networks and single-cell data
4.3.1 Modeling gene expression at single-cell level
Gene expression in cells has long been only observable through averaged quantities over cell populations. The development of single-cell transcriptomics has enabled gene expression to be measured in individual cells: it turns out that even for an isogenic population located in a homogeneous medium, molecular variability can be large. An average description is therefore not sufficient to account for fundamental phenomena such as cell differentiation. Recently, a view emerged that the dynamics governing the switching of cells from one differentiation state to another could be characterized by a peak in gene expression variability at the point of fate commitment 90. We are continuing on this path, working on the link between PDMP models and notions of entropy and epigenetic landscape.
4.3.2 Transcriptional bursting in regulatory networks
Working in the active field of single-cell dynamics and gene regulatory networks provides opportunities to interact with biologists such as Olivier Gandrillon from ENS Lyon 90, 77, and potentially also physicians such as Erwan Pencreac'h in CHRU Strasbourg. The biological literature increasingly highlights gene regulatory networks as playing an important role (independent of genetic mutations) in the acquisition of resistance to cancer treatments, hence this topic might become soon also relevant to the application area of oncology.
4.3.3 Prediction and identification of therapeutic targets for chronic lymphocytic leukemia
In an ongoing collaboration with Laurent Vallat (CHRU Strasbourg), we develop new models and inference methods for gene regulation networks allowing to make prediction of biological intervention experiments (such as gene knock-down). Inference is performed on gene expression data from cells of patients suffering from different forms of chronic lymphocytic leukemia. The goal is to use prediction to identify therapeutic targets which could be knocked-down to reduce the cells' proliferation. Biological experiments will be performed by Laurent Vallat and his group to assess the therapeutic potential of the new targets.
4.4 Chalara
The Chalara project 69 is a team project with Benoît Marçais (INRAE Champenoux) and Marie Grosdidier (INRAE Avignon). Chalara is an ash disease that arrived in France 12 years ago through Grand Est and has been spreading throughout France ever since. The disease spreads by means of fungus spores that are deposited on the leaves of trees during summer, fall at the foot of the trees during fall and give rise to fungi that release new spores that spread the following summer. Affected trees show signs of decline (defoliation, canker...) that can lead to their death.
The objective is to model the spread of chalara and to study and quantify the potential underlying environmental effects, such as humidity or high temperatures. We use a hybrid model where spores spread is based on reaction-diffusion PDEs and other steps of the disease cycle are stochastic. The project involves modeling, Bayesian statistics and intensive simulation.
5 Highlights of the year
5.1 Awards
- The K.P. Hadeler prize for best paper in the Journal of Mathematical Biology in 2024 goes to Michel Benaim, Claude Lobry, Tewfik Sari and Edouard Strickler for their paper entitled “When can a population spreading across sink habitats persist?” 45.
6 Latest software developments, platforms, open data
6.1 Latest software developments
6.1.1 cvmgof
-
Keywords:
Regression, Test, Estimators
-
Scientific Description:
Many goodness-of-fit tests have been developed to assess the different assumptions of a (possibly heteroscedastic) regression model. Most of them are "directional" in that they detect departures from a given assumption of the model. Other tests are "global" (or "omnibus") in that they assess whether a model fits a dataset on all its assumptions. cvmgof focuses on the task of choosing the structural part of the regression function because it contains easily interpretable information about the studied relationship. It implements 2 nonparametric "directional" tests and one nonparametric "global" test, all based on generalizations of the Cramer-von Mises statistic.
-
Functional Description:
cvmgof is an R library devoted to Cramer-von Mises goodness-of-fit tests. It implements three nonparametric statistical methods based on Cramer-von Mises statistics to estimate and test a regression model.
- URL:
- Publication:
-
Contact:
Romain Azais
-
Participants:
Sandie Ferrigno, Marie-Jose Martinez, Romain Azais
6.1.2 Harissa
-
Name:
Hartree approximation for inference along with a stochastic simulation algorithm
-
Keywords:
Gene regulatory networks, Reverse engineering, Molecular simulation
-
Functional Description:
Harissa is a Python package for both inference and simulation of gene regulatory networks, based on stochastic gene expression with transcriptional bursting. It was implemented in the context of a mechanistic approach to gene regulatory network inference from single-cell data.
- URL:
- Publications:
-
Contact:
Ulysse Herbach
6.1.3 MultiRNAflow
-
Name:
An R package for the analysis of RNAseq raw counts with multiple biological conditions and time points
-
Keywords:
RNA-seq, Gene regulatory networks, Integrated data analysis, Complex experimental design, Multiple temporal and biological conditions, Differential expression
-
Functional Description:
The R package MultiRNAflow provides an easy to use unified framework allowing to make both unsupervised and supervised analysis (differential expression analysis) for RNAseq datasets with an arbitrary number of biological conditions and time points. In particular, this package makes a deep downstream analysis of differential expression information, e.g. identifying temporal patterns across biological conditions and differentially expresses genes which are specific to a biological condition for each time.
-
Release Contributions:
First version
- URL:
-
Contact:
Nicolas Champagnat
-
Participants:
Rodolphe Loubaton, Nicolas Champagnat, Pierre Vallois, Laurent Vallat
-
Partner:
CHRU de Strasbourg
6.1.4 quantCurves
-
Keyword:
Statistical modeling
-
Functional Description:
Non-parametric methods as local normal regression, polynomial local regression and penalized cubic B-splines regression are used to estimate quantiles curves.
- URL:
-
Contact:
Sandie Ferrigno
6.1.5 PEOC
-
Name:
Parameters Estimation Of Chalara model
-
Keywords:
Statistical inference, Iterative algorithm, Monte Carlo estimation, Python, Statistical modeling
-
Functional Description:
PEOC is Python software which aims to simulate and estimate the parameters of the chalara model of the article 'Mechanistic-statistical model for the expansion of ash dieback' by Coralie Fritsch, Marie Grosdidier, Anne Gégout-Petit and Benoît Marçais. It allows to reproduce the simulations made in this article, which is available at the following url https://hal.science/hal-04690647.
- URL:
-
Contact:
Coralie Fritsch
7 New results
7.1 Stochastic modeling for health, ecology and evolution
7.1.1 Gene regulatory networks
Stochastic modeling and simulation of gene regulatory networks at single-cell level
Participants: Mathilde Gaillard, Ulysse Herbach.
Single-cell data reveal the presence of biological stochasticity between cells of identical genome and environment, in particular highlighting the transcriptional bursting phenomenon. To account for this property, gene expression may be modeled as a continuous-time Markov chain where biochemical species are described in a discrete way, leading to Gillespie's stochastic simulation algorithm (SSA) which turns out to be computationally expensive for realistic mRNA and protein copy numbers. Alternatively, hybrid models based on piecewise-deterministic Markov processes (PDMPs) offer an effective compromise for capturing cell-to-cell variability 73, but their simulation remains limited to specialized mathematical communities.
With a view to making them more accessible, we introduced a simple simulation method that is reminiscent of SSA, while allowing for much lower computational cost 23. We detailed the algorithm for a bursty PDMP describing an arbitrary number of interacting genes, and proved that it simulates exact trajectories of the model. As an illustration, we used the algorithm to simulate a two-gene toggle switch: this example highlights the fact that bimodal distributions as observed in real data are not explained by transcriptional bursting per se, but rather by distinct burst frequencies that may emerge from interactions between genes.
Cell trajectory inference based on Schrödinger problem and a mechanistic model of stochastic gene expression
Participants: Ulysse Herbach.
External collaborators: Université Paris Cité, Aix Marseille Université, Université Lyon 1, ENS Lyon.
Cellular differentiation is the biological process that leads a cell to opt for a particular cellular identity. Recently, single-cell RNA-sequencing has enabled the simultaneous measurement of gene expression levels at specific times for a large number of individual cells and a large number of genes. Repeating such measurements at different time points gives access to the temporal variation, or transport, of a distribution over gene expression space. The whole temporal trajectory of distributions thus characterizes the differentiation process at population level, but trajectories of individual cells are still out of reach since most measurement techniques are destructive.
The optimal transport theory that has been used so far to infer cellular differentiation trajectories from time-stamped single-cell RNA-seq data involves solving the so-called Schrödinger problem in its most common version. This implies assuming that cells move, in the gene expression space, by diffusion. Yet, real gene dynamics are much more complex. In 31, we assume that mRNA dynamics are characterized by brief and important production of RNA, with long periods of inactivity in between, and consider a bursty PDMP model of gene dynamics 23. We use this model to define a reference process for the Schrödinger problem. By comparing the solutions of the Schrödinger problems with diffusive and bursty reference processes, under different conditions, we show that the bursty model provides a better approximation of the underlying gene dynamics when inferring cell trajectories.
Prediction of silencing experiments on gene networks for chronic lymphocytic leukemia
Participants: Nicolas Champagnat, Anne Gégout-Petit, Anouk Rago, Pierre Vallois.
External collaborators: CHRU Strasbourg
In this collaboration with the group of Laurent Vallat in CHRU Strasbourg, we work on the inference of dynamical gene networks from RNA-seq data of chronic lymphocytic leukemia. The goal is to infer a model of gene expression allowing to predict gene expression in cells where the expression of specific genes is knocked-down (e.g. using siRNA, i.e. small interfering ribonucleic acid), in order to select the knock-down experiments which are more likely to reduce cell proliferation. We expect the selected genes to provide new therapeutic targets for the treatment of chronic lymphocytic leukemia. This year, we developed in 27 two general mathematical frameworks do model knock-down experiments on gene regulatory networks using gene expression data of cells without knock-down: the first one is based on conditional probabilities and applies to any quantitative model and the second assumes that the model is mechanistic.
7.1.2 Parameter scalings in population dynamics
Adaptive dynamics in biological populations with small mutations
Participants: Nicolas Champagnat.
External collaborators: Vincent Hass (Université de Franche-Comté)
In 15 we studied general individual-based models of biological populations with mutation and selection, under assumptions of large population and small mutations. We recover variants of the canonical equation of adaptive dynamics, which describes the long time evolution of the dominant phenotype in the population, under less stringent biological assumptions than in previous works such as 56.
Averaging principle for models with slow-fast components
Participants: Vincent Kagan, Edouard Strickler, Denis Villemonais.
In 33, we consider a slow-fast stochastic process where the slow component is a jump process on a measurable index set whose transition rates depend on the position of the fast component. Between the jumps, the fast component evolves according to an ergodic dynamic in a state space determined by the index process. We prove that, when the ergodic dynamics are accelerated, the slow index process converges to an autonomous pure jump process on the index set. These results can be used to legitimate classical toy-models long used in applications, providing a justification for their simplifying assumptions. In particular, we apply our results to prove the convergence of a typed branching process toward a continuous-time Galton-Watson process, and of an epidemic model with fast viral loads dynamics to a standard contact process.
Multiscale dynamics and condensation phenomenon in coagulation-fragmentation processes
Participants: Elie Cerf, Coralie Fritsch, Denis Villemonais.
External collaborator: Alex Watson (University College London)
This collaboration is an ongoing work aimed to study the condensation phenomenon in the context of a large population of particles randomly interacting through fragmentation and coagulation. More precisely, we want to exhibit the possible coexistence of two phases respectively made of microscopic and macroscopic aggregates. In the first part of this project, we extend the results of 49 on the chemostat model to study a stochastic growth-fragmentation-coagulation model in which a population of particles determined by their masses deterministically consume a substrate to grow and randomly interact, affecting the size of the population. Using usual tools of individual-based models (IBM), we establish the convergence of a process modeling the population to the solution of an integro-differential equation when the size of the population tends to infinity, for a general class of random fragmentation laws. In a second part, we aim to use our understanding of the previous IBM to study the stochastic Becker-Döring model of aggregation and fragmentation. In the deterministic case, it is well known that even though the Becker-Döring system conserves its total mass in finite times there are conditions under which, as the time goes to infinity, a positive fraction of the mass gets trapped in an ever-growing macroscopic aggregate. In particular, Niethammer 87 showed that under a precise normalization, the dynamic of the infinite cluster is given by a Lifshitz-Slyozov-Wagner coarsening model. Our goal is to obtain similar results in the stochastic framework.
7.1.3 Individual-based modeling in medicine, ecology and evolution
Telomeres
Participants: Sophie Baland, Nicolas Champagnat, Coralie Fritsch, Juan Mardomingo Sanz, Denis Villemonais.
External collaborators: CHRU Nancy, CRCM Aix-Marseille Université
In most cells, at each cell division, telomeres shorten due to the so-called end replication problem, which can lead to replicative senescence and a variety of age-related diseases. In certain cells, the presence of the enzyme telomerase can lead to the lengthening of telomeres, which may delay or prevent the onset of such diseases but can also increase the risk of cancer. In 14, C. Fritsch and D. Villemonais developed, in collaboration with reserachers of CHRU Nancy, a stochastic representation of this biological model, which takes into account multiple chromosomes per cell, the effect of telomerase, different cell types and the dependence of the distribution of telomere length on the dynamics of the process. We study theoretical properties of this model, including its long-term behavior. In addition, we investigate numerically the impact of the model parameters on biologically relevant quantities, such as the Hayflick limit and the Malthusian parameter of the population of cells. Similar models including mechanisms of hematopoiesis (the physiological process of blood cell production) are also currently being developed by S. Baland and D. Villemonais.
In another ongoing project with Marie-Noëlle Simon (CRCM, Aix-Marseille Université), N. Champagnat, C. Fritsch, J. Mardomingo Sanz and D. Villemonais started to study modeling and inference of the different mechanisms of telomere shortening or elongation in survivor cells of yeast Saccharomyces cerevisiae for which telomerase is inactivated. The telomerase enzyme is active in normal yeasts and compensates telomere shortening due to the end replication problem. When telomerase is inactivated, most yeasts undergo replicative senescence, except a few ones called survivor cells, which are able to develop alternative telomere elongation mechanisms. We are currently working on methods to estimate the shortening of telomeres at cell division for survivor cells using the full distribution of telomeres and on population models with rare events of appearance of survivors able to account for data of yeasts cultures with daily dilution.
Allometric relationships in Ecology
Participants: Virgile Brodu, Nicolas Champagnat, Coralie Fritsch.
External collaborator: Sylvain Billiard (Université de Lille)
In 26, we designed a stochastic individual-based model structured in energy, for single species consuming an external resource, where populations are characterized by a fixed positive energy at birth. The resource is maintained at a fixed amount, so we benefit from a branching property at the population level. We focus on individual trajectories, constructed as a PDMP with random jumps modelling births and deaths in the population and a continuous and deterministic evolution of energy between jumps. We are interested in the case where metabolic (i.e. energy loss for maintenance), growth, birth and death rates depend on the individual energy over time, and follow allometric scalings (i.e. power laws). Our goal is to determine in a bottom-up approach what are the possible allometric coefficients (i.e. exponents of these power laws) under elementary and ecologically relevant constraints, for our model to be valid for the whole spectrum of possible body sizes. We show in particular that assuming an allometric coefficient for metabolism strongly constrains the range of possible values for the allometric coefficients for birth, death and growth rates.
Niche construction
Participants: Nicolas Champagnat, Coralie Fritsch, Edouard Strickler.
External collaborators: Universidad de Valparaiso, Pontificia Universidad Catolica de Chile, Santa Fe Institute, Universidad de Santiago de Chile, INRAE Montpellier
In collaboration with Rolando Rebolledo, Pablo Marquet, Leonardo Videla, Cristobal Quininao and Nicolas Zalduendo-Vidal, we are working on the modeling of the eco-evolutionary process of niche construction, by which a species or an ecological community is able to modify its environment in such a way that the induced adaptation enhances survival of the species or the community.
In 28, we study the properties of extinction and survival of population models with sublinear growth rate parameterized by an exponent . We point out that this family of models, recently proposed to fit various population growths, may have important flaws depending on the value of the parameter , which make them unrealistic for ecological modeling. We both study birth-death processes and diffusion processes.
We also work on a general modeling approach for niche construction based on birth-death processes of interacting (sub)species immersed in an environment which is influenced by the population state (the so-called niche construction) and which evolves on a slower time-scale. Under the above hypotheses, extinction and/or re-emergence of negligible species on long time scales can be observed. We prove that the joint dynamics of the logarithm of the species sizes and the environment undergo a piecewise deterministic Markov process, which can be approximated by an explicit dynamical system in the limit of large populations. We apply this framework to study the long term co-existence of two specialist species consuming two resources, with a “joint”' niche construction where each species constructs the niche of the other while depleting its resources. We also study an example of immune escape in cancer, where the environmental variable is associated to the state of the immune system and the species are associated to three types of tumor cells. An article is currently being written.
7.1.4 Quasi-stationary distributions
Participants: Nicolas Champagnat, Edouard Strickler, Denis Villemonais.
External collaborators: Michel Benaïm (Univ. Neuchâtel), Alex Cox (Univ. Bath), Emma Horton (Univ. Warwick).
Denis Villemonais obtained in 34 a new criterion for existence and convergence to a quasi-stationary distribution (QSD) based on the spectral theory of positive operators on Banach lattices, building on domination properties for compactness of this type of operators as exposed in 39.
One central question in QSD theory is to construct and prove convergence of numerical schemes for their approximation. In 30, D. Villemonais studied a binary branching model with Moran type interactions introduced in 61. In this interacting particle system, particles evolve, reproduce and die independently and, with a probability that may depend on the configuration of the whole system, the death of a particle may trigger the reproduction of another particle, while a branching event may trigger the death of another particle. We prove optimal bounds for the distance between the empirical distribution of the particle system and the quasi-stationray distribution of the associated mean semi-group.
In 16, N. Champagnat, E. Strickler and D. Villemonais studied the convergence of general penalized Markov processes with soft killing in (Monge-Kantorovich) Wasserstein distance. We propose a simple criterion ensuring uniform convergence of conditional distributions to a unique QSD. We give several examples of application where our criterion can be checked, including Bernoulli convolutions and piecewise deterministic Markov processes, for which convergence in total variation is not possible.
In 29, N. Champagnat and D. Villemonais characterized large classes of non-irreducible absorbed Markov processes with polynomial speed of convergence to a QSD. They applied their criteria to prove the existence of a QSD for general processes in denumerable state spaces, assuming only aperiodicity, the existence of a Lyapunov function and the existence of a point in the state space from which the return time is finite with positive probability.
N. Champagnat and D. Villemonais obtained in 11 general criteria ensuring existence, uniqueness and/or exponential convergence properties for QSD. The criteria were specifically designed to apply to degenerate processes such as hypoelliptic diffusions and allow to improve existing results in this domain.
7.1.5 Multi-type bisexual and weighted branching process
Participants: Coralie Fritsch, Denis Villemonais.
External collaborators: Nicolas Zalduendo-Vidal (INRAE Montpellier).
The quasi-stationary behavior of asexual subcritical multi-type Galton-Watson branching processes is well-known. However, this question was open for bisexual processes, even in the single-type case. C. Fritsch and D. Villemonais studied in 17 the quasi-stationary behavior of the multi-type bisexual branching processes in the subcritical case. We establish the existence of an infinite number of QSDs for the process. Among these distributions, only a finite number has good integrability properties. In addition, when the process is irreducible, its conditonal distributions converge towards a unique QSD. This work is based on our previous work 71, which was the first one studying the criticality of general multi-type bisexual branching processes.
It is well-known that super-critical multi-type classical branching processes follow the strong law of large numbers at large times. In 35, D. Villemonais studied the long-term behavior of weighted multi-type branching processes, focusing on extending classical laws of large numbers and martingale convergence to settings with infinitely many weighted particles, arbitrary type spaces and non-geometric rescaling. We developed applications to Galton-Watson trees indexed by random weights and by random kernels, convergence in Wasserstein distance of the underlying mean semi-group, and convergence of ergodic averages along lineages.
7.1.6 ODE with semi-Markov switching
Participants: Edouard Strickler.
External collaborators: Tobias Hurth (Université de Neuchâtel, Suisse).
In classical PDMP, time durations between random events are distributed as exponential variables. This is made in order to ensure the Markov property of the process but can be debated from a modeling perspective. Indeed, the exponential law allows for arbitrarily fast switches, which is not realistic in most biological or physical models, where a minimum time is required between two environmental changes. Motivated by this question, in collaboration with Tobias Hurth, we developed in 32 a framework for PDMP where the times between jumps of the process can follow any law. We show that if the law of jumping time is sufficently regular, then certain conditions on the deterministic dynamics leading to ergodicity of the process in the case of exponentially distributed jump times can still be used in this general context, even though times between jumps are not arbitrarily short—which was a property of the exponential distributions used crucially in previous references.
7.1.7 Dispersal Induced Growth
Participants: Edouard Strickler.
External collaborators: Michel Benaïm (Université de Neuchâtel, Suisse), Claude Lobry (Université de Nice) and Tewfik Sari (Inrae Montpellier).
In this collaboration, we exhaustively studied the phenomenon of Dispersal Induced Growth (DIG), a term coined by Katriel in 75. In a population spreading across a finite number of patches, individuals in a patch may undergo a time-varying growth rate which is such that, in the absence of migration between the patches, the population will eventually become extinct in each patch. There is Dispersal Induced Growth when adding migration between such patches leads the whole population to grow and survive. In the paper 45 we developed several tools in order to understand when the DIG phenomenon can happen. In case where the graph of migration is connected at each time, then DIG happens if and only if the mean growth rate of an idealized habitat which would have the maximal growth rate at each time is positive. In 13, we consider the case where migration is time dependent and the graph of migration is globally connected but not necessarily connected at each time. In this situation, we exhibit example where there is no DIG. We also give an example of Dispersal Induced Decay, meaning that, in the absence of migration, the population will survive in each patch, while adding migration may lead the whole population to extinction. The study is performed via the sign of a Lyapunov exponent, leading to necessary and sufficient conditions for DIG.
In the recent paper 51, P. Carmona gives an asymptotic formula for the top Lyapunov exponent of a linear -periodic cooperative differential equation, in the limit . The short note 12 discusses and extends this result.
7.1.8 Modeling of chronic obstructive pulmonary disease
Participants: Pierre Vallois.
External collaborators: Isabelle Dupin (Univ. Bordeaux), Élise Maurat (Univ. Bordeaux).
Most of the direct respiratory effects of air pollution result from the disruption of the lungs' natural defense mechanisms. Among the most essential ones is mucociliary clearance, which describes the coordinated effort between mucus produced in the respiratory tract to trap microorganisms and particles and the unidirectional movement of cilia, that propels trapped particles towards the throat. The perturbation of mucociliary clearance can lead to the formation of mucus plugs, abnormally thick mucus accumulations that obstruct the airways. several fundamental aspects of mucus plugs remain poorly understood, particularly how they form, where exactly they localize within the bronchial tree, and how they evolve over time. This year, we have been working on using mathematical modeling to simulate the entire bronchial network and explore plug formation, distribution, and temporal behavior in ways that are currently inaccessible experimentally. A paper is currently being written.
7.1.9 Drawdowns of diffusions
Participants: Pierre Vallois.
External collaborators: Paavo Salminen (Åbo Akademi University, Finland).
In 22, using the excursion theory of diffusion processes, we provide new proofs of, firstly, Lehoczky's formula for the joint distribution of the first drawdown time and the maximum before this time, and, secondly, of Malyutin's formula for the joint distribution of the first hitting time and the maximum drawdown before this time. It is remarkable that the excursion approach which we developed first for Lehoczky's formula also provides a proof for Malyutin's formula. Moreover, we analyze the pure jump process describing the maximum before the first drawdown time when the size of the drawdown is varying.
7.2 Analysis of biological and medical data
7.2.1 Quantifying and predicting the evolution of clonal heterogeneity in chronic lymphocytic leukemia
Participants: Nicolas Champagnat, Coralie Fritsch, Ulysse Herbach, Pierre Vallois, Vidhi Vidhi.
External collaborators: CHRU Strasbourg
The development of targeted therapies has allowed considerable progress in the treatment of many cancers, but their efficacy is dependent on intra-tumor heterogeneity. In lymphomas and leukemias, the identification of gene alterations by high-throughput sequencing allows the characterization of this heterogeneity. In these hemopathies, the initial leukemic clone has a unique immune repertoire corresponding to specific human immunoglobulin genes called VDJ genes encoding the antigen receptor. The occurrence of additional mutations in VDJ genes may be responsible for the emergence of subclones with increased antigen receptor reactivity further complicating the clonal heterogeneity of these hemopathies. However, this second level of clonal heterogeneity and its evolution remain poorly characterized and is not considered in the management of these cancers.
In collaboration with the group of Laurent Vallat in CHRU Strasbourg, we aim to develop a mathematical model for the evolution of the two levels of clonal heterogeneity in leukemia, allowing to characterize their evolution from longitudinal bulk sequencing data of VDJ and cancer genes mutations using a Bayesian approach. This year, we developed a model of the phylogenetic tree of squences of VDJ genes based on the weighted uniform distribution over trees, with weights related to the Hamming distance which counts the number of mutations between sequences. This model allows us to represent with a graph the most probable phylogenetic trees and the uncertainties of certain parts of the tree. We are currently working on the development of variational methods of inference of the tree in the context of unobserved VDJ sequences.
7.2.2 Promoting physical activity and limiting sedentary behaviors to manage pain in endometriosis: analysis of psychosocial variables
Participants: Ulysse Herbach.
External collaborators: Université de Haute-Alsace, Université de Nîmes
Endometriosis is negatively linked to physical activity (PA), as its symptoms and psychosocial barriers may hinder participation in PA. This study compared PA, sedentary behavior (SED), and psychosocial variables in women with and without endometriosis 19. Women with endometriosis reported lower specific SED, poorer quality of life (QoL), greater score of kinesiophobia, more health-related barriers to PA, lower physical self-concepts, and negative PA-related stereotypes (beliefs or perceptions). The amount of PA was most related to previous PA behavior and intention in both groups. In endometriosis, PA and SED are correlated to barriers related to health, motivational variables, beliefs about the risks of PA, physical self-concepts and QoL. Findings highlight the importance of motivational variables and self-concepts, which could be targeted to support engagement in PA to improve symptom management and QoL in women with endometriosis. This study is a first step towards a finer multivariate analysis including a specific treatment of the ordinal nature of the data, for example using latent Gaussian variables.
7.2.3 Uncovering candidate Nanog-Helper genes in early mouse embryo differentiation using differential entropy and network inference
Participants: Ulysse Herbach.
External collaborators: Université libre de Bruxelles, ENS Lyon, Université Clermont Auvergne
In the preimplantation mammalian embryo, stochastic cell-to-cell expression heterogeneity is followed by signal reinforcement to initiate the specification of Inner Cell Mass (ICM) cells into Epiblast (Epi). The expression of NANOG, the key transcription factor for the Epi fate, is necessary but not sufficient: coincident expression of other factors is required. To identify possible Nanog-helper genes, we analyzed gene expression variability in five time-stamped single-cell transcriptomic datasets using differential entropy, a quantitative measure of cell-to-cell heterogeneity 21. The entropy of Nanog displays a peak-shaped temporal pattern from the 16-cell to the 64-cell stage, consistent with its key role in Epi specification. By estimating the entropy profiles of the 21 genes common to all five datasets, we identified three genes - Pecam1, Sox2, and Hnf4a - whose variability in expression patterns mirrors that of Nanog.
We further performed gene regulatory network inference using CARDAMOM 10, an algorithm that exploits temporal dynamics and transcriptional bursting. The results revealed that these three genes exhibit reciprocal activation with Nanog at the 32-cell stage. This regulatory motif reinforces fate-switching decisions and co-expression states. This analysis of single-cell transcriptomic data thus uncovers a likely role for Pecam1, Sox2, and Hnf4a as key genes that, when coincidentally expressed with Nanog, initiate ICM differentiation.
7.2.4 Harnessing ecological niche modeling of Listeria monocytogenes for biopreservation system engineering
Participants: Sandie Ferrigno.
External collaborators: LIBio - Laboratoire d'Ingénierie des Biomolécules (
In this collaboration with the LIBio - Laboratoire d'Ingénierie des Biomolécules of Lorraine University, we work on the presence of pathogens in food. To reduce this presence, hurdle technology, which is based on the use of a combination of several preservative methods, is used by food business operators. Among the multiple available hurdles, biopreservation consists of using microorganisms as protective cultures and/or their metabolites to improve the microbial quality of food. This study explores the potential of ecological niche modeling to guide the selection of biopreservation candidates. A luminescent strain of Listeria monocytogenes was utilized in a multivariate high-throughput competition assay. The resulting data were analyzed using two parallel methods: k-means clustering and Response Surface Modeling. An article 18 has been published on this work.
7.2.5 Nonparametric estimation
Participants: Sandie Ferrigno.
External collaborators: R. Azais (MOSAIC INRIA Team, ENS Lyon), M.-J. Martinez (LJK-Grenoble University)
Many goodness-of-fit tests have been developed to assess the different assumptions of a (possibly heteroscedastic) regression model. Most of them are `directional' in that they detect departures from a given assumption of the model. Other tests are `global' (or `omnibus') in that they assess whether a model fits a dataset on all its assumptions. We focus on the task of choosing the structural part of the regression and the variance functions because they contain easily interpretable information about the studied relationship. We consider two nonparametric `directional' tests and one nonparametric `global' test, all based on generalizations of the Cramér-von Mises statistic. To perform these goodness-of-fit tests, we have developed the R package cvmgof, an easy-to-use tool for practitioners, available from the Comprehensive R Archive Network (CRAN). The package was updated in 2022 (this is its third version)(6.1.1). This latest version currently allows testing the regression function in the model. Since 2025, we worked to enrich the package by allowing the user to test the homoscedasticity/heteroscedasticity of the model. This new version will be submitted to CRAN in 2026 and an associated article is currently being written. In parallel, with the aim of obtaining better estimators in the various tests we are studying, we have been working on bandwidth selection choice, a crucial parameter in nonparametric estimation. In particular, we have relied on cross-validation methods and have also proposed new ones. We plan to integrate these methods into the cvmgof package in 2026 to offer users choices that will improve the quality of the estimators used. We presented these results during the CMStatistics congress in London in December 2025 36.
7.2.6 Online big data analysis and online learning
Participants: Jean-Marie Monnez.
A tool for analyzing streaming data is stochastic approximation introduced by Robbins and Monro in 1951, that can be used for example to estimate online parameters of a regression function 66 or centers of clusters in unsupervised classification 50. Another type of stochastic approximation process, for estimating eigenvectors and eigenvalues of the unknown -symmetric expectation of a random matrix using independent observations of , was introduced by Benzécri in 1969 and studied by several authors, like the Oja process (1985). In this type of process, independent observations of the random matrix are observed and one or a mini-batch of observations per step are taken into account. We defined an extended Oja process where there is a correlation model between the random matrices introduced at each step and where all the observations up to the current step can be taken into account without storing them 85. Firstly, this extends the scope of application of these processes, secondly previous experiments we have conducted show that processes using all the data up to the current step generally converge faster than those using a mini-batch of data.
Canonical components of the canonical analysis of two random vectors are collinear with principal components of a PCA of the muldimensional linear regression function of one vector with respect to the other or projected PCA. In the context of streaming data, we define in 20 processes for estimating online in parallel this regression function and canonical components, possibly taking into account at each step all the data up to this step without storing them and using an extended Oja process 85. We deal with the cases of canonical correlation analysis, factorial correspondence analysis and factorial discriminant analysis.
We also considered the canonical analysis of two random vectors in the context of streaming data or big data. In the work described above, specific stochastic approximation processes of canonical components and of couples of canonical components were defined separately for canonical correlation analysis (CCA), factorial correspondence analysis (FCA) and factorial discriminant analysis (FDA). Here, using an extension of the Oja process, we define general stochastic approximation processes of canonical components and of couples of canonical components of the canonical analysis of two random vectors that can be directly applied to CCA, FCA and FDA. A paper is currently being written.
We are also currently working on an extension of the Oja process to estimate the general and canonical components of a generalized canonical analysis. It can be directly used in mixed data canonical analysis, generalized canonical correlation analysis, multiple correspondence analysis and in the estimation of couples of canonical components in canonical analysis of two random vectors.
Given a non-decreasing sequence of closed convex subsets in a separable real Hilbert space , we are also currently studying the convergence of a stochastic approximation process in with correlated observations, projected at step on , that extends processes of the Robbins-Monro type. We established theorems of almost sure convergence and in quadratic mean and applied them to streaming multidimensional linear regression, using at each step a mini-batch of data or all the data up to this step, and to dynamic generalized linear models when the parameter varies with time.
7.2.7 Diffuse low-grade gliomas
Participants: Sophie Wantz-Mézières.
External collaborators: CRAN BioSIS, CHRU Nancy.
The therapeutic management of patients with diffuse low-grade gliomas (DLGG) is based on monitoring progress through regular MRIs, usually through the reconstructed volume of the tumor (after semiautomatic delineation). But this seems not sufficient. Up to now, the diffuse nature of this kind of tumors has been observed but is not well measured. We designed a new MRI-based variable, the ESVR (ExtraSphere Volume Ratio) to quantify the DLGG brain infiltration and discriminate patterns of patients. A machine learning approach allows us to detect that patients' age and ESVR at diagnosis seem to play an important role, as well as the well-known anatomopathology results. This result has been submitted to Biomedical Signal Processing and Control.
A thesis is currently underway to develop a tool to assist in awake surgery. This tool should make it possible to better identify subtle disorders during surgery that are impossible or difficult for humans to assess, in order to better organize the procedure (detection of possible “tipping points” beyond which the patient is no longer performing) or to stop it (detection of alterations that are generally minor but which, in the patient's socio-professional context will have deleterious consequences on their quality of life). Significant progress has been made up to date. The foundations of this tool have been laid, based on two complementary processes: the detection of motor states through pose recognition, and the transcription of oral responses via speech recognition. In order to identify the most suitable technical configuration for the operating room context, work is currently being carried out on simulated data obtained from volunteers: comparison of voice transcription models on audio recordings, and implementation of a comparative study of pose recognition and motor state classification models, based on videos mimicking the exercises performed in the operating room. Finally, a tool for anonymizing videos from the operating room has been developed, allowing secure access to them without compromising patient identity. This work has been partly presented in congress 37, and a communication has been submitted in ICASSP 2026.
8 Partnerships and cooperations
Participants: Sophie Baland, Virgile Brodu, Nicolas Champagnat, Elie Cerf, Coralie Fritsch, Mathilde Gaillard, Anne Gégout-Petit, Ulysse Herbach, Juan Mardomingo Sanz, Anouk Rago, Édouard Strickler, Pierre Vallois, Vidhi Vidhi, Denis Villemonais, Sophie Wantz-Mézières.
8.1 International initiatives
8.1.1 Associate Teams in the framework of an Inria International Lab or in the framework of an Inria International Program
aStoNiche
-
Title
Towards a stochastic theory of niche construction
-
Duration
2024-2026
-
Coordinators
Nicolas Champagnat and Rolando Rebolledo
-
Partners
- Inria team SIMBA (N. Champagnat, C. Fritsch, E. Strickler)
- Universidad de Valparaiso (R. Rebolledo, N. Rivera)
- Pontificia Universidad Catolica de Chile (P. Marquet, C. Quininao)
- Universidad de Santiago de Chile (L. Videla, N. Zalduendo-Vidal)
-
Objectives
We aim to provide a general stochastic formulation of the niche construction process. In particular, we want to take into account the feedbacks of species on their environment, and the evolutionary aspects that follow. This requires to deal with different time-scales (ecological, niche construction, evolutionary…) and to keep track of non-extinct traits that may be positively selected after niche construction. We plan to use mean-field stochastic individual-based model and branching processes and consider appropriate parameter scalings.
8.1.2 Visits of international scientists
Other international visits to the team
Leonardo Videla
-
Status
Assistant professor
-
Institution of origin:
University of Santiago de Chile
-
Country:
Chile
-
Dates:
September 18 to 24
-
Context of the visit:
Collaboration within the associate team aStoNiche
-
Mobility program/type of mobility:
Research stay
Cristobal Quininao
-
Status
Assistant professor
-
Institution of origin:
University of Santiago de Chile
-
Country:
Chile
-
Dates:
November 3 to 7
-
Context of the visit:
Collaboration within the associate team aStoNiche
-
Mobility program/type of mobility:
Research stay
Nicolas Zalduendo-Vidal
-
Status
Assistant professor
-
Institution of origin:
University of Santiago de Chile
-
Country:
Chile
-
Dates:
December 1 to 5
-
Context of the visit:
Collaboration within the associate team aStoNiche
-
Mobility program/type of mobility:
Research stay
8.1.3 Visits to international teams
Research stays abroad
Elie Cerf
-
Visited institution:
University College London
-
Country:
United Kingdom
-
Dates:
June 30 - July 11
-
Context of the visit:
Collaboration with Alex Watson on multiscale dynamics in fragmentation-coagulation processes.
-
Mobility program/type of mobility:
Research stay
Nicolas Champagnat
-
Visited institution:
Pontificia Universidad Católica de Chile
-
Country:
Chile
-
Dates:
March 8 to 20
-
Context of the visit:
Collaboration within the associate team aStoNiche
-
Mobility program/type of mobility:
Research stay
8.2 European initiatives
8.2.1 H2020 projects
N. Champagnat is scientific collaborator of the ERC SINGER (AdG 101054787) on Stochastic dynamics of sINgle cells, coordinated by S. Méléard (Ecole Polytechnique). He is involved in the research axes “From stochastic processes to singular Hamilton-Jacobi equations” and “Lineages and time reversed trajectories” of this project.
8.3 National initiatives
Projects coordinated by the team:
-
INSERM funding
Project Predi-CLL, ITMO Physics, Mathematics applied to Cancer (from October 2023): “Quantifying and predicting the evolution of clonal heterogeneity in chronic lymphocytic leukemia”. Funding organisms: ITMO Cancer, ITMO Technologies pour la santé de l'alliance nationale pour les sciences de la vie et de la santé (AVIESAN), INCa. Partners: Inria and IECL (Institut Élie Cartan de Lorraine) and CHRU Strasbourg. Leader: N. Champagnat. Participants: C. Fritsch, U. Herbach, P. Vallois, V. Vidhi, D. Villemonais.
-
France 2030 funding
PEPR Exploratoire Maths-VivES (from July 2024), target project DyLT (Dynamics of Telomere Length) on “Influence of telomere length dynamics and environmental conditions on biological and clinical aspects of aging”'. Funding organisms: ANR. Partners: Inria Nancy and Saclay, Institut Élie Cartan de Lorraine (Nancy), CHRU Nancy, Centre de Recherche en Cancérologie de Marseille and Institut de recherche sur le cancer et le vieillissement (Nice). Coordinators: N. Champagnat and A. Benetos (CHRU Nancy). Participants: C. Fritsch, A. Gégout-Petit, J. Mardomingo Sanz, D. Villemonais, S. Baland.
Projects in which the team participates:
-
France 2030 funding
PEPR Santé Numérique (from July 2023), project AI4scMed (Multiscale AI for single-cell-based precision medicine), team involved in WP3: “Regulatory network inference: from dynamical models to logical models”. Funding organisms: ANR. Partners: Inria, Inserm, CNRS. Coordinator: F. Picard (CNRS, ENS Lyon). Participants: M. Gaillard, U. Herbach.
-
VEOLIA funding
Chair “Modélisation Mathématique et Biodiversité” between VEOLIA, Ecole Polytechnique, Museum National d'Histoire Naturelle and Fondation X. Coordinator: S. Méléard. Participants: V. Brodu, N. Champagnat, C. Fritsch, D. Villemonais.
-
ANR funding
ANR JCJC project CRESCENDO (inCRease physical Exercise and Sport to Combat ENDOmetriosis, AAPG 2022). Coordinator: G. Escriva-Boulley (LISEC, Université de Haute-Alsace). Participant: U. Herbach.
-
CNRS funding
GDR 720 IASIS. Leader: C. Richard. Participant: S. Wantz-Mézières.
-
CNRS funding
GDR Réseau Thématique MathSAV. Leader: F. Crauste. Participants: N. Champagnat, C. Fritsch, U. Herbach, A. Rago.
8.4 Regional initiatives
- A. Gégout-Petit is one the two PIs of the interdisciplinary program “Life Travel” of the I-Site “Lorraine Université d'Excellence” on life trajectories and longevity (launched on January 1, 2026.).
- S. Wantz-Mézières received a grant from Interdisciplinary Pilot AAP Life Travel 2025, with Sébastien Hergalant (NGERE).
9 Dissemination
Participants: Sophie Baland, Virgile Brodu, Nicolas Champagnat, Elie Cerf, Sandie Ferrigno, Coralie Fritsch, Mathilde Gaillard, Anne Gégout-Petit, Ulysse Herbach, Vincent Kagan, Juan Mardomingo Sanz, Jean-Marie Monnez, Anouk Rago, Édouard Strickler, Pierre Vallois, Vidhi Vidhi, Denis Villemonais, Sophie Wantz-Mézières.
9.1 Promoting scientific activities
9.1.1 Scientific events: organisation
Member of the organizing committees
- Nicolas Champagnat was member of the organizing committee of ReaDiNet2025: A ReaDiNet workshop on deterministic and stochastic PDEs, held in Obernai from November 24 to 27.
9.1.2 Scientific events: selection
Member of the conference program committees
- Nicolas Champagnat was member of the scientific committee of ReaDiNet2025: A ReaDiNet workshop on deterministic and stochastic PDEs, held in Obernai from November 24 to 27.
9.1.3 Journal
Member of the editorial boards
- N. Champagnat is associate editor for ESAIM: Probability & Statistics and Stochastic Models
- D. Villemonais is associate editor for Applied Probability Trust
9.1.4 Invited talks
- N. Champagnat has been invited to give talks at the Kick-off Programme de recherche Mathématiques en interaction (PEPR Maths Vives) in Montpellier in March, at the 44th Conference on Stochastic Processes and their Applications SPA 2025 in Wroclaw, Poland in July and at the Fall Conference “Model design, optimization & control” of the thematic semester on Population Dynamics in Nice in October. He also gave a seminar talk at “Séminaire du Pôle Probabilités” of CMAP (Ecole Polytechnique) in Palaiseau in April.
- N. Champagnat, E. Strickler and D. Villemonais have been invited to give talks at the Workshop on quasi-stationnary distributions held at CERMICS, Marne-la-Vallée, May 21 to 23.
- C. Fritsch has been invited to give a talk in a minisymposium at the 56ème Journées de Statistiques in Marseille in June. She also gave a talk during the scientific meeting between the Pôle AM2I and the Chamber of agriculture in Nancy in June.
- U. Herbach has been invited to give a talk (hands-on tutorial session) at the Advanced Lecture Course on Computational Systems Biology (CompSysBio 2025) in Aussois in October.
- V. Vidhi gave poster presentation at the European Mathematical Genetics Meeting organized by University of Brest in April and invited talks at International Conference on Mathematical Methods and Model in Biosciences and School for Young Scientists in Bulgaria in June and at InterDoctoral Conference of Lorraine and Luxembourg at IECL, University of Lorraine in November.
- D. Villemonais gave invited talks at the “Séminaire de probabilité et mathématiques financières” of Laboratoire de Mathématiques et Modélisation d'Évry in June, at the Congrès SMF in Dijon in June, at the 13th Aging Research and Drug Discovery Meeting in Copenhagen in August, at the 12th International Conference on Stochastic Analysis and its Applications in Bucarest in September, and at the “Chilean Probability Seminar” of Santiago (online) in October.
9.1.5 Scientific expertise
- N. Champagnat has been a member of the Committee for junior permanent research positions of Centre Inria de Lille.
- N. Champagnat and C. Fritsch were members of the Committee for career advancement of Inria personnel.
- C. Fritsch has been a member of the Committee for junior permanent research positions of Centre Inria de l'Université de Bordeaux and Centre Inria de Paris.
- A. Gégout-Petit has been member of four recruitment committes (Nancy (MCF), Montpellier University (PR), Troyes Technological University (PR) and Institut Henri Poincaré (Director)).
- D. Villemonais has been a member of a recruitment comittee (Strasbourg, PU).
9.1.6 Research administration
- V. Brodu was an elected representative of doctoral students at the doctoral school committee (local scale), and also at the doctoral college committee (regional scale). He was then replaced by M. Gaillard.
- N. Champagnat is elected member of the Commission d'Evaluation of Inria, member of the COMIPERS (hiring committee for non-permanent positions) of Centre Inria de Nancy, substitute member of the Comité de Centre of Centre Inria de Nancy and local researcher (correspondant local) representing the COERLE (Inria's Ethic Committee) at Centre Inria de Nancy.
- C. Fritsch is elected member of the Commission d'Evaluation of Inria.
- A. Gégout-Petit is director of the research unit IECL (Institut Elie Cartan de Lorraine), Mathematics laboratory of Univ. Lorraine (200 members).
- S. Wantz-Mézières is substitute member of the CNU, section 26, college B.
- D. Villemonais is head of the Probability Team in Strasbourg, member of the MSII doctoral school board in University of Strasbourg, and elected member of the “Conseil de l'UFR de mathématiques et informatique” in University of Strasbourg.
9.2 Teaching - Supervision - Juries - Educational and pedagogical outreach
9.2.1 Teaching
- S. Ferrigno is in charge of the “DU Big Data & Data Science” in ENSMN, Univ. Lorraine.
- D. Villemonais is head of L3 DUAS (actuarial science) in Univ. Strasbourg.
- Master: V. Brodu, Distribution theory and PDEs, 40h, M1, second year of ENSEM, Univ. Lorraine.
- Master: V. Brodu, Monte-Carlo methods, 20h, M1, second year of ENSMN, Univ. Lorraine.
- Master: N. Champagnat, Introduction to Quantitative Finance, 12h, M1, second year of ENSMN, Univ. Lorraine.
- Master: N. Champagnat, Introduction to Quantitative Finance, 9h, M2, third year of ENSMN, Univ. Lorraine.
- Master: S. Ferrigno, Experimental designs, 6h, M1, fourth year of EEIGM, Univ. Lorraine.
- Master: S. Ferrigno, Data analyzing and mining, 36h, M1, second year of ENSMN, Univ. Lorraine.
- Master: S. Ferrigno, Modeling and forecasting, 32h, M1, second year of ENSMN, Univ. Lorraine.
- Master: S. Ferrigno, Training projects, 18h, M1/M2, second and third year of ENSMN, Univ. Lorraine.
- Master: C. Fritsch, Inverse problem, 18h, M1, second year of ENSMN, Univ. Lorraine.
- Master: A. Gégout-Petit, Inferential statistics, 20h, M1 IMSD, Univ. Lorraine.
- Master: A. Gégout-Petit, Complex data modelling, 30h, M2 IMSD, Univ. Lorraine.
- Master: D. Villemonais, Probability M1 first and second semester, 56h
- Master: D. Villemonais, Tutoring of student in actuarial science (stage d'alternance)
- Agregation of mathematics training: Denis Villemonais, Modelling lesson
- Master: S. Wantz-Mézières, Unsupervised Learning, 25h, M2 IMSD, Univ. Lorraine
- Master: S. Wantz-Mézières, Advanced image analysis and digital optimization, M2 IS-SNIM, 36h, Univ. Lorraine.
- Licence: S. Baland, Applied probabilities, 20h, L2 Informatique, Univ. Lorraine.
- Licence: S. Baland, Mathematical tools for biology, 22h, L1 Sciences de la Vie, Univ. Lorraine.
- Licence: S. Baland, Probabilities, 20h, L2 Informatique, Univ. Lorraine.
- Licence: V. Brodu, Probability theory and Statistics, 60h, L3, first year of ENSEM, Univ. Lorraine.
- Licence: V. Brodu, Operational Research, 30h, L3, first year of ENSMN, Univ. Lorraine.
- Licence: V. Brodu, Mathematical tools for engineers, 20h, L3, first year of ENSEM, Univ. Lorraine.
- Licence: E. Cerf, Complementary Analysis, 70h, L1 Mathématiques, Univ. Lorraine.
- Licence: E. Cerf, Mathematics, 12h, L1 Professorat des Ecoles, Univ. Lorraine.
- Licence: E. Cerf, Mathematical tools for biology, 22h, L1 Sciences de la Vie, Univ. Lorraine.
- Licence: S. Ferrigno, Descriptive and inferential statistics, 60h, L2, second year of EEIGM, Univ. Lorraine.
- Licence: S. Ferrigno, Statistical modeling, 60h, L2, second year of EEIGM, Univ. Lorraine.
- Licence: S. Ferrigno, Mathematical and computational tools, 20h, L3, third year of EEIGM, Univ. Lorraine.
- Licence: S. Ferrigno, Training projects, 40h, L1/L3, first, second and third year of EEIGM, Univ. Lorraine.
- Licence: M. Gaillard, Inférence statistique, 40h, L3, first year of ENSEM, Univ. Lorraine.
- Licence: V. Kagan, Probability theory tutorial, 40h, L3, first year of ENSMN, Univ. Lorraine.
- Licence: V. Kagan, Numerical Analysis tutorial, 20h, L3, first year of ENSMN, Univ. Lorraine.
- Licence: J. Mardomingo Sanz, Analyse numérique et optimisation, 40h, L3, first year of ENSMN, Univ. Lorraine.
- Licence: S. Wantz-Mézières, Applied Mathematics: Probability, 48h, L3, first year of TELECOM-NANCY, Univ. Lorraine.
- Licence: S. Wantz-Mézières, Mathematical tools for management and Finance, 180h, L1/L2 first and second year of BUT GEA, IUT Nancy-Charlemagne, Univ. Lorraine.
9.2.2 Supervision
PhD
- PhD: Virgile Brodu, “Stochastic individual-based models with allometric dynamics: branching, convergence, numerical simulations”, grant ENS Lyon. Advisors: S. Billiard (Univ. Lille), N. Champagnat, C. Fritsch. Defense on August, 27 24.
- PhD: Anouk Rago, “Inférence de réseaux de gènes dynamiques et prédiction d'expériences d'interventions biologiques dans des cellules cancéreuses”, grant Région Grand-Est and Inria. Advisors: N. Champagnat, A. Gégout-Petit. Defense on June, 30.
- PhD in progress: Sophie Baland, “Telomere length dynamics : modelisation, estimation and application to diagnostic support systems”, funding LUE, since September 2023. Advisors: S. Toupance (Univ. Lorraine) and D. Villemonais.
- PhD in progress: Mathilde Gaillard, “Processus de Markov déterministes par morceaux et inférence bayésienne de réseaux de gènes”, grant PEPR Santé Numérique, since October 2023. Advisors: A. Gégout-Petit, U. Herbach.
- PhD in progress: Anouar Jeddi, “Convergence of individual-based population models to Hamilton-Jacobi equations”, grant ERC SINGER (Ecole Polytechnique), since September 2023. Advisors: S. Méléard (Ecole Polytechnique) and N. Champagnat.
- PhD in progress: Vincent Kagan, “Asymptotic behavior of epidemiological models with individual viral load”, funding Université de Lorraine, since September 2023. Advisors: E. Strickler (Univ. Lorraine) and D. Villemonais.
- PhD in progress: Juan Mardomingo Sanz, “Stochastic modeling and estimation of the distribution of elongation and abrupt shortening of ALT cells in yeast”, funding PEPR Math-VivES, since October 2025. Advisors: N. Champagnat, C. Fritsch, D. Villemonais.
- PhD in progress: Vidhi Vidhi, “Stochastic modeling and statistics for quantifying the evolution of tumor heterogeneity in chronic lymphocytic leukemia”, funding ITMO Cancer, since October 2024. Advisors: N. Champagnat, C. Fritsch, U. Herbach.
- PhD in progress: Léo Gérard, “Development of a tool to assist awake brain tumor surgery based on statistical learning methods”, funding CRAN BioSIS, since october 2024. Advisors: J.M. Moureaux (CRAN), F. Rech (CHRU- CRAN), S. Wantz-Mézières.
HDR
- HDR: Coralie Fritsch, “Asymptotic behavior of probabilistic models for biology: criticality of processes, limit theorems, quasi-stationary behavior, model approximation”. Defense on December, 15 25.
Other
- ENSMN third year (M2) internship: M. Ammar, “Machine learning tasks in the context of air freight through the Cargostack application”. Advisors: M. Bleu (Wiremind cargo), S. Ferrigno (ENSMN).
- ENSMN third year (M2) internship: A. Blanchard, “Development of a rating model specific to badminton players”. Advisors: E.Hollville (Fédération française de badminton), S. Ferrigno (ENSMN).
- M2 internship: Roxane Cellier (Sorbonne Univ.), “Inference of kinetics for large multi-scale chemical reaction networks”. Advisors: U. Herbach and J. Unterberger (IECL).
- M2 internship: Juan Mardomingo Sanz (M2 Mathématiques pour les Sciences du Vivant of Paris-Saclay Univ.), “Stochastic modeling and estimation of the distribution of elongation of ALT cells in yeast”, funded by the interdisciplinary program “Life Travel” of the I-Site “Lorraine Université d'Excellence”, from April to September 2025. Advisors: N. Champagnat, C. Fritsch, D. Villemonais.
- M2 internship: Mattheo Rapenne (Univ. Lorraine), “Diagnosis of heart failure in the emergency service”. Advisors: A. Gégout-Petit (IECL), N. Girerd (CHRU Nancy).
- M1 internship: Lorenzo Boussion (ENS Paris-Saclay), “Necessary condition for dispersal induced growth on time periodic networks”. Advisor: E. Strickler.
- M1 internship: Thibaut Pannet (ENSTA Paris), “Variational Bayesian inference for single-cell transcriptomic data”. Advisor: U. Herbach.
- TELECOM-NANCY PIDR second-year project: N. Chatonnier, R. Samba, N. Bermond, “Comparison of deep learning models for automatic speech recognition: application to awake brain surgery”. Advisors: S. Wantz-Mézières, with J.M. Moureaux and L. Gérard (CRAN).
- ENSMN second year (M1) Department project: Y. Souidi, Y. Laribi, “Multiple linear regression and regularization methods”. Advisor: S. Ferrigno.
9.2.3 Juries
- N. Champagnat was referee for the PhD thesis of Nathanaël Boutillon (Aix-Marseille Univ., 11/06/2025).
- N. Champagnat, C. Fritsch and A. Gégout-Petit were examiners for the PhD thesis of Virgile Brodu (Univ. Lorraine, 27/08/2025).
- N. Champagnat and A. Gégout-Petit were examiners for the PhD thesis of Anouk Rago (Univ. Lorraine, 30/06/2025).
- N. Champagnat and A. Gégout-Petit were examiners for the HDR jury of Coralie Fritsch (Univ. Lorraine, 15/12/2025).
- A. Gégout-Petit was president the PhD jury of Trinh Duong (Lorraine University, 06/2025).
- A. Gégout-Petit was referee for the PhD jury of Cristina Chavez (Nanterre University, 04/2025).
- A. Gégout-Petit was referee for the PhD jury of Julie Cartier (PSL University, 11/2025).
- A. Gégout-Petit was referee for the PhD jury of Valentin Portmann (Univ. Bordeaux, 11/2025)
- S. Wantz-Mézières was examiner for the PhD thesis of Ghislain Fievet (NGERE, Univ. Lorraine, 20/11/2025).
- U. Herbach was examiner for the PhD thesis of Emrys Reginato (Univ. Grenoble Alpes, 24/11/2025).
- U. Herbach was examiner for the PhD thesis of Gustavo Magaña Loópez (Univ. Bordeaux, 17/12/2025).
- E. Strickler was referee for the PhD thesis of Jérémy Colombo (Univ. Neuchâtel, 20/11/25).
- D. Villemonais was referee for the PhD thesis of Jules Olayé (Ecole Polytechnique, Palaiseau, 04/07/2025)
9.3 Popularization
J.-M. Monnez wrote lecture notes 38 on the interpretation of canonical analysis of two random vectors as a projected principal component analysis (PCA), an extended Oja process for estimating eigenvectors and stochastic approximation for streaming canonical correlation, factorial correspondence and factorial discriminant analyses.
9.3.1 Specific official responsibilities in science outreach structures
- S. Ferrigno: Advisor of groups of EEIGM students in the context of “La main à la Pâte” projects and “CGénial” projects, at middle schools Paul Verlaine in Malzéville and La Craffe in Nancy, at Institut médico-éducatif (IME) in Commercy and in elementary schools in Nancy.
- S. Ferrigno: Advisor of a group of EEIGM students, “Ateliers expérimentaux : Mécanique et Statistique” project, in various high schools in Nancy.
9.3.2 Participation in Live events
- J. Mardomingo Sanz and V. Brodu volunteered in the popularization event “Fête de la Science” in Univ. Lorraine in October.
10 Scientific production
10.1 Major publications
- 1 articlecvmgof: an R package for Cramér-von Mises goodness-of-fit tests in regression models.Journal of Statistical Computation and Simulation9262022, 1246-1266HALDOIback to text
- 2 articleConvergence of population processes with small and frequent mutations to the canonical equation of adaptive dynamics.The Annals of Applied Probability351February 2025, 1-63HALDOIback to textback to text
- 3 inproceedingsMultiscale eco-evolutionary models: from individuals to populations.International Congress of Mathematicians, ICM 202271fully virtually, RussiaEMS PressDecember 2023, 5656-5678HALDOI
- 4 articleGeneral criteria for the study of quasi-stationarity.Electronic Journal of Probability2023. In press. HALDOIback to textback to text
- 5 articleShort-range interactions between fibrocytes and CD8+ T cells in COPD bronchial inflammatory response.eLifeOctober 2022HALDOI
- 6 miscMechanistic-statistical model for the expansion of ash dieback.September 2024HAL
- 7 articleThe Multi-type Bisexual Galton-Watson Branching Process.Annales de l'Institut Henri Poincaré (B) Probabilités et Statistiques2024, 36p.In press. HALDOI
- 8 articleStochastic approximation of eigenvectors and eigenvalues of the Q-symmetric expectation of a random matrix.Communications in Statistics - Theory and Methods5352024, 1669-1683HALDOI
- 9 articleThe individual’s signature of telomere length distribution.Scientific Reports91January 2019, 1-8HALDOIback to text
- 10 articleOne model fits all: Combining inference and simulation of gene regulatory networks.PLoS Computational Biology193March 2023, e1010962HALDOIback to textback to text
10.2 Publications of the year
International journals
Conferences without proceedings
Doctoral dissertations and habilitation theses
Reports & preprints
Other scientific publications
Educational activities
10.3 Cited publications
- 39 bookPositive operators.119Springer Science & Business Media2006back to text
- 40 articleError analysis of tau-leap simulation methods.Ann. Appl. Probab.2162011, 2226--2262URL: https://doi.org/10.1214/10-AAP756DOIback to text
- 41 unpublishedcvmgof: Cramer-von Mises goodness-of-fit tests.November 2018, An R-package, available on the CRANHALSoftware Heritageback to text
- 42 unpublishedA statistical methodology to select covariates in high-dimensional data under dependence. Application to the classification of genetic profiles in oncology.September 2019, https://arxiv.org/abs/1909.05481 - working paper or preprintHALback to text
- 43 articleStochastic approximation of quasi-stationary distributions for diffusion processes in a bounded domain.Annales de l'Institut Henri Poincaré, Probabilités et Statistiques5722021, 726 -- 739URL: https://doi.org/10.1214/20-AIHP1093DOIback to text
- 44 articleStochastic approximation of quasi-stationary distributions on compact spaces and applications.Ann. Appl. Probab.2842018, 2370--2416URL: https://doi.org/10.1214/17-AAP1360DOIback to text
- 45 articleWhen can a population spreading across sink habitats persist?Journal of Mathematical Biology8819January 2024, 1-56HALDOIback to textback to text
- 46 articleShort leukocyte telomere length precedes clinical expression of atherosclerosis: The blood-and-muscle model.Circulation research1224Feb 2018, 616--623DOIback to text
- 47 articleWASABI: a dynamic iterative framework for gene regulatory network inference.BMC Bioinformatics2012019, 1--19back to text
- 48 articleSimulation and analysis of an individual-based model for graph-structured plant dynamics.Ecological Modelling2342012, 93--105back to textback to text
- 49 articleWeak convergence of a mass-structured individual-based model.Appl. Math. Optim.7212015, 37--73URL: https://doi.org/10.1007/s00245-014-9271-3DOIback to textback to text
- 50 articleA fast and recursive algorithm for clustering large datasets with k-medians.Computational Statistics & Data Analysis5662012, 1434--1449back to textback to text
- 51 inproceedingsAsymptotic of the largest Floquet multiplier for cooperative matrices.Annales de la Faculté des sciences de Toulouse: Mathématiques3142022, 1213--1221back to text
- 52 articleA microscopic interpretation for adaptive dynamics trait substitution sequence models.Stoch. Process. Appl.11682006, 1127--1160back to textback to text
- 53 articleFrom individual stochastic processes to macroscopic models in adaptive evolution.Stoch. Models24suppl. 12008, 2--44URL: http://dx.doi.org/10.1080/15326340802437710DOIback to text
- 54 articleUnifying evolutionary dynamics: From individual stochastic processes to macroscopic evolution.Theor. Pop. Biol.692006, 297--321back to textback to text
- 55 articleThe evolutionary limit for models of populations interacting competitively via several resources.J. Differential Equations25112011, 176--195URL: https://doi.org/10.1016/j.jde.2011.03.007DOIback to text
- 56 articlePolymorphic evolution sequence and evolutionary branching.Probab. Theory Related Fields1511-22011, 45--94URL: https://doi.org/10.1007/s00440-010-0292-9DOIback to textback to textback to text
- 57 articleStochastic analysis of emergence of evolutionary cyclic behavior in population dynamics with transfer.The Annals of Applied Probability3142021, 1820--1867back to text
- 58 articleConvergence of the Fleming-Viot process toward the minimal quasi-stationary distribution.ALEA - Latin American Journal of Probability and Mathematical Statisticsto appear2019back to text
-
59
articleExponential convergence to quasi-stationary distribution and
-process.Probab. Theory Related Fields1641-22016, 243--283URL: https://doi.org/10.1007/s00440-014-0611-7DOIback to textback to text - 60 articleGaussian approximations for chemostat models in finite and infinite dimensions.J. Math. Biol.7542017, 805--843URL: https://doi.org/10.1007/s00285-017-1097-6DOIback to text
- 61 inproceedingsBinary branching processes with Moran type interactions.Annales de l'Institut Henri Poincare (B) Probabilites et statistiques612Institut Henri Poincaré2025, 917--952back to text
- 62 incollectionMeasure-valued Markov processes.École d'Été de Probabilités de Saint-Flour XXI---19911541Lecture Notes in Math.BerlinSpringer1993, 1--260back to text
- 63 articleThe dynamical theory of coevolution: a derivation from stochastic ecological processes.J. Math. Biol.345-61996, 579--612back to text
- 64 articleThe dynamics of adaptation: An illuminating example and a Hamilton-Jacobi approach.Theor. Pop. Biol.672005, 257--271back to text
- 65 articleAdmissible speeds in spatially periodic bistable reaction-diffusion equations.Advances in Mathematics3892021, 107889back to text
- 66 articleSequential linear regression with online standardized data.Plos one1312018, e0191186back to text
- 67 articleAn omnibus test of goodness-of-fit for conditional distributions with applications to regression models.J. Statist. Plann. Inference142102012, 2748--2761URL: https://doi.org/10.1016/j.jspi.2012.04.008DOIback to text
- 68 articleFleming-Viot processes in population genetics.SIAM J. Control Optim.3121993, 345--386back to text
- 69 unpublishedMechanistic-statistical model for the expansion of ash dieback.September 2024, working paper or preprintHALback to text
- 70 articleA modeling approach of the chemostat.Ecological Modelling2014back to text
- 71 articleThe Multi-type Bisexual Galton-Watson Branching Process.Annales de l'Institut Henri Poincaré (B) Probabilités et Statistiques6042024, 2975-3008HALDOIback to text
- 72 articleThe revisited knockoffs method for variable selection in L 1-penalized regressions.Communications in Statistics - Simulation and Computation002020, 1-14URL: https://doi.org/10.1080/03610918.2020.1775850DOIback to text
- 73 articleInferring gene regulatory networks from single-cell data: a mechanistic approach.BMC Syst. Biol.1112017back to textback to textback to text
- 74 articleSmall populations corrections for selection-mutation models.Netw. Heterog. Media742012, 805--836URL: https://doi.org/10.3934/nhm.2012.7.805DOIback to text
- 75 articleDispersal-induced growth in a time-periodic environment.Journal of Mathematical Biology8532022, 24back to text
- 76 articleHybrid spatial Gillespie and particle tracking simulation.Bioinformatics28182012, i549--i555back to text
- 77 articleStochastic modeling of a gene regulatory network driving B cell development in germinal centers.PLoS ONE193March 2024, e0301022HALDOIback to text
- 78 articleStreaming constrained binary logistic regression with online standardized data.Journal of Applied StatisticsIn press2021HALDOIback to text
- 79 articleDirac mass dynamics in multidimensional nonlocal parabolic equations.Comm. Partial Differential Equations3662011, 1071--1098URL: http://dx.doi.org/10.1080/03605302.2010.538784DOIback to text
- 80 incollectionAdaptive dynamics, a geometrical study of the consequences of nearly faithful reproduction.Stochastic and spatial structures of dynamical systems (Amsterdam, 1995)Konink. Nederl. Akad. Wetensch. Verh. Afd. Natuurk. Eerste Reeks, 45AmsterdamNorth-Holland1996, 183--231back to text
- 81 articleA singular Hamilton-Jacobi equation modeling the tail problem.SIAM J. Math. Anal.4462012, 4297--4319URL: https://doi.org/10.1137/100819527DOIback to text
- 82 articleApproximation stochastique en analyse factorielle multiple.Ann. I.S.U.P.5032006, 27--45back to text
- 83 articleConvergence d'un processus d'approximation stochastique en analyse factorielle.Publ. Inst. Statist. Univ. Paris3811994, 37--55back to text
- 84 unpublishedConvergence of a normed eigenvector stochastic approximation process and application to online principal component analysis of a data stream.May 2019, working paper or preprintHALback to text
- 85 articleStochastic approximation of eigenvectors and eigenvalues of the Q-symmetric expectation of a random matrix.Communications in Statistics - Theory and Methods53515 pages2024, 1669-1683HALDOIback to textback to text
- 86 articleStochastic approximation of the factors of a generalized canonical correlation analysis.Statist. Probab. Lett.78142008, 2210--2216URL: http://dx.doi.org/10.1016/j.spl.2008.01.088DOIback to text
- 87 articleOn the evolution of large clusters in the Becker-Döring model.Journal of Nonlinear Science1312003, 115--122back to text
- 88 articleDirac concentrations in Lotka-Volterra parabolic PDEs.Indiana Univ. Math. J.5772008, 3275--3301URL: http://dx.doi.org/10.1512/iumj.2008.57.3398DOIback to text
- 89 articleSurvival thresholds and mortality rates in adaptive dynamics: conciliating deterministic and stochastic simulations.Math. Med. Biol.2732010, 195--210URL: https://doi.org/10.1093/imammb/dqp018DOIback to text
- 90 articleSingle-cell-based analysis highlights a surge in cell-to-cell molecular variability preceding irreversible commitment in a differentiation process.PLOS Biology14122016back to textback to text
- 91 articleInteracting particle systems and Yaglom limit approximation of diffusions with unbounded drift.Electron. J. Probab.162011, no. 61, 1663--1692URL: https://doi.org/10.1214/EJP.v16-925DOIback to text
- 92 article20 questions on adaptive dynamics.J. Evol. Biol.182005, 1139--1154back to textback to text