2025 Spring

The seminar of this semester is organized by Qiang Wu and Ming Li, and co-organized by the graduate student union in the School of Mathematical Sciences at Fudan. This section is partially sponsored by Shanghai Key Laboratory for Contemporary Applied Mathematics.

2025-04-24 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand This work considers the simple bilevel optimization problem, which involves minimizing a composite convex function over the optimal solution set of another composite convex minimization problem. By reformulating this bilevel problem as finding the left-most root of a nonlinear equation and introducing a novel dual approach for the subproblems, we efficiently obtain an $(\epsilon, \epsilon)$-optimal solution. The proposed methods achieve near-optimal complexity of $\tilde{\mathcal{O}}(1/\sqrt{\epsilon})$ for both the upper- and lower-level objectives under mild assumptions, aligning with the optimal complexity bounds of first-order methods in unconstrained smooth or composite convex optimization when ignoring logarithmic terms.

Past Presentations

2025-02-20 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand In this talk, I present an asymptotic stability result concerning the self-similar Blasius profiles $[\bar{u}, \bar{v}]$ of the stationary Prandtl boundary layer equation. Initially demonstrated by Serrin (1967, Proc.\ R.\ Soc.\ Lond), the profiles $[\bar{u}, \bar{v}]$ were shown to act as a self-similar attractor of solutions $[u, v]$ to the Prandtl equation through the use of von Mises transform and maximal principle techniques. Specifically, as $x \to \infty$, $\|u - \bar{u}\|_{L^{\infty}_{y}} \to 0$. Iyer(2020, ARMA) employed refined energy methods to derive an explicit convergence rate for initial data close to Blasius. Wang and Zhang(2023, Math.\ Ann.) utilized barrier function methods, removing smallness assumptions but imposing stronger asymptotic conditions on the initial data. It was suggested that the optimal convergence rate should be $\|u-\bar{u}\|_{L^{\infty}_{y}}\lesssim (x+1)^{-\frac{1}{2}}$, treating the stationary Prandtl equation as a 1-D parabolic equation in the entire space. In our work, we establish that $\|u - \bar{u}\|_{L^{\infty}_{y}} \lesssim (x+1)^{-1}$. Our proof relies on discovering nearly conserved low-frequency quantities and inherent degenerate structures at the boundary, which enhance the convergence rate through iteration techniques. Notably, the convergence rate we have demonstrated is optimal. We can find special solutions of Prandtl's equation such that the convergence between the solutions and the Blasius profile is exact, represented as $ (x+1)^{-1} $. This is a joint work with Prof. Hao Jia and Prof. Zhen Lei.

2025-02-27 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand Solving the time-independent Schrödinger equation gives us full access to the chemical properties of molecules. Among all the ab-initio methods, full configuration interaction (FCI) provides the numerically exact solution under a predefined basis set. However, the FCI problem scales exponentially with respect to the number of bases and electrons and suffers from the curse of dimensionality. We develop a multi-threaded parallel coordinate descent full configuration interaction algorithm, for the electronic structure ground-state calculation in the configuration interaction framework. The algorithm solves an unconstrained nonconvex optimization problem, via a modified block coordinate descent method with a deterministic compression strategy. CDFCI captures and updates appreciative determinants with different frequencies proportional to their importance. We demonstrate the efficiency of the algorithm on practical systems.

2025-03-06 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand In this talk, a numerical method for a class of nonlocal PDEs with long time delay is designed. The system involves a variable on $\Omega\times\mathbb{R}\times\mathbb{R}^{+}$, in which case for $\Omega\subset\mathbb{R}^{d}$, a $(d+2)$-dimensional problem is to be solved numerically, which is challenging, especially for $d=2$ or $d=3$. In this talk, we propose an effective numerical method: BDF schemes and Fourier spectral method are applied for time and space discretization respectively, and the long time delay term is treated by Laguerre spectral method. The unique solvability of the numerical schemes is proved, and the energy upper bound of the numerical solution for the long time is given by energy estimation. By applying the generalized Laguerre orthogonal projection, we obtain the error estimate within finite final time for the fully discretization. We present some numerical experiments to verify the energy bound and convergence order. Also, examples are given to show how the solutions evolve and approach the global attractor.

2025-03-13 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand Interactive Theorem Provers (ITPs), often referred to as formal languages, offer a reliable method to eliminate errors in mathematical reasoning. Meanwhile, Large Language Models (LLMs) have shown great potential to accelerate—and even automate—the formalization process. In this talk, we will explore how LLMs are applied in key areas such as premise selection, tactic suggestion, auto-formalization, and automated theorem proving. Additionally, we will discuss how training datasets for these tasks are constructed, highlighting the impact of structural information on improving LLMs' performance in Lean-related tasks, particularly in LeanSearch and our statement formalizer.

2025-03-20 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand In this talk, the computation of the matrix pair describing the first passage time of a Markov additive process is considered. This pair of matrices is characterized as a solution to an integral matrix equation, for which we develop an iterative method. At each step, it requires computing the extremal solution to a mixed linear-quadratic matrix equation, which is accomplished by a quadratically convergent algorithm. When all the jumps are of phase-type distribution, the integral matrix equation can be transformed into a single mixed linear-quadratic matrix equation, and thus the pair of matrices can be computed with quadratic convergence.

2025-03-27 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand Accurately identifying and predicting dynamics from observational data with noise perturbations or data missing is a significant challenge in the field of dynamical systems. In my talk, I will introduce the Hamiltonian Neural Koopman Operator (HNKO), a novel approach that combines principles from Hamiltonian mechanics with the learning of the Koopman operator. This framework not only sustains but also discovers conservation laws automatically, leveraging my foundational knowledge of mathematical physics. The effectiveness of the HNKO and its extensions are demonstrated across various representative physical systems, even those with hundreds or thousands of degrees of freedom. The findings indicate that incorporating prior knowledge of the underlying system and relevant mathematical theories into the learning framework significantly enhances the ability of machine learning to address complex physical problems.

2025-04-03 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand In this report, we present a neural network approach to address the dynamic unbalanced optimal transport problem on surfaces with point cloud representation. For surfaces with point cloud representation, traditional method is difficult to apply due to the difficulty of mesh generating. Neural network is easy to implement even for complicate geometry. Moreover, instead of solving the original dynamic formulation, we consider the Hamiltonian flow approach, i.e. Karush-Kuhn-Tucker system. Based on this approach, we can exploit mathematical structure of the optimal transport to construct the neural network and the loss function can be simplified. Extensive numerical experiments are conducted for surfaces with different geometry. We also test the method for point cloud with noise, which shows stability of this method. This method is also easy to generalize to diverse range of problems.

2025-04-10 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand The nonlinear Schrödinger equation (NLSE) arises from various applications in quantum physics and chemistry, nonlinear optics, plasma physics, Bose--Einstein Condensates, etc. In these applications, it is necessary to incorporate low-regularity or singular potential and nonlinearity into the NLSE. Typical examples of such potential and nonlinearity include the discontinuous square-well potential, the singular Coulomb potential, the non-integer power nonlinearity, and the logarithmic nonlinearity. Such low regularity and singularity pose significant challenges in the analysis of standard numerical methods and the development of novel accurate, efficient, and structure-preserving schemes. In this talk, I will introduce several new analysis techniques to establish optimal error bounds for some widely used numerical methods under optimally weak regularity assumptions. Based on the analysis, we also propose novel temporal and spatial discretizations to handle the low regularity and singularity more effectively.

2025-04-17 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower [poster]

Abstract: Click to expand Policy optimization refers to a family of effective algorithms which search in the policy space based on policy parameterization to solve reinforcement learning problems. Inspired by the similar update pattern of softmax natural policy gradient and Hadamard policy gradient, we propose to study a general policy update rule called $\phi$-update, where $\phi$ refers to a scaling function on advantage functions. Under very mild conditions on $\phi$, the global asymptotic state value convergence of $\phi$-update is firstly established. Then we show that the policy produced by $\phi$-update indeed converges, even when there are multiple optimal policies. This is in stark contrast to existing results where explicit regularizations are required to guarantee the convergence of the policy. The exact asymptotic convergence rate of state values is further established based on the policy convergence. Lastly, we establish the global linear convergence of $\phi$-update.