2025 Spring
The seminar of this semester is organized by Qiang Wu and Ming Li, and co-organized by the graduate student union in the School of Mathematical Sciences at Fudan. This section is partially sponsored by Shanghai Key Laboratory for Contemporary Applied Mathematics.
2025-04-24 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
Near-Optimal Algorithms for Convex Simple Bilevel Optimization under Weak Assumptions
- Speaker: Xu Shi (Fudan University)
- Advisor: Rujun Jiang (Rujun Jiang)
Abstract: Click to expand
This work considers the simple bilevel optimization problem, which involves minimizing a composite convex function over
the optimal solution set of another composite convex minimization problem. By reformulating this bilevel problem as finding
the left-most root of a nonlinear equation and introducing a novel dual approach for the subproblems, we efficiently obtain
an $(\epsilon, \epsilon)$-optimal solution. The proposed methods achieve near-optimal complexity of $\tilde{\mathcal{O}}(1/\sqrt{\epsilon})$
for both the upper- and lower-level objectives under mild assumptions, aligning with the optimal complexity bounds of
first-order methods in unconstrained smooth or composite convex optimization when ignoring logarithmic terms.
Past Presentations
2025-02-20 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
Sharp Asymptotic Stability of Blasius Profile in the Steady Prandtl Equation
- Speaker: Cheng Yuan (Fudan University)
- Advisor: Zhen Lei (Fudan University)
Abstract: Click to expand
In this talk, I present an asymptotic stability result concerning
the self-similar Blasius profiles $[\bar{u}, \bar{v}]$ of the
stationary Prandtl boundary layer equation. Initially demonstrated by
Serrin (1967, Proc.\ R.\ Soc.\ Lond), the profiles $[\bar{u}, \bar{v}]$
were shown to act as a self-similar attractor of solutions $[u, v]$ to
the Prandtl equation through the use of von Mises transform and maximal
principle techniques. Specifically, as $x \to \infty$,
$\|u - \bar{u}\|_{L^{\infty}_{y}} \to 0$. Iyer(2020, ARMA) employed
refined energy methods to derive an explicit convergence rate for initial
data close to Blasius. Wang and Zhang(2023, Math.\ Ann.) utilized barrier
function methods, removing smallness assumptions but imposing stronger asymptotic
conditions on the initial data. It was suggested that the optimal convergence rate
should be $\|u-\bar{u}\|_{L^{\infty}_{y}}\lesssim (x+1)^{-\frac{1}{2}}$,
treating the stationary Prandtl equation as a 1-D parabolic equation
in the entire space.
In our work, we establish that $\|u - \bar{u}\|_{L^{\infty}_{y}} \lesssim (x+1)^{-1}$.
Our proof relies on discovering nearly conserved low-frequency quantities and inherent
degenerate structures at the boundary, which enhance the convergence rate through iteration
techniques. Notably, the convergence rate we have demonstrated is optimal. We can find
special solutions of Prandtl's equation such that the convergence between the solutions
and the Blasius profile is exact, represented as $ (x+1)^{-1} $. This is a joint work
with Prof. Hao Jia and Prof. Zhen Lei.
2025-02-27 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
Parallel Multi-Coordinate Descent Methods for Full Configuration Interaction
- Speaker: Yuejia Zhang (Fudan University)
- Advisor: Yingzhou Li (Fudan University)
Abstract: Click to expand
Solving the time-independent Schrödinger equation gives us full access to the chemical
properties of molecules. Among all the ab-initio methods, full configuration interaction (FCI)
provides the numerically exact solution under a predefined basis set. However, the FCI
problem scales exponentially with respect to the number of bases and electrons and suffers
from the curse of dimensionality. We develop a multi-threaded parallel coordinate descent
full configuration interaction algorithm, for the electronic structure ground-state calculation
in the configuration interaction framework. The algorithm solves an unconstrained nonconvex
optimization problem, via a modified block coordinate descent method with a deterministic
compression strategy. CDFCI captures and updates appreciative determinants with different
frequencies proportional to their importance. We demonstrate the efficiency of the algorithm
on practical systems.
2025-03-06 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
A BDF-Spectral Method for Nonlocal PDEs With Long Time Delay
- Speaker: Shuxun Shi (Fudan University)
- Advisor: Wenbin Chen (Fudan University)
Abstract: Click to expand
In this talk, a numerical method for a class of nonlocal PDEs with long time delay is designed.
The system involves a variable on $\Omega\times\mathbb{R}\times\mathbb{R}^{+}$, in which case for
$\Omega\subset\mathbb{R}^{d}$, a $(d+2)$-dimensional problem is to be solved numerically, which is
challenging, especially for $d=2$ or $d=3$. In this talk, we propose an effective numerical method:
BDF schemes and Fourier spectral method are applied for time and space discretization respectively,
and the long time delay term is treated by Laguerre spectral method. The unique solvability of the
numerical schemes is proved, and the energy upper bound of the numerical solution for the long time
is given by energy estimation. By applying the generalized Laguerre orthogonal projection, we
obtain the error estimate within finite final time for the fully discretization. We present some
numerical experiments to verify the energy bound and convergence order. Also, examples are given
to show how the solutions evolve and approach the global attractor.
2025-03-13 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
Applications of Large Language Models in Formal Reasoning
- Speaker: Guoxiong Gao (Peking University)
- Advisor: Bin Dong (Peking University)
Abstract: Click to expand
Interactive Theorem Provers (ITPs), often referred to as formal languages, offer a reliable method
to eliminate errors in mathematical reasoning. Meanwhile, Large Language Models (LLMs) have shown
great potential to accelerate—and even automate—the formalization process. In this talk, we will
explore how LLMs are applied in key areas such as premise selection, tactic suggestion, auto-formalization,
and automated theorem proving. Additionally, we will discuss how training datasets for these tasks
are constructed, highlighting the impact of structural information on improving LLMs' performance
in Lean-related tasks, particularly in LeanSearch and our statement formalizer.
2025-03-20 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
Computation of First Passage of Markov Additive Processes
- Speaker: Junxin Zhang (Fudan University)
- Advisor: Jungong Xue (Fudan University)
Abstract: Click to expand
In this talk, the computation of the matrix pair describing the first passage time of a Markov additive
process is considered. This pair of matrices is characterized as a solution to an integral matrix equation,
for which we develop an iterative method. At each step, it requires computing the extremal solution to
a mixed linear-quadratic matrix equation, which is accomplished by a quadratically convergent algorithm.
When all the jumps are of phase-type distribution, the integral matrix equation can be transformed into
a single mixed linear-quadratic matrix equation, and thus the pair of matrices can be computed with
quadratic convergence.
2025-03-27 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
AI for Physics: Learning Hamiltonian Systems and Conservation Laws
- Speaker: Jingdong Zhang (Fudan University)
- Advisor: Wei Lin (Fudan University)
Abstract: Click to expand
Accurately identifying and predicting dynamics from observational data with noise perturbations or data
missing is a significant challenge in the field of dynamical systems. In my talk, I will introduce the
Hamiltonian Neural Koopman Operator (HNKO), a novel approach that combines principles from Hamiltonian
mechanics with the learning of the Koopman operator. This framework not only sustains but also discovers
conservation laws automatically, leveraging my foundational knowledge of mathematical physics.
The effectiveness of the HNKO and its extensions are demonstrated across various representative physical
systems, even those with hundreds or thousands of degrees of freedom. The findings indicate that
incorporating prior knowledge of the underlying system and relevant mathematical theories into the learning
framework significantly enhances the ability of machine learning to address complex physical problems.
2025-04-03 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
A Network Based Approach for Unbalanced Optimal Transport on Surfaces
- Speaker: Jiangong Pan (Tsinghua University)
- Advisor: Zuoqiang Shi (Tsinghua University)
Abstract: Click to expand
In this report, we present a neural network approach to address the dynamic unbalanced optimal transport
problem on surfaces with point cloud representation. For surfaces with point cloud representation, traditional
method is difficult to apply due to the difficulty of mesh generating. Neural network is easy to implement
even for complicate geometry. Moreover, instead of solving the original dynamic formulation, we consider the
Hamiltonian flow approach, i.e. Karush-Kuhn-Tucker system. Based on this approach, we can exploit mathematical
structure of the optimal transport to construct the neural network and the loss function can be simplified.
Extensive numerical experiments are conducted for surfaces with different geometry. We also test the method
for point cloud with noise, which shows stability of this method. This method is also easy to generalize to
diverse range of problems.
2025-04-10 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
Numerical Analysis for Nonlinear Schrödinger Equations with Low Regularity or Singularity
- Speaker: Chushan Wang (National University of Singapore)
- Advisor: Weizhu Bao (National University of Singapore)
Abstract: Click to expand
The nonlinear Schrödinger equation (NLSE) arises from various applications in quantum physics and chemistry,
nonlinear optics, plasma physics, Bose--Einstein Condensates, etc. In these applications, it is necessary to
incorporate low-regularity or singular potential and nonlinearity into the NLSE. Typical examples of such potential
and nonlinearity include the discontinuous square-well potential, the singular Coulomb potential, the non-integer
power nonlinearity, and the logarithmic nonlinearity. Such low regularity and singularity pose significant
challenges in the analysis of standard numerical methods and the development of novel accurate, efficient,
and structure-preserving schemes.
In this talk, I will introduce several new analysis techniques to establish optimal error bounds for some widely
used numerical methods under optimally weak regularity assumptions. Based on the analysis, we also propose novel
temporal and spatial discretizations to handle the low regularity and singularity more effectively.
2025-04-17 16:10:00 - 17:00:00 @ Rm 1801, Guanghua East Tower
[poster]
- Title:
$\phi$-Update: A Class of Policy Update Methods with Policy Convergence Guarantee
- Speaker: Wenye Li (Fudan University)
- Advisor: Ke Wei (Fudan University)
Abstract: Click to expand
Policy optimization refers to a family of effective algorithms which search in the policy space based on policy
parameterization to solve reinforcement learning problems. Inspired by the similar update pattern of softmax
natural policy gradient and Hadamard policy gradient, we propose to study a general policy update rule called
$\phi$-update, where $\phi$ refers to a scaling function on advantage functions. Under very mild conditions on $\phi$,
the global asymptotic state value convergence of $\phi$-update is firstly established. Then we show that the policy
produced by $\phi$-update indeed converges, even when there are multiple optimal policies. This is in stark contrast
to existing results where explicit regularizations are required to guarantee the convergence of the policy. The exact
asymptotic convergence rate of state values is further established based on the policy convergence. Lastly, we establish
the global linear convergence of $\phi$-update.