Mehler’s Formula, Branching Process, and Compositional Kernels of Deep Neural Networks

We utilize a connection between compositional kernels and branching processes via Mehler’s formula to study deep neural networks. This new probabilistic insight provides us a novel perspective on the mathematical role of activation functions in compositional neural networks. We study the unscaled and rescaled limits of the compositional kernels and explore the different phases of the limiting behavior, as the compositional depth increases. We investigate the memorization capacity of the compositional kernels and neural networks by characterizing the interplay among compositional depth, sample size, dimensionality, and non-linearity of the activation. Explicit formulas on the eigenvalues of the compositional kernel are provided, which quantify the complexity of the corresponding reproducing kernel Hilbert space. On the methodological front, we propose a new random features algorithm, which compresses the compositional layers by devising a new activation function.

View Journal Publication View Working Paper View On SSRN

Related People

Tengyuan Liang

Research Briefs

BFI Data Studio

Podcasts

Videos

Upcoming Events

2025 IO+ Conference

2025 Relational Contracts Conference

Mehler’s Formula, Branching Process, and Compositional Kernels of Deep Neural Networks

More on this topic

The Promise of Digital Technology and Generative AI for Supporting Parenting Interventions in Latin America

Chat2Learn: A Proof-of-Concept Evaluation of a Technology-Based Tool to Enhance Parent-Child Language Interaction

Artificial Writing and Automated Detection