How to construct Kolmogorov-Arnold Network based on MATLAB’s Deep Learning Toolbox
Kolmogorov-Arnold Networks (KANs) are theoretically grounded alternatives to conventional Multilayer Perceptrons (MLPs). KANs are built upon the Kolmogorov-Arnold representation theorem, a profound mathematical result stating that any multivariate continuous function can be expressed through a finite composition of univariate continuous functions and binary additions. This theoretical guarantee takes concrete form in the equation:
where and are univariate continuous functions. The structure corresponds to a two-layer network: the first layer applies the inner functions , while the second layer applies the outer functions , with layer widths and , respectively. Each univariate function using B-spline curves with learnable coefficients.
This leads to a fundamentally different network architecture compared to MLPs. As illustrated in the following figure, traditional MLPs apply fixed nonlinear activations to weighted sums at each node, with learning concentrated in edge weights. In contrast, KANs invert this paradigm: nodes perform simple summations, and the learning occurs on the edges, where each connection is associated with a trainable B-spline activation function.
Can anyone has some advices about how to construct this KAN using Deep Learning Toolbox ?Kolmogorov-Arnold Networks (KANs) are theoretically grounded alternatives to conventional Multilayer Perceptrons (MLPs). KANs are built upon the Kolmogorov-Arnold representation theorem, a profound mathematical result stating that any multivariate continuous function can be expressed through a finite composition of univariate continuous functions and binary additions. This theoretical guarantee takes concrete form in the equation:
where and are univariate continuous functions. The structure corresponds to a two-layer network: the first layer applies the inner functions , while the second layer applies the outer functions , with layer widths and , respectively. Each univariate function using B-spline curves with learnable coefficients.
This leads to a fundamentally different network architecture compared to MLPs. As illustrated in the following figure, traditional MLPs apply fixed nonlinear activations to weighted sums at each node, with learning concentrated in edge weights. In contrast, KANs invert this paradigm: nodes perform simple summations, and the learning occurs on the edges, where each connection is associated with a trainable B-spline activation function.
Can anyone has some advices about how to construct this KAN using Deep Learning Toolbox ? Kolmogorov-Arnold Networks (KANs) are theoretically grounded alternatives to conventional Multilayer Perceptrons (MLPs). KANs are built upon the Kolmogorov-Arnold representation theorem, a profound mathematical result stating that any multivariate continuous function can be expressed through a finite composition of univariate continuous functions and binary additions. This theoretical guarantee takes concrete form in the equation:
where and are univariate continuous functions. The structure corresponds to a two-layer network: the first layer applies the inner functions , while the second layer applies the outer functions , with layer widths and , respectively. Each univariate function using B-spline curves with learnable coefficients.
This leads to a fundamentally different network architecture compared to MLPs. As illustrated in the following figure, traditional MLPs apply fixed nonlinear activations to weighted sums at each node, with learning concentrated in edge weights. In contrast, KANs invert this paradigm: nodes perform simple summations, and the learning occurs on the edges, where each connection is associated with a trainable B-spline activation function.
Can anyone has some advices about how to construct this KAN using Deep Learning Toolbox ? deep learning, kan, neural networks MATLAB Answers — New Questions









