Transactions on Machine Learning Research, June 2026

A Unified Theory of Sinusoidal Activation Families for Implicit Neural Representations

Alireza Morsali, MohammadJavad Vaez, Mohammadhossein Soltani, Amirhossein Kazerouni, Babak Taati, Morteza Mohammad-Noori

McGill University, The University of Melbourne, MACSYS, University of Toronto, Vector Institute, University Health Network, University of Tehran


Abstract

Implicit Neural Representations (INRs) model continuous signals with compact neural networks and have become a standard tool in vision, graphics, and signal processing. STAF studies a broad family of sinusoidal activations for INRs and instantiates that view with a trainable Fourier-like activation whose amplitudes, frequencies, and phases are learned directly. The paper develops a unified theoretical and practical framework, including a Kronecker-equivalence result, an NTK-based capacity and convergence analysis, and an initialization scheme with unit-variance post-activations. Empirically, STAF is competitive and often stronger on images, audio, shapes, inverse problems, and NeRF.


Key Contributions


Image Representation

On image fitting tasks, STAF produces sharper reconstructions and stronger late-stage convergence than several INR baselines in the showcased examples from the paper.

Image reconstruction comparison and PSNR convergence curve for STAF and baselines.
Featured DIV2K example with qualitative reconstructions and PSNR over training.
Layer-wise activation maps comparing STAF with FINER, SIREN, and WIRE.
Layer-wise activation maps during image reconstruction.

Theory and Initialization

The paper connects trainable sinusoidal activations to both expressive growth and optimization dynamics. The figures below summarize the empirical NTK analysis and the initialization comparison used in the paper.

Empirical NTK eigenfunction comparison for STAF and baselines.
Empirical NTK eigenfunctions.
Empirical NTK eigenvalue spectrum for STAF and baselines.
Empirical NTK eigenvalue spectrum.
STAF initialization figure.
STAF initialization.
SIREN initialization figure.
SIREN reference initialization.

Audio

For audio reconstruction, the paper reports that STAF achieves the highest PSNR and the lowest reconstruction error on the featured Bach cello example.

Audio waveform and error comparison for STAF and baseline methods.
Waveform and reconstruction error comparison from the paper.
Audio samples to be added soon.

Shapes

On 3D shape representation tasks, STAF achieves the strongest average IoU and the lowest average Chamfer distance among the reported methods.

3D shape reconstruction comparison for STAF and baselines.
Shape reconstruction comparisons from the paper.
Interactive shape demos to be added soon.

Inverse Problems

The paper also evaluates STAF on super-resolution and denoising, where the model is particularly effective at recovering fine structure while remaining competitive in efficiency.

Super-resolution comparison figure for STAF and baseline methods.
4× super-resolution example.
Image denoising comparison figure for STAF and baseline methods.
Denoising example under severe low-light corruption.

NeRF

In the paper’s NeRF experiments without positional encodings, STAF remains competitive and often stronger in PSNR while also improving perceptual quality on several scenes.

NeRF novel-view synthesis comparison for STAF and baseline methods.
Novel-view synthesis comparison with zoomed detail crops.
Rendered NeRF videos to be added soon.

BibTeX

@article{morsali2026staf,
  title   = {A Unified Theory of Sinusoidal Activation Families for Implicit Neural Representations},
  author  = {Alireza Morsali and MohammadJavad Vaez and Mohammadhossein Soltani and Amirhossein Kazerouni and Babak Taati and Morteza Mohammad-Noori},
  journal = {Transactions on Machine Learning Research},
  year    = {2026},
  url     = {https://openreview.net/forum?id=ZDmBPYptbL}
}