Neural Networks Fail to Learn Periodic Functions and How to Fix It

Liu Ziyin, T. Hartwig, Masahito Ueda

2020

3 references

Abstract

Previous literature offers limited clues on how to learn a periodic function using modern neural networks. We start with a study of the extrapolation properties of neural networks; we prove and demonstrate experimentally that the standard activations functions, such as ReLU, tanh, sigmoid, along with their variants, all fail to learn to extrapolate simple periodic functions. We hypothesize that this is due to their lack of a "periodic" inductive bias. As a fix of this problem, we propose a new activation, namely, $x + \sin^2(x)$, which achieves the desired periodic inductive bias to learn a periodic function while maintaining a favorable optimization property of the ReLU-based activations. Experimentally, we apply the proposed method to temperature and financial data prediction.

View Paper

🤖 Machine Learning

1 repository

3 references

Code References

▶ huggingface/transformers

3 files

▶ src/transformers/models/qwen2_5_omni/modeling_qwen2_5_omni.py

L3155

- This activation function is a modified version based on this paper by Liu Ziyin, Tilman Hartwig, Masahito Ueda:

▶ src/transformers/models/qwen2_5_omni/modular_qwen2_5_omni.py

L3313

- This activation function is a modified version based on this paper by Liu Ziyin, Tilman Hartwig, Masahito Ueda:

▶ src/transformers/models/qwen3_omni_moe/modeling_qwen3_omni_moe.py

L3652

- This activation function is a modified version based on this paper by Liu Ziyin, Tilman Hartwig, Masahito Ueda:

Link copied to clipboard!