Activation functions in deep learning: A comprehensive survey and benchmark
Article Type
Research Article
Publication Title
Neurocomputing
Abstract
Neural networks have shown tremendous growth in recent years in solving numerous problems. Various types of neural networks have been introduced to deal with different types of problems. However, the main goal of any neural network is to transform non-linearly separable input data into more linearly separable abstract features using a hierarchy of layers. These layers are combinations of linear and non-linear functions. The most popular and common non-linearity layers are activation functions (AFs), such as Logistic Sigmoid, Tanh, ReLU, ELU, Swish, and Mish. In this paper, a comprehensive overview and survey of AFs in neural networks for deep learning is presented. Different classes of AFs, such as Logistic Sigmoid and Tanh based, ReLU based, ELU based, and learning based, are covered. Several characteristics of AFs, such as output range, monotonicity, and smoothness, are also pointed out. A performance comparison is also conducted among 18 state-of-the-art AFs with different networks on different types of data. Insights into AFs are presented to help researchers pursue further work and practitioners select among the different choices. The code used for the experimental comparison is released at: https://github.com/shivram1987/ActivationFunctions.
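For a quick sense of the AFs named in the abstract, the following is a minimal NumPy sketch of their standard formulations. It is an illustration only and is not drawn from the benchmark code released at the GitHub link above; the alpha and beta defaults are the commonly used values, not settings from the paper.

```python
import numpy as np

# Standard formulations of the activation functions listed in the abstract.
def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))                         # output range (0, 1)

def tanh(x):
    return np.tanh(x)                                        # output range (-1, 1)

def relu(x):
    return np.maximum(0.0, x)                                # zero for negative inputs

def elu(x, alpha=1.0):                                       # alpha=1.0 is the common default
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))     # smooth, bounded negative side

def swish(x, beta=1.0):                                      # beta=1.0 gives the SiLU variant
    return x * sigmoid(beta * x)                             # x * sigmoid(beta * x)

def mish(x):
    return x * np.tanh(np.log1p(np.exp(x)))                  # x * tanh(softplus(x))

# Quick check of each function on a small grid of inputs.
x = np.linspace(-3, 3, 7)
for fn in (sigmoid, tanh, relu, elu, swish, mish):
    print(fn.__name__, np.round(fn(x), 3))
```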
First Page
92
Last Page
108
DOI
10.1016/j.neucom.2022.06.111
Publication Date
9-7-2022
Recommended Citation
Dubey, Shiv Ram; Singh, Satish Kumar; and Chaudhuri, Bidyut Baran, "Activation functions in deep learning: A comprehensive survey and benchmark" (2022). Journal Articles. 2963.
https://digitalcommons.isical.ac.in/journal-articles/2963
Comments
Open Access, Green