Related Question 1
Which activation function is commonly used in hidden layers of deep neural networks due to its ability to mitigate vanishing gradient problems?
-
A.
Sigmoid
-
B.
Tanh
-
C.
ReLU
-
D.
Softmax
Correct Answer: C
Explanation:
ReLU (Rectified Linear Unit) is popular for hidden layers because it accelerates convergence by avoiding vanishing gradients.