Kinder Chen · ReLU Activation Function Variants · 2 min read · Dec 29, 2021
The ReLU activation function suffers from a problem known as dying ReLUs: during training, some neurons effectively die, meaning they…
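The dying-ReLU problem mentioned in this teaser is commonly addressed by leaky variants. A minimal NumPy sketch (my own illustration, not code from the post) comparing plain ReLU with leaky ReLU, where `alpha` is an assumed small slope for negative inputs:

```python
import numpy as np

def relu(z):
    # Standard ReLU: zero for all negative inputs, so a neuron whose
    # weighted sum stays negative gets zero gradient and can "die".
    return np.maximum(0.0, z)

def leaky_relu(z, alpha=0.01):
    # Leaky ReLU: a small slope alpha on negative inputs keeps the
    # gradient nonzero, so the neuron can keep learning.
    return np.where(z < 0, alpha * z, z)

z = np.array([-2.0, 0.0, 1.5])
print(relu(z))        # [0.  0.  1.5]
print(leaky_relu(z))  # [-0.02  0.    1.5 ]
```

The only difference is the nonzero negative slope; other variants (ELU, SELU) change the negative branch in other ways.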
Kinder Chen · The Vanishing/Exploding Gradients Problems · 2 min read · Nov 4, 2021
Gradients often get smaller and smaller as the algorithm progresses down to the lower layers. As a result, the Gradient Descent update…
Kinder Chen · Loss Function in Deep Learning · 2 min read · Oct 11, 2021
In the context of an optimization algorithm, the function used to evaluate a candidate solution (i.e. a set of weights) is referred to as…
Kinder Chen · Hidden Layer Activation Functions · 2 min read · Oct 10, 2021
This blog introduces the three most commonly used activation functions in hidden layers: Rectified Linear Activation (ReLU), Logistic (Sigmoid)…
Kinder Chen · Activation Functions in Neural Networks · 2 min read · Oct 9, 2021
An activation function in a neural network defines how the weighted sum of the input is transformed into an output from a node or nodes in…
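The definition in this teaser (activation applied to the weighted sum of a node's inputs) can be sketched in a few lines of NumPy; this is my own illustration, and the `tanh` choice and variable names are assumptions, not taken from the post:

```python
import numpy as np

def node_output(x, w, b, activation=np.tanh):
    # z is the weighted sum of the inputs plus a bias term;
    # the activation function transforms z into the node's output.
    z = np.dot(w, x) + b
    return activation(z)

x = np.array([1.0, 2.0])   # inputs to the node
w = np.array([0.5, -0.25]) # connection weights
print(node_output(x, w, b=0.0))  # tanh(0.5*1 - 0.25*2 + 0) = tanh(0) = 0.0
```

Swapping `activation` for a sigmoid or ReLU changes only the nonlinearity, not the weighted-sum step.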
Kinder Chen · Forward & Backward Propagation · 2 min read · Oct 8, 2021
Neural Networks have two major processes: Forward Propagation and Back Propagation. During Forward Propagation, we start at the input layer…
Kinder Chen · Multilayer Perceptron · 2 min read · Oct 8, 2021
An MLP (Multilayer Perceptron) is composed of one passthrough input layer, one or more layers of TLUs (threshold logic units), called…
Kinder Chen · Perceptron · 2 min read · Oct 8, 2021
An ANN (artificial neural network) is a Machine Learning model inspired by the networks of biological neurons found in the brain. An…
Kinder Chen · AIC and BIC · 2 min read · Oct 6, 2021
This blog introduces two measures: AIC (Akaike information criterion) and BIC (Bayesian information criterion), which give a…
Kinder Chen · Gaussian Mixture Model · 2 min read · Oct 5, 2021
A Gaussian mixture model (GMM) is a probabilistic model that assumes that the instances were generated from a mixture of several Gaussian…