Normalization and scaling in neural networks

Wednesday, January 28, 2015 3:40:00 PM

Hello everybody.

I'm taking a Coursera course about neural networks.

Today I discovered for myself the reason why normalization and scaling in neural networks provide faster learning. Everything is related to the error surface and optimization. To put it simply, the task of training a neural network is to find a global minimum on the error surface. Learning algorithms for neural networks move gradually along the error surface in order to finally reach its global minimum. Reaching the minimum of a circular error surface is faster than reaching it on an elliptical or otherwise elongated one, because on a circle the gradient points straight at the minimum, while on an elongated ellipse it points mostly across the valley.
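The difference between a circle and an ellipse can be tried out on a toy example. The sketch below is only an illustration under assumptions of mine: the error surface is an idealized quadratic bowl, the function name steps_to_converge is hypothetical, and the curvature values 1 and 100 are picked arbitrarily.

```python
def steps_to_converge(curvatures, start=(1.0, 1.0), tol=1e-6, max_steps=100_000):
    """Run gradient descent on E(w) = 0.5 * sum(c_i * w_i**2) and count the steps."""
    # A safe step size: anything above 2 / (largest curvature) would diverge
    # along the steepest axis, so the steepest axis dictates the step for all axes.
    lr = 1.0 / max(curvatures)
    w = list(start)
    for step in range(max_steps):
        if max(abs(v) for v in w) < tol:
            return step
        # dE/dw_i = c_i * w_i
        w = [v - lr * c * v for v, c in zip(w, curvatures)]
    return max_steps


# Equal curvature in both directions: circular contours.
circle_steps = steps_to_converge((1.0, 1.0))
# Curvature 100 times larger along the second axis: elongated ellipse.
ellipse_steps = steps_to_converge((1.0, 100.0))

print("circle: ", circle_steps, "steps")
print("ellipse:", ellipse_steps, "steps")
```

On the ellipse the step size has to stay small enough for the steep direction, so progress along the shallow direction is slow, and the number of steps grows accordingly.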

Suppose we have a training set for a neural network with two samples:

101, 101 -> 2

101, 99 -> 0

Then the error surface will be an elongated ellipse, and convergence will be relatively slow. But if we normalize the data from the range [99; 101] to the range [-1; 1] by subtracting 100 from each input, we get a circular error surface, on which learning converges much faster.

See the picture:
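The shape of the error surface for these two samples can also be checked numerically. The sketch below assumes a single linear neuron y = w1*x1 + w2*x2 without bias and a squared error, which the post does not state explicitly. The helper name curvature_ratio is hypothetical. For such a neuron the curvature of the error surface depends only on the inputs, not on the targets.

```python
import math


def curvature_ratio(samples):
    """Largest / smallest curvature of the squared-error surface of a
    linear neuron y = w1*x1 + w2*x2 (no bias) on the given input pairs."""
    # The Hessian of 0.5 * sum((t - y)**2) with respect to the weights is
    # X^T X, here the symmetric 2x2 matrix [[a, b], [b, d]].
    a = sum(x1 * x1 for x1, _ in samples)
    b = sum(x1 * x2 for x1, x2 in samples)
    d = sum(x2 * x2 for _, x2 in samples)
    # Closed-form eigenvalues of a symmetric 2x2 matrix.
    mean = (a + d) / 2
    spread = math.hypot((a - d) / 2, b)
    return (mean + spread) / (mean - spread)


raw = [(101, 101), (101, 99)]
# Assumed normalization: subtract 100 from every input.
shifted = [(x1 - 100, x2 - 100) for x1, x2 in raw]

raw_ratio = curvature_ratio(raw)
shifted_ratio = curvature_ratio(shifted)

print("raw inputs:     curvature ratio =", raw_ratio)
print("shifted inputs: curvature ratio =", shifted_ratio)
```

A ratio of 1 means circular contours. The larger the ratio, the more elongated the ellipse.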

The same is true for scaling. Let's say we have two training cases, each with two inputs and one output:

0.1, 10 -> 2

0.1, -10 -> 0

If we scale the inputs so that both have a comparable magnitude, the error surface changes in the same way: the elongated ellipse becomes a circle.
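To get a feeling for how much the scaling matters, one can train a tiny model on both versions of the data and count the epochs. This is a rough sketch under my own assumptions: a linear neuron without bias, batch gradient descent, and rescaled inputs (1, 1) and (1, -1), obtained by multiplying the first input by 10 and dividing the second by 10. The function name epochs_to_fit is hypothetical.

```python
def epochs_to_fit(samples, targets, tol=1e-3, max_epochs=1_000_000):
    """Train y = w1*x1 + w2*x2 by batch gradient descent and return the
    number of epochs until every prediction is within tol of its target."""
    # Step size 1 / trace(X^T X): never larger than 1 / (largest curvature),
    # so the descent cannot diverge.
    lr = 1.0 / sum(x1 * x1 + x2 * x2 for x1, x2 in samples)
    w1 = w2 = 0.0
    for epoch in range(max_epochs):
        errors = [w1 * x1 + w2 * x2 - t for (x1, x2), t in zip(samples, targets)]
        if max(abs(e) for e in errors) < tol:
            return epoch
        # Gradient of 0.5 * sum(error**2) with respect to each weight.
        g1 = sum(e * x1 for e, (x1, _) in zip(errors, samples))
        g2 = sum(e * x2 for e, (_, x2) in zip(errors, samples))
        w1 -= lr * g1
        w2 -= lr * g2
    return max_epochs


targets = [2, 0]
raw = [(0.1, 10), (0.1, -10)]
# Assumed rescaling: first input times 10, second input divided by 10.
scaled = [(x1 * 10, x2 / 10) for x1, x2 in raw]

raw_epochs = epochs_to_fit(raw, targets)
scaled_epochs = epochs_to_fit(scaled, targets)

print("raw inputs:   ", raw_epochs, "epochs")
print("scaled inputs:", scaled_epochs, "epochs")
```

With the raw inputs the weight of the small input learns very slowly, because the step size is limited by the large input. After rescaling both weights learn at the same rate.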
