Академический Документы
Профессиональный Документы
Культура Документы
HANDWRITING
RECOGNITION
IS THE SCIENCE OF TEACHING
COMPUTER VISION COMPUTERS HOW TO SEE
HANDWRITING
RECOGNITION
OBJECT
RECOGNITION
IS THE SCIENCE OF TEACHING
COMPUTER VISION COMPUTERS HOW TO SEE
HANDWRITING OBJECT
RECOGNITION RECOGNITION
FACE
DETECTION
IS THE SCIENCE OF TEACHING
COMPUTER VISION COMPUTERS HOW TO SEE
CLASSIFY IT AS ONE OF
THE DIGITS BETWEEN 0-9
HANDWRITTEN DIGIT RECOGNITION
YOU ARE GIVEN AN IMAGE OF A CLASSIFY IT AS ONE OF
HANDWRITTEN DIGIT
THE DIGITS BETWEEN 0-9
THE BASIC IDEA IS THAT THE INPUT IS THE NETWORK HERE USUALLY
FED TO A NETWORK WHICH PROCESSES REFERS TO AN ARTIFICIAL
IT AND THEN PRODUCES AN OUTPUT NEURAL NETWORK
LAYER 1
LAYER 2
LAYER 3
INSIDE THE NETWORK THERE ARE
LAYERS INPUT
NETWORK OUTPUT
LAYER 2
LAYER 3
INPUT NETWORK OUTPUT
LETS START WITH THE SIMPLEST POSSIBLE
ARTIFICIAL NEURAL NETWORK
A PERCEPTRON
LETS START WITH THE SIMPLEST POSSIBLE
ARTIFICIAL NEURAL NETWORK A PERCEPTRON
HERE IS HOW A PERCEPTRON CAN BE REPRESENTED AS A NETWORK
. WN
XN EACH INPUT IN THE FEATURE VECTOR IS
MULTIPLIED BY A WEIGHT AND FED TO AN
OUTPUT NODE
LETS START WITH THE SIMPLEST POSSIBLE
ARTIFICIAL NEURAL NETWORK A PERCEPTRON
HERE IS HOW A PERCEPTRON CAN BE REPRESENTED AS A NETWORK
X1
X2
X3
X4
.
XN-1
XN
EACH INPUT IS MULTIPLIED BY A WEIGHT AND
FED INTO A NODE THAT PROCESSES THEM
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
.
XN-1
XN
INPUT LAYER
EACH SET OF NODES IS CALLED A LAYER
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
.
OUTPUT LAYER
XN-1
XN
EACH SET OF NODES IS CALLED A LAYER
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
.
XN-1
HIDDEN LAYER
XN
THERE CAN BE ANY NUMBER OF HIDDEN LAYERS,
EACH ONES OUTPUTS FEEDING INTO THE NEXT
THINK OF AN ORACLE LOOKING AT EACH NODE IN THE HIDDEN LAYER
TEA LEAVES INTERPRETS THE INPUTS IN SOME
WAY AND PRODUCES A SIGN
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
. EACH OUTPUT IS LIKE AN ORACLE-
XN-1 LOOKING AT THESE SIGNS AND
PREDICTING AN OUTCOME
XN
DIFFERENT ORACLES INTERPRET
THE SIGNS IN DIFFERENT WAYS
EACH SET OF NODES IS CALLED A LAYER
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
OUTPUT LAYER
.
HIDDEN LAYER
XN-1
THIS KIND OF NETWORK, IS CALLED EACH LAYER FEEDS
XN A FEED-FORWARD NEURAL INTO THE NEXT LAYER
INPUT LAYER NETWORK
EACH SET OF NODES IS CALLED A LAYER
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
OUTPUT LAYER
.
HIDDEN LAYER
XN-1
IF EVERY NODE IN THE PREVIOUS
XN LAYER IS CONNECTED TO THE LAYER
INPUT LAYER - ITS CALLED A DENSE LAYER
EACH SET OF NODES IS CALLED A LAYER
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
OUTPUT LAYER
.
HIDDEN LAYER
XN-1
XN TO AVOID OVERFITTING, YOU
CAN FEED ONLY SOME OF THE THIS IS CALLED
INPUT LAYER
INPUTS FROM ONE LAYER TO
EACH NODE IN THE NEXT LAYER
DROPOUT
EACH SET OF NODES IS CALLED A LAYER
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
OUTPUT LAYER
.
HIDDEN LAYER
XN-1
THE OBJECTIVE IN TRAINING
XN EACH EDGE IN THE NETWORK
IS TO LEARN THE VALUES OF
INPUT LAYER REPRESENTS A WEIGHT
THESE WEIGHTS
EACH SET OF NODES IS CALLED A LAYER
X1
X2 OUTPUT 1
X3
X4
.
OUTPUT 2
OUTPUT LAYER
.
HIDDEN LAYER
XN-1
LETS GO BACK TO OUR DIGIT
XN RECOGNITION PROBLEM
INPUT LAYER
HANDWRITTEN DIGIT RECOGNITION
MNIST IS A DATABASE OF HANDWRITTEN DIGIT IMAGES
THESE IMAGES CAN BE USED TO TRAIN A DIGIT CLASSIFIER
.
WELL START WITH
X783 AN INPUT LAYER OF
X784 784 NODES
LETS WALK THROUGH THE FEED THESE INPUTS TO A HIDDEN
COMPLETE TRAINING PROCESS NOW LAYER (OR MORE THAN 1 HIDDEN
LAYER)
X1 THE DIGIT IS 0 (0/1)
HIDDEN LAYER 1
X2
X3 THE DIGIT IS 1 (0/1)
X4 THE DIGIT IS 2 (0/1)
.
X783
X784 THE OUTPUT LAYER WILL HAVE 10
NODES , REPRESENTING 0/1 FOR
EACH DIGIT
LETS WALK THROUGH THE
COMPLETE TRAINING PROCESS NOW DURING TRAINING, THE OBJECTIVE IS TO FIND
THE WEIGHTS FOR EACH LAYER, WHICH
MINIMIZE THE ERROR ON THE TRAINING SET
X1
1) INITIALIZE SOME RANDOM
HIDDEN LAYER 1
X2 WEIGHTS
X3 2) FIND THE OUTPUT
X4
3) EVALUATE SOME ERROR FUNCTION
. EX: (DESIRED OUTPUT - CURRENT
.
OUTPUT)
4) ADJUST THE WEIGHTS BY A SMALL
X783 INCREMENT
X784
5) REPEAT STEPS 2-4 UNTIL THE ERROR IS
MINIMIZED (OR A CERTAIN NUMBER OF
TIMES)