GitHub - zyman1000/task13_individual

understanding neural networks:

To understand neural networks, I got most of my knowledge from the book recommended in the task PDF, and these are the keypoints I learnet:
based on what i learn in chapter 1, the expected accuracy should be over 96%
The book mainly talks about 2 types of artificial neorns, Perceptrons and sigmoids, both of them output the result based on comparing inputs, weights and biases to a threshhold but there is one critical difference that makes sigmoids a much better choice

perceptrons:

The output is 0 if W.X + B <= 0, and 1 if W.X + b > 0
where W is the wieght, X is the input, and B is the bias
more weight means the input is more important
bias makes the node biased towards a certain output (easier to reach an output)
both weight and bias can be negative

main disadvantage:

if the neural network has multiple layers (input + hidden + output), then making small modifications to the weights and biases of some nodes can change the final output drastically from 0 to 1 or from 1 to 0

sigmoid neurons:

The output of a sigmoid neuron is binary like the output of a perceptron, it could be 1, 0 or anything in between based on the sigmoid/logistic function.
at the end, we are going to deal with an output >= 0.5 as true and an output < 0.5 as false, so what is the difference? when there are multiple layers, each layer will get a more accurate input from the layer before it

implementing the neural network

in order to implement a nerual network, we need to find the appropriate values for the weights and biases, we can do so using a cost/objective function:

C(w,b) = 1/2n Σ||y(x) - a||^2

WHERE y(x) is the output we want to acheive and a is the actual output

if the output we want is 5 then y(x) = (0,0,0,0,0,1,0,0,0,0)
To find the minimum value for the cost/objective function, we need to apply a technique called the gradient descent

code:

we start with random values for biases and weights, and the network will try to get it as close to the ideal value as it can

class network(object): -> making a class for the network

'def _init_(self,sizes):' -> a constructor taking a list called size as an argument

each element in this list represents the number of nodes in a layer

self.biases = [np.random.randn(y,1) for y in sizes[1:]' -> this makes a matrix of random numbers for each element(layer) in the list of size (number of nodes in the layer)x(1)

self.weights = [np.random.randn(y, x) for x, y in zip(sizes[:-1], sizes[1:])]

-> zip(sizes[:-1], sizes[1:]), this part combines the 2 lists making a new list with pairs of values where the first value in the pair is from the first list, and the second value is from the second list, the result is a new list where each element is a pair of 2 elements from the sizes list

-> the entire line makes a matrix of size y,x of random elements, for each pair of elements from the sizes list, what is the purpose? this makes a unique random weight for each output from the y layer that is considered an input for a node in the x layer

Honest note:

I could'nt continue working in the task, I have no excuse, I was just tired

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
test.txt		test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

understanding neural networks:

perceptrons:

main disadvantage:

sigmoid neurons:

implementing the neural network

code:

Honest note:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

understanding neural networks:

perceptrons:

main disadvantage:

sigmoid neurons:

implementing the neural network

code:

Honest note:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages