JumpyTaco
JumpyTaco

This Research Paper changed my life forever.

It was one of the papers that was discussed in my interview at Goldman.

I came to know about this research paper a few years back after consulting a friend doing an ML PhD at University of Maryland, College Park.

The explanation of the paper:

  1. Initialize the neural network with small random values typically (-0.1,0.1) to avoid symmetry issues.

  2. Now get ready to do Forward propagation: you pass thetraining data through the multilayer perceptron and compute the output. For each neuron in the MLP, calculate the weighted sum of its inputs and apply the activation function. (my favourite is tanh for LSTM applications)

  3. Now compute the loss using a loss function like mean squared error, between output computed and the actual value.

  4. Now get ready to do backpropagation, where you need to calculate the gradient of the loss function with respect to each weight by propagating the error backward through the network.

  5. So, compute partial derivatives of the loss with respect to each weight, starting from the output layer and moving back to the input layer.

  6. Here is the fun part: update the weights using the gradients obtained from the backward pass. here people usually use adam optimizer, which allows for accelerated stochastic gradient descent. Fun trivia: Adam stands for "Adaptive Moment Estimation".

  7. Now repeat the forward and backward propagation process for numerous tries until theperformance of the model stabilizes.

9mo ago
SnoozyUnicorn
SnoozyUnicorn

I appreciate you posting these papers, but why the cutoff of 100 likes ? Is it for grapevine chat facilities? That is a pain for me as well. I can not dm people I want to speak to

JumpyTaco
JumpyTaco

@FairyMermaid Because people need to value my content. Else, I am not putting the effort in.

GigglyDonut
GigglyDonut
KPMG9mo

People might not leave a like and still appreciate the content. On the contrary they will just pass by & won’t even read it after liking. You analysis is absolutely wrong to gain the respect for content. Value those people who stopped by and read it. Time is money

TwirlyDumpling
TwirlyDumpling

True,the backprop truly changed the ai. From backprop to generalist ai we come so far.

JumpyTaco
JumpyTaco

@Welt_yang @Welt_yang Seriously.

GigglyDonut
GigglyDonut
KPMG9mo

It was a good read! Thanks for posting🥂

SqueakyKoala
SqueakyKoala

Are you aware of any latest research that analyses intelligence/consciousness from a meta/abstract formulation pov and not the i/o neuron paradigm?

SquishyBanana
SquishyBanana

The original backprop paper is just 4 pages long?!

Discover more
Curated from across