
This Research Paper allowed me to 10x my salary

This paper was discussed during my interview at Goldman. It completely changed my life, and I used it in my first project as well.

This research paper changed my fucking life.

https://arxiv.org/pdf/1506.03134

Data Scientists on Grapevine · by Gooner7 · Goldman Sachs
This Research Paper changed my life forever.

It was one of the papers discussed in my interview at Goldman. I came to know about this research paper a few years back after consulting a friend doing an ML PhD at the University of Maryland, College Park. The explanation of the paper (a minimal code sketch follows the link below):

1. Initialize the neural network with small random values, typically in (-0.1, 0.1), to avoid symmetry issues.
2. Get ready for forward propagation: pass the training data through the multilayer perceptron and compute the output. For each neuron in the MLP, calculate the weighted sum of its inputs and apply the activation function (my favourite is tanh for LSTM applications).
3. Compute the loss between the computed output and the actual value, using a loss function such as mean squared error.
4. Now do backpropagation: calculate the gradient of the loss function with respect to each weight by propagating the error backward through the network.
5. That is, compute the partial derivatives of the loss with respect to each weight, starting from the output layer and moving back to the input layer.
6. Here is the fun part: update the weights using the gradients obtained from the backward pass. People usually use the Adam optimizer, which accelerates stochastic gradient descent. Fun trivia: Adam stands for "Adaptive Moment Estimation".
7. Repeat the forward and backward passes for many iterations until the performance of the model stabilizes.

https://www.iro.umontreal.ca/~vincentp/ift3395/lectures/backprop_old.pdf
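
Here is a minimal NumPy sketch of the steps above: small random initialization, a tanh hidden layer, MSE loss, hand-written backpropagation, and an Adam update. The layer sizes, toy data, and hyperparameters are illustrative choices of mine, not anything from the lecture notes or the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Step 1: initialize weights with small random values in (-0.1, 0.1) to break symmetry.
n_in, n_hidden, n_out = 4, 8, 1
W1 = rng.uniform(-0.1, 0.1, (n_in, n_hidden))
b1 = np.zeros(n_hidden)
W2 = rng.uniform(-0.1, 0.1, (n_hidden, n_out))
b2 = np.zeros(n_out)
params = [W1, b1, W2, b2]

# Adam state: first and second moment estimates for every parameter.
m = [np.zeros_like(p) for p in params]
v = [np.zeros_like(p) for p in params]
beta1, beta2, lr, eps = 0.9, 0.999, 1e-3, 1e-8

# Toy regression data (illustrative only).
X = rng.normal(size=(64, n_in))
y = np.sin(X.sum(axis=1, keepdims=True))

for t in range(1, 2001):
    # Step 2: forward pass - weighted sums plus tanh activation in the hidden layer.
    h = np.tanh(X @ W1 + b1)
    y_hat = h @ W2 + b2                     # linear output for regression

    # Step 3: mean squared error between prediction and target.
    loss = np.mean((y_hat - y) ** 2)

    # Steps 4-5: backward pass - propagate the error and compute dLoss/dWeight
    # layer by layer, from the output back to the input.
    n = X.shape[0]
    d_yhat = 2.0 * (y_hat - y) / n          # dLoss/dy_hat
    dW2 = h.T @ d_yhat
    db2 = d_yhat.sum(axis=0)
    d_h = d_yhat @ W2.T
    d_hpre = d_h * (1.0 - h ** 2)           # tanh'(x) = 1 - tanh(x)^2
    dW1 = X.T @ d_hpre
    db1 = d_hpre.sum(axis=0)
    grads = [dW1, db1, dW2, db2]

    # Step 6: Adam update - bias-corrected running moments of the gradients.
    for i, (p, g) in enumerate(zip(params, grads)):
        m[i] = beta1 * m[i] + (1 - beta1) * g
        v[i] = beta2 * v[i] + (1 - beta2) * g ** 2
        m_hat = m[i] / (1 - beta1 ** t)
        v_hat = v[i] / (1 - beta2 ** t)
        p -= lr * m_hat / (np.sqrt(v_hat) + eps)   # in-place update of W1, b1, W2, b2

    # Step 7: repeat until the loss stops improving.
    if t % 500 == 0:
        print(f"step {t}: loss = {loss:.4f}")
```

Running this should show the printed loss shrinking over the 2000 iterations; swapping in a different activation or optimizer only changes the lines marked for steps 2 and 6.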
