The idea is to take repeated steps in the opposite direction of the gradient (or approximate gradient) of the function at the current point, because this is the direction of steepest descent. Conversely, stepping in the direction of the gradient leads toward a local maximum of that function; the procedure is then known as gradient ascent.
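To make the procedure concrete, here is a minimal sketch in Python (assuming NumPy is available). The objective f(x, y) = (x − 3)² + (y + 1)², its hand-computed gradient, the step size, and the iteration count are all illustrative choices, not taken from the original text:

```python
import numpy as np

def gradient_descent(grad, x0, learning_rate=0.1, n_steps=100):
    """Minimize a function by repeatedly stepping against its gradient."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_steps):
        x = x - learning_rate * grad(x)  # step opposite the gradient
    return x

# Illustrative example: minimize f(x, y) = (x - 3)^2 + (y + 1)^2,
# whose gradient is (2(x - 3), 2(y + 1)) and whose minimum is at (3, -1).
grad_f = lambda x: np.array([2 * (x[0] - 3), 2 * (x[1] + 1)])
print(gradient_descent(grad_f, x0=[0.0, 0.0]))  # approaches [3., -1.]
```

Flipping the sign of the update (x + learning_rate * grad(x)) turns the same loop into gradient ascent toward a local maximum.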