The gradient for a particular node is the derivative of that node's activation function, evaluated at the node, times the difference between the target output value and the computed output value. But if you assume you want to minimize mean cross ...
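The first sentence above, the derivative times (target minus computed), can be sketched in Python as follows. This is only an illustration under stated assumptions: the excerpt does not say which activation function is used, so a logistic sigmoid is assumed here, and the names `sigmoid` and `output_node_gradient` are hypothetical. It does not cover the case the truncated second sentence goes on to describe.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid; an assumed activation, not taken from the source."""
    return 1.0 / (1.0 + math.exp(-x))


def output_node_gradient(pre_activation: float, target: float) -> float:
    """Gradient term for one node: derivative * (target - computed).

    pre_activation is the node's weighted input sum before the activation
    function is applied (a hypothetical parameter name).
    """
    computed = sigmoid(pre_activation)
    # Derivative of the sigmoid, written in terms of its own output.
    derivative = computed * (1.0 - computed)
    return derivative * (target - computed)


if __name__ == "__main__":
    # A node whose weighted input sum is 0.0, with a target of 1.0.
    print(output_node_gradient(0.0, 1.0))
```

Note that the gradient is zero when the computed output already equals the target, and its sign follows the (target minus computed) convention used in the sentence above.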