Training a logistic regression model in TensorFlow produces nan results

Question

I want to build a logistic regression model in TensorFlow with the following cost function:

The data set I am using is shown in this screenshot:

My code is as follows: {code...}

The result of running it is shown in this screenshot:

As you can see, after only two iterations both W and b have become nan. What is the problem?

大家讲道理 · Answer

After some searching, I found the problem.

It is in the line that selects the optimizer:

optimizer = tf.train.GradientDescentOptimizer(0.1)

The learning rate of 0.1 is too large here. I did not trace the exact cause, but the most likely explanation is that the oversized updates push W and b so far that the sigmoid output saturates to exactly 0 or 1 in floating point. log(0) then appears in the loss function, and the loss evaluates to nan. The solution is to reduce the learning rate: with a value such as 1e-5 or 1e-6, training proceeds normally. For my own data I set the learning rate to 1e-3, and the program then ran without problems.
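To illustrate how log(0) turns the loss into nan, here is a minimal sketch in plain Python rather than TensorFlow. It is not the code from the question, which is elided above. The function names `sigmoid`, `safe_log`, `naive_loss` and `stable_loss` are hypothetical, and `safe_log` returns -inf for 0 on the assumption that this matches what a tensor library does instead of raising an error. The second loss uses the rearranged form max(z, 0) - z*y + log(1 + exp(-|z|)), which works on the raw score z and never takes the log of a saturated probability.

```python
import math


def sigmoid(z):
    # Branch on the sign of z so that exp() never overflows.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def safe_log(p):
    # Assumption: mimic IEEE behaviour, where log(0) is -inf
    # (math.log(0.0) would raise ValueError instead).
    return math.log(p) if p > 0.0 else float("-inf")


def naive_loss(z, y):
    # Textbook cross-entropy computed from the sigmoid output.
    p = sigmoid(z)
    return -(y * safe_log(p) + (1.0 - y) * safe_log(1.0 - p))


def stable_loss(z, y):
    # Same loss, rearranged to work on the raw score z.
    return max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))


# A large score, as produced by weights that have overshot:
# sigmoid(40.0) rounds to exactly 1.0, so 1 - p is 0.0 and
# (1 - y) * log(1 - p) becomes 0.0 * -inf, which is nan.
print(naive_loss(40.0, 1.0))   # nan
print(stable_loss(40.0, 1.0))  # a tiny finite value
```

In TensorFlow itself, `tf.nn.sigmoid_cross_entropy_with_logits` computes the loss from the raw scores in a numerically stable way, so using it together with a smaller learning rate may be more robust than lowering the learning rate alone.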

Attached is the final fitting result:

