Home >Technology peripherals >AI >I used ChatGPT to write a neural network: I didn't change a word, and the result turned out to be very useful.
Since its launch at the end of last year, the conversational AI model ChatGPT has become popular throughout the community.
ChatGPT is indeed an amazing tool, like a "Pandora's Box". Once you find the right way to open it, you may find that you can't live without it anymore.
As an all-rounder, people have various requests for ChatGPT. Some people use it to write papers, and some people use it to chat. These are common ways to play. Open your mind a little more. Since ChatGPT is the "king" of AI, will it write an AI?
Recently, a blogger in the field of machine learning had a sudden idea. He decided to let ChatGPT build a neural network and use Keras to solve the MNIST problem.
MNIST is a classic problem in the field of machine learning. Even though this handwritten digit data set has gradually faded out of the field of computer vision, its commemorative significance cannot be ignored. The first data set for many people to get started with deep learning and the first experiment to verify algorithms are MNIST.
Without saying much, I started working as soon as I thought of it. The author simply asked ChatGPT to write a piece of code that looked perfect without even having to change anything.
This is a great start.
ChatGPT chose categorical cross-entropy as the loss function. The author proposed to rewrite the categorical cross-entropy code, and ChatGPT replaced it with sparse categorical cross-entropy. The updated code can still work. It can be found that the new loss function gives up the classification conversion of labels.
The code generated by ChatGPT uses fully connected layers, but the author wants to use convolutional layers. Therefore, ChatGPT modified the code as instructed and added a Conv2D and a MaxPooling layer.
Then it came to the stage of evaluating the final model, where the author had ChatGPT write the code using the test set.
The results look great, and every ChatGPT reply comes with a full explanation. For example, in this evaluation, it is explained like this:
The "Evaluate" method takes the test data and test labels as parameters and returns the values specified during model compilation (here, accuracy) Loss and Measurement. The code snippet above outputs the accuracy of the test, which should give you an idea of how well the model generalizes to new data.
The model used the entire test set as validation data during training. The authors then asked ChatGPT to change this to 20% of the training data.
Additionally, the author wanted smaller batches and ran the code for fewer epochs. Although ChatGPT is not necessary for this step, he does not want to change the code without updating the ChatGPT context.
So this task still falls on ChatGPT:
Next, the author intends to plot the training and test loss during the training process. ChatGPT's suggestion is that the line fitting the model needs to be changed to capture its resulting value.
To show some examples from the dataset, the author asked ChatGPT to write code to output a combination of images and tags . The code for these outputs is also perfect and comes with a collection of 20 images.
When building a model, it is also necessary to look at its structure. If you ask ChatGPT this question, the reply is:
## ChatGPT gives a summary of the model:
A summary of the model is useful, but the author would have liked to see a diagram showing the structure of the model. So continue to ask:
##The result is satisfactory. Finally, we are ready to deploy this model and let ChatGPT Save the model to disk:Now, the author wants to create a class that uses the saved model for prediction. This prompt is interesting and the solution is perfect.
Now write an example that uses the class of the predictor to predict the labels of 10 random images:
In order to accomplish this, the author asked ChatGPT to display a confusion matrix:
Should be said or not, The style used by ChatGPT is really nice.
After completing all the experiments, the author published all the codes generated by ChatGPT. You can also try it:
Address: https://colab.research.google.com/drive/1JX1AVIfGtIlnLGqgHrK6WPylPhZvu9qe?usp=sharing
The above is the detailed content of I used ChatGPT to write a neural network: I didn't change a word, and the result turned out to be very useful.. For more information, please follow other related articles on the PHP Chinese website!