GitHub open source with 130+ stars: a step-by-step guide to reproducing object detection algorithms based on the PPYOLO series

WBOY

Apr 09, 2023 pm 06:41 PM

Object detection is a fundamental task in computer vision, and it is hard to get far without a suitable model zoo.

Today we introduce miemiedetection, a simple and easy-to-use model library of object detection algorithms. It currently has more than 130 stars on GitHub.

Code link: https://github.com/miemie2013/miemiedetection

miemiedetection is a personal detection library developed based on YOLOX. It also supports PPYOLO, PPYOLOv2, PPYOLOE, FCOS and other algorithms.

Thanks to the excellent architecture of YOLOX, training in miemiedetection is very fast, and data loading is no longer the bottleneck of training speed.

The code is written in PyTorch. It implements difficult operators such as the deformable convolution DCNv2 and Matrix NMS, and supports single-machine single-card, single-machine multi-card, and multi-machine multi-card training. Both Windows and Linux are supported, although Linux is recommended for multi-card training.

Because miemiedetection is a detection library that does not require installation, users can edit its code directly to change the execution logic, which also makes it easy to add new algorithms to the library.

The author stated that more algorithm support (and women’s clothing) will be added in the future.

The reproduction is guaranteed to be faithful

The most important thing when reproducing a model is that its accuracy essentially matches the original.

Let's first look at the three models PPYOLO, PPYOLOv2, and PPYOLOE. The author ran loss-alignment and gradient-alignment experiments on all three.

As evidence, the source code still contains commented-out sections that read and write *.npz files; this code is left over from the alignment experiments.

The author also documented the performance-alignment process in detail. For newcomers, following the same path is a good way to learn!
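
An alignment experiment of this kind can be pictured as follows: dump the loss and gradients of one step from the reference implementation into a *.npz file, then load them in the reimplementation and compare. This is only a minimal sketch with NumPy; the array names (loss, grad_conv1), the values, and the tolerance are illustrative assumptions, not what the repository uses.

```python
import io
import numpy as np

def save_reference(buffer, loss, grads):
    """Write the reference loss and gradients into an .npz archive."""
    np.savez(buffer, loss=np.asarray(loss), **grads)

def check_alignment(buffer, loss, grads, atol=1e-5):
    """Return the names of arrays that differ from the saved reference."""
    reference = np.load(buffer)
    candidates = {"loss": np.asarray(loss), **grads}
    mismatched = []
    for name, value in candidates.items():
        if not np.allclose(reference[name], value, atol=atol):
            mismatched.append(name)
    return mismatched

# Hypothetical numbers standing in for one training step of each framework.
buf = io.BytesIO()
save_reference(buf, 1.2345, {"grad_conv1": np.array([0.10, -0.20, 0.30])})

buf.seek(0)
aligned = check_alignment(buf, 1.2345, {"grad_conv1": np.array([0.10, -0.20, 0.30])})
buf.seek(0)
drifted = check_alignment(buf, 1.2345, {"grad_conv1": np.array([0.10, -0.25, 0.30])})
```

Here aligned is empty, while drifted names the one gradient that no longer matches, which is the signal one would chase down layer by layer.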

All training logs are also recorded and stored in the repository, which is enough to demonstrate that the PPYOLO series algorithms were reproduced correctly!

The final training results show that the reproduced PPYOLO algorithm has the same loss and gradients as the original repository.

In addition, the author ran transfer learning on the VOC2012 dataset with both the original repository and miemiedetection, using the same hyperparameters, and obtained the same accuracy.

Everything matches the original implementation: the same learning rate, the same learning-rate decay strategies (warm_piecewisedecay for PPYOLO and PPYOLOv2, warm_cosinedecay for PPYOLOE), the same exponential moving average (EMA), the same data preprocessing, the same L2 weight decay on the parameters, the same loss, the same gradients, and the same pretrained model. As a result, transfer learning reaches the same accuracy.
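
To make the two schedule names and the EMA concrete, here is a minimal sketch of a warmup followed by piecewise decay, a warmup followed by cosine decay, and one EMA step. These are generic textbook formulations; the function names, the default values, and the exact warmup shape are assumptions and may differ from what miemiedetection and PaddleDetection actually implement.

```python
import math

def warm_piecewise_lr(step, base_lr, warmup_steps, milestones, gamma=0.1):
    """Linear warmup, then multiply the rate by gamma at each milestone step."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    passed = sum(1 for m in milestones if step >= m)
    return base_lr * (gamma ** passed)

def warm_cosine_lr(step, base_lr, warmup_steps, total_steps, min_lr=0.0):
    """Linear warmup, then cosine decay from base_lr down to min_lr."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))

def ema_update(ema_weights, weights, decay=0.9998):
    """One exponential-moving-average step over a dict of scalar weights."""
    return {k: decay * ema_weights[k] + (1.0 - decay) * weights[k] for k in weights}

# Hypothetical settings, chosen only to exercise the functions.
lr_start = warm_piecewise_lr(0, 0.01, 1000, [5000])
lr_after_milestone = warm_piecewise_lr(5000, 0.01, 1000, [5000])
lr_cosine_end = warm_cosine_lr(2000, 0.01, 1000, 2000)
ema = ema_update({"w": 1.0}, {"w": 0.0}, decay=0.5)
```

When two implementations are compared, any difference in these curves shows up directly as a difference in loss, which is why the schedules have to match before accuracy can.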

We have done enough experiments and done a lot of testing to ensure that everyone has a wonderful experience!

No 998, no 98: just click Star and take home all of these object detection algorithms for free!

Model download and conversion

To run the models you need their weights. The author provides converted pretrained .pth weight files, which can be downloaded directly from Baidu Netdisk.

Link: https://pan.baidu.com/s/1ehEqnNYKb9Nz0XNeqAcwDw

Extraction code: qe3i

Or follow the steps below to obtain:

Step 1: download the weight file by running the following in the project root directory (this simply downloads a file; Windows users can download the link passed to wget with Thunder or a browser instead. For brevity, only ppyoloe_crn_l_300e_coco is shown as an example):

Note that models with the word "pretrained" in their names are backbone networks pretrained on ImageNet; PPYOLO, PPYOLOv2, and PPYOLOE load these weights to train on the COCO dataset. The rest are models pretrained on COCO.

Step 2: convert the weights by running the following in the project root directory:

The meaning of each parameter is:

- -f is the configuration file to use;

- -c is the source weight file to read;

- -oc is the output (saved) PyTorch weight file;

- -nc is the number of classes in the dataset;

- --only_backbone, when True, converts only the weights of the backbone network;

After execution, the converted *.pth weight file will be obtained in the project root directory.
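
The flags above can be mirrored with argparse as in the sketch below. Only the short flags and --only_backbone come from the article; the long option names, the defaults, and the file paths in the example are assumptions for illustration, and this is not the repository's actual conversion script.

```python
import argparse

def str2bool(text):
    """Interpret common spellings of true/false given on the command line."""
    return str(text).lower() in ("1", "true", "yes")

def make_parser():
    parser = argparse.ArgumentParser(description="weight conversion (illustrative sketch)")
    # Short flags follow the article; long names and defaults are assumptions.
    parser.add_argument("-f", "--exp_file", type=str, help="configuration file to use")
    parser.add_argument("-c", "--ckpt", type=str, help="source weight file to read")
    parser.add_argument("-oc", "--output_ckpt", type=str, help="PyTorch weight file to write")
    parser.add_argument("-nc", "--num_classes", type=int, default=80,
                        help="number of classes in the dataset")
    parser.add_argument("--only_backbone", type=str2bool, default=False,
                        help="convert only the backbone weights when True")
    return parser

# Hypothetical invocation; the paths and file names are placeholders.
args = make_parser().parse_args([
    "-f", "exps/ppyoloe/ppyoloe_crn_l_300e_coco.py",
    "-c", "ppyoloe_crn_l_300e_coco.pdparams",
    "-oc", "ppyoloe_crn_l_300e_coco.pth",
    "-nc", "80",
])
```

For an ImageNet-pretrained backbone one would additionally pass --only_backbone True, so that only the backbone weights are converted.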

Step-by-step tutorial

Most of the following commands will use the model's configuration file, so it is necessary to explain the configuration file in detail at the beginning.

mmdet.exp.base_exp.BaseExp is the base class of the configuration files. It is an abstract class that declares a set of abstract methods, such as get_model(), which says how to obtain the model; get_data_loader(), which says how to obtain the training dataloader; get_optimizer(), which says how to obtain the optimizer; and so on.

mmdet.exp.datasets.coco_base.COCOBaseExp holds the dataset configuration and inherits BaseExp. It provides only the dataset configuration. This repository only supports training on datasets in the COCO annotation format!

Datasets in other annotation formats must be converted to the COCO annotation format before training (supporting many annotation formats would be too much work). Custom datasets can be converted to the COCO annotation format with miemieLabels. All detection algorithm configuration classes inherit COCOBaseExp, which means that all detection algorithms share the same dataset configuration.

The configuration items of COCOBaseExp are:

Among them,

- self.num_classes is the number of classes in the dataset;

- self.data_dir is the root directory of the dataset;

- self.cls_names is the path to the dataset's class-name file. It is a txt file in which each line is one class name. For a custom dataset, create a new txt file, fill in the class names, and point self.cls_names to it;

- self.ann_folder is the root directory of the dataset's annotation files; it must be located inside the self.data_dir directory;

- self.train_ann is the file name of the training-set annotation file; it must be located inside the self.ann_folder directory;

- self.val_ann is the file name of the validation-set annotation file; it must be located inside the self.ann_folder directory;

- self.train_image_folder is the name of the training-set image folder; it must be located inside the self.data_dir directory;

- self.val_image_folder is the name of the validation-set image folder; it must be located inside the self.data_dir directory;
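
The nesting rules in the list above can be sketched as follows. The class is a stand-in, not the real COCOBaseExp; the annotation and image folder names follow the standard COCO 2017 layout, while data_dir and cls_names are assumed values.

```python
import os

class DatasetConfigSketch:
    """Stand-in for the dataset fields of COCOBaseExp (values are assumptions)."""
    def __init__(self):
        self.num_classes = 80
        self.data_dir = "../COCO"                        # hypothetical dataset root
        self.cls_names = "class_names/coco_classes.txt"  # hypothetical path
        self.ann_folder = "annotations"
        self.train_ann = "instances_train2017.json"
        self.val_ann = "instances_val2017.json"
        self.train_image_folder = "train2017"
        self.val_image_folder = "val2017"

def resolve_paths(cfg):
    """Join the fields the way the list describes their nesting."""
    ann_dir = os.path.join(cfg.data_dir, cfg.ann_folder)
    return {
        "train_ann": os.path.join(ann_dir, cfg.train_ann),
        "val_ann": os.path.join(ann_dir, cfg.val_ann),
        "train_images": os.path.join(cfg.data_dir, cfg.train_image_folder),
        "val_images": os.path.join(cfg.data_dir, cfg.val_image_folder),
    }

paths = resolve_paths(DatasetConfigSketch())
```

Printing paths before a long training run is a cheap way to confirm that every file is where the configuration expects it.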

For the VOC 2012 data set, you need to modify the configuration of the data set to:

Alternatively, you can set self.num_classes, self.data_dir, and the other items in a subclass, as exps/ppyoloe/ppyoloe_crn_l_voc2012.py does; the COCOBaseExp configuration is then overridden and no longer takes effect.

After downloading the model mentioned earlier, create a new folder named annotations2 inside the self.data_dir directory of the VOC2012 dataset, and put voc2012_train.json and voc2012_val.json into this folder.
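
Overriding the dataset configuration in a subclass could look like the sketch below. Both classes are stand-ins written for this example; annotations2, voc2012_train.json, and voc2012_val.json come from the text, 20 classes and the JPEGImages folder follow the standard VOC2012 layout, and data_dir and cls_names are assumed values.

```python
class COCOBaseExpSketch:
    """Stand-in base configuration with COCO-style defaults (assumed values)."""
    def __init__(self):
        self.num_classes = 80
        self.data_dir = "../COCO"
        self.cls_names = "class_names/coco_classes.txt"
        self.ann_folder = "annotations"
        self.train_ann = "instances_train2017.json"
        self.val_ann = "instances_val2017.json"
        self.train_image_folder = "train2017"
        self.val_image_folder = "val2017"

class VOC2012ExpSketch(COCOBaseExpSketch):
    """Subclass whose assignments replace the inherited dataset settings."""
    def __init__(self):
        super().__init__()
        self.num_classes = 20
        self.data_dir = "../VOCdevkit/VOC2012"          # hypothetical dataset root
        self.cls_names = "class_names/voc_classes.txt"  # hypothetical path
        self.ann_folder = "annotations2"
        self.train_ann = "voc2012_train.json"
        self.val_ann = "voc2012_val.json"
        self.train_image_folder = "JPEGImages"
        self.val_image_folder = "JPEGImages"

voc_cfg = VOC2012ExpSketch()
```

Because the assignments run after super().__init__(), the base values never reach the training code, which is what "overridden" means here.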

Finally, the placement locations of the COCO data set, VOC2012 data set, and this project should be like this:

The dataset root directory sits at the same level as miemiedetection-master. I do not recommend putting the dataset inside miemiedetection-master, because PyCharm becomes extremely laggy when opening the project; moreover, when several projects (such as mmdetection, PaddleDetection, and AdelaiDet) share the datasets, the dataset path can be set independently of the project name.

mmdet.exp.ppyolo.ppyolo_method_base.PPYOLO_Method_Exp is the class that implements all the abstract methods for the specific algorithm. It inherits COCOBaseExp.

exp.ppyolo.ppyolo_r50vd_2x.Exp is the final configuration class for the ResNet50Vd model of the PPYOLO algorithm; it inherits PPYOLO_Method_Exp.

The PPYOLOE configuration files have a similar structure.
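
The four-level hierarchy described above can be sketched with Python's abc module. All class names ending in Sketch, the method signatures, and the returned placeholder values are assumptions made for this example; only the inheritance chain and the three method names come from the text.

```python
from abc import ABC, abstractmethod

class BaseExpSketch(ABC):
    """Abstract base: declares what every configuration must provide."""
    @abstractmethod
    def get_model(self): ...

    @abstractmethod
    def get_data_loader(self): ...

    @abstractmethod
    def get_optimizer(self): ...

class COCOBaseExpSketch(BaseExpSketch):
    """Adds only the dataset configuration, so it is still abstract."""
    def __init__(self):
        self.num_classes = 80

class MethodExpSketch(COCOBaseExpSketch):
    """Implements every abstract method for one algorithm family."""
    def __init__(self):
        super().__init__()
        self.arch = "generic"      # hypothetical hyperparameter
        self.basic_lr = 0.01       # hypothetical hyperparameter

    def get_model(self):
        return {"arch": self.arch, "num_classes": self.num_classes}

    def get_data_loader(self):
        return []                  # placeholder for a real dataloader

    def get_optimizer(self):
        return {"lr": self.basic_lr}

class Exp(MethodExpSketch):
    """Final configuration for one model: only overrides hyperparameters."""
    def __init__(self):
        super().__init__()
        self.arch = "r50vd"        # hypothetical value

model_spec = Exp().get_model()
```

The split keeps dataset settings in one place for all algorithms while each final Exp class stays a short list of overrides.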

Prediction

If the input is a single image, run the following in the project root directory:

The meaning of each parameter is:

- -f is the configuration file to use;

- -c is the weight file to read;

- --path is the path of the image;

- --conf is the score threshold; only prediction boxes with a score above this threshold are drawn;

- --tsize is the resolution to which the image is resized during prediction;

When prediction finishes, the console prints the path where the result image was saved, so you can open and view it. To predict with a model that you trained on a custom dataset, just change -c to the path of your model.
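
The effect of --conf can be sketched in a few lines: detections whose score does not exceed the threshold are dropped before drawing. The tuple layout (box, score, label) and the sample values are assumptions for illustration, not the repository's data structures.

```python
def filter_by_score(detections, conf):
    """Keep only (box, score, label) detections whose score is above conf."""
    return [det for det in detections if det[1] > conf]

# Hypothetical detections: box as (x1, y1, x2, y2), then score, then label.
detections = [
    ((10, 20, 110, 220), 0.92, "person"),
    ((30, 40, 90, 100), 0.10, "dog"),
    ((5, 5, 50, 50), 0.31, "cat"),
]
kept = filter_by_score(detections, conf=0.15)
```

Raising the threshold removes low-confidence boxes from the output image at the cost of possibly hiding true detections.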

To predict on all images in a folder, run the following in the project root directory:

Change --path to the path of the corresponding image folder.
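
A tool that accepts either a single image or a folder for --path typically expands the argument into a list of files first. The sketch below shows one way to do that; the function name and the list of accepted extensions are assumptions, not the repository's code.

```python
import os

# Assumed set of image extensions; adjust to the formats you actually use.
IMAGE_EXTS = (".jpg", ".jpeg", ".png", ".bmp", ".webp")

def list_images(path):
    """Return [path] for a single file, or all images below a folder, sorted."""
    if os.path.isfile(path):
        return [path]
    found = []
    for root, _dirs, files in os.walk(path):
        for name in files:
            if os.path.splitext(name)[1].lower() in IMAGE_EXTS:
                found.append(os.path.join(root, name))
    return sorted(found)
```

Each returned path can then be passed through the same single-image prediction routine.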

Training on the COCO2017 dataset

To train on the COCO dataset starting from an ImageNet-pretrained backbone network, run the following in the project root directory:

One command directly starts the single-machine eight-card training. Of course, the premise is that you really have a single-machine eight-card supercomputer.

The meaning of each parameter is:

- -f is the configuration file to use;

- -d is the number of graphics cards;

- -b is the batch size during training (across all cards);

- -eb is the batch size during evaluation (across all cards);

- -c is the weight file to read;

- --fp16 enables automatic mixed-precision training;

- --num_machines is the number of machines; training with multiple cards on a single machine is recommended;

- --resume indicates whether to resume training;
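
Since -b and -eb are totals across all cards, each card processes only a share of the batch. The sketch below shows that arithmetic; the divisibility check and the example numbers are assumptions, and the repository may handle uneven splits differently.

```python
def per_device_batch(total_batch, num_devices):
    """Split a total batch size evenly across devices."""
    if num_devices <= 0:
        raise ValueError("num_devices must be positive")
    if total_batch % num_devices != 0:
        raise ValueError("total batch size must be divisible by the number of devices")
    return total_batch // num_devices

# Hypothetical example: -d 8 -b 64 gives 8 images per card.
images_per_card = per_device_batch(64, 8)
```

Keeping the total batch size fixed while changing the number of cards keeps the effective learning-rate schedule comparable between runs.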

Training on a custom dataset

Starting from COCO-pretrained weights is recommended because training converges faster.

Take the VOC2012 dataset above as an example. For the ppyolo_r50vd model on 1 machine with 1 card, enter the following command to start training:

If training is interrupted for some reason and you want to resume from a previously saved model, just change -c to the path of the model you want to read and add the --resume parameter.

If it is 2 machines and 2 cards, that is, 1 card on each machine, enter the following command on machine 0:

And enter the following command on machine 1:

You only need to change 192.168.0.107 in the two commands above to the LAN IP of machine 0.
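
In multi-machine training every process needs to know the address of machine 0, the total number of processes, and its own global rank. The sketch below shows the usual arithmetic; the function name and the port number are assumptions, and the resulting tcp:// address is the kind of value PyTorch's torch.distributed.init_process_group accepts as init_method.

```python
def dist_layout(num_machines, gpus_per_machine, machine_rank, local_rank,
                master_ip, port=12345):
    """Compute the rendezvous address, world size, and global rank of one process."""
    world_size = num_machines * gpus_per_machine
    global_rank = machine_rank * gpus_per_machine + local_rank
    return {
        "init_method": f"tcp://{master_ip}:{port}",  # port is a hypothetical choice
        "world_size": world_size,
        "rank": global_rank,
    }

# 2 machines with 1 card each, as in the example above.
machine0 = dist_layout(2, 1, machine_rank=0, local_rank=0, master_ip="192.168.0.107")
machine1 = dist_layout(2, 1, machine_rank=1, local_rank=0, master_ip="192.168.0.107")
```

Both machines point at the same address, which is why only the IP of machine 0 appears in the two commands.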

If it is 1 machine and 2 cards, enter the following command to start training:

With transfer learning on the VOC2012 dataset, ppyolo_r50vd_2x reaches a measured AP (0.50:0.95) of 0.59, an AP (0.50) of 0.82, and an AP (small) of 0.18. The same result is obtained with a single card or with multiple cards.

In transfer learning, it has the same accuracy and convergence speed as PaddleDetection. The training logs of both are located in the train_ppyolo_in_voc2012 folder.

For the ppyoloe_l model, enter the following command on a single machine to start training (the backbone network is frozen):

With transfer learning on the VOC2012 dataset, ppyoloe_l reaches a measured AP (0.50:0.95) of 0.66, an AP (0.50) of 0.85, and an AP (small) of 0.28.

Evaluation

The commands and specific parameters are as follows.

The result of running in the project root directory is:

The converted weights lose a little accuracy, about 0.4%.

Statement

This article is reproduced from 51CTO.COM. If there is any infringement, please contact admin@php.cn to have it removed.
