ACM MM 2023 | DiffBFR: Noise suppression face restoration method jointly proposed by Meitu & University of Chinese Academy of Sciences

WBOY

Sep 03, 2023, 08:05 AM

The goal of Blind Face Restoration (BFR) is to restore high-quality face images from low-quality face images. This is an important task in the field of computer vision and graphics, and is widely used in various scenarios such as surveillance image restoration, old photo restoration, and facial image super-resolution.

However, the task is very challenging, because unknown degradations such as blur, noise, downsampling, and compression artifacts damage image quality and can even cause loss of image information. Previous BFR methods usually rely on generative adversarial networks (GANs) and address these problems by designing various face-specific priors, including generative priors, reference priors, and geometric priors. Although these methods have reached state-of-the-art performance, they still cannot fully achieve the goal of producing realistic textures while restoring details.

In image restoration, face image datasets are usually scattered in a high-dimensional space, and the distribution over feature dimensions takes a long-tailed form. Unlike the long-tail distribution in image classification tasks, the long-tail features in image restoration are attributes that have little impact on identity but a large impact on visual quality, such as moles, wrinkles, and tone.

As the simple experiment in Figure 1 shows, previous GAN-based methods have obvious problems handling the head and tail samples of a long-tail distribution at the same time: the restored images can be over-smoothed and lose detail. Methods based on Diffusion Probabilistic Models (DPM) fit the long-tail distribution better, retaining the tail features while fitting the real data distribution.

Figure 1: GAN-based and DPM-based methods tested on the long-tail problem

Researchers from Meitu Imaging Research Institute (MT Lab) and the University of Chinese Academy of Sciences jointly proposed DiffBFR, a new blind face restoration method based on DPM that restores low-quality (LQ) face images to clear, high-quality (HQ) images.

Paper link: https://arxiv.org/abs/2305.04517

This study explores how well two generative models, the Generative Adversarial Network (GAN) and the Diffusion Probabilistic Model (DPM), handle the long-tail problem. By designing appropriate face restoration modules, it obtains more accurate detail, reducing the facial over-smoothing that generative methods can cause and improving the precision and accuracy of restoration. The paper has been accepted by ACM MM 2023.

DPM-based blind face image repair method - DiffBFR

The study found that diffusion models are good at avoiding mode collapse during training and fit long-tail distributions better than GAN methods. DiffBFR therefore adopts DPM as its basic framework and uses the diffusion probabilistic model to strengthen the embedding of face prior information, because diffusion models are able to produce high-quality images across an arbitrary distribution range.

To address the long-tail distribution of features in face datasets identified in the paper, and the over-smoothing of previous GAN-based methods, this study explores a design that better fits the approximately long-tailed distribution and overcomes over-smoothing during restoration. In a simple experiment on the MNIST dataset with a GAN and a DPM of the same parameter size (Figure 1), the study found that the DPM fits the long-tail distribution reasonably well, whereas the GAN pays too much attention to head features and ignores tail features, so the tail features cannot be generated. DPM is therefore chosen as the solution for BFR.
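The article does not describe how the long-tailed MNIST training set was built. As a rough illustration only, one common way to create such a set is to subsample classes with exponentially decaying counts. In the sketch below, the function names, the head count, and the imbalance ratio are all hypothetical and are not taken from the paper.

```python
import random

def long_tail_counts(num_classes, head_count, imbalance_ratio):
    """Per-class sample counts that decay exponentially from head to tail.

    Class 0 keeps head_count samples; the last class keeps about
    head_count / imbalance_ratio. Requires num_classes >= 2.
    All values are illustrative assumptions.
    """
    counts = []
    for k in range(num_classes):
        frac = k / (num_classes - 1)  # 0.0 for the head class, 1.0 for the tail class
        counts.append(max(1, round(head_count * imbalance_ratio ** (-frac))))
    return counts

def subsample_long_tail(labels, counts, seed=0):
    """Return sorted indices of a subset with at most counts[c] items of class c."""
    rng = random.Random(seed)
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)
    keep = []
    for cls, n in enumerate(counts):
        pool = by_class.get(cls, [])
        rng.shuffle(pool)
        keep.extend(pool[:n])
    return sorted(keep)

def class_frequencies(labels, num_classes):
    """Fraction of samples per class (labels must be non-empty)."""
    total = len(labels)
    return [sum(1 for y in labels if y == c) / total for c in range(num_classes)]
```

Comparing `class_frequencies` of the training subset with the same statistic computed on generated samples is one simple way to see whether a generator reproduces the tail classes or collapses onto the head classes.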

By introducing two intermediate variables, DiffBFR proposes two specific repair modules. The design adopts a two-stage approach, first recovering identity information from LQ images, and then enhancing texture details based on the distribution of real faces. This design consists of two key parts:

(1) Identity Restoration Module (IRM):

The purpose of this module is to preserve the face details in the restored result. It uses a truncated sampling method, which starts the reverse process from the low-quality image with partial noise added instead of denoising from a pure Gaussian random distribution. The paper theoretically proves that this change shrinks the evidence lower bound (ELBO) of DPM and thereby restores more of the original details. Building on this proof, two cascaded conditional diffusion models with different input sizes are introduced to strengthen the sampling effect and reduce the training difficulty of directly generating high-resolution images. The paper further proves that the higher the quality of the conditional input, the closer the result is to the real data distribution and the more accurate the restored image. This is also why DiffBFR restores a low-resolution image first.
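The truncated sampling idea can be illustrated with the standard DDPM forward step, x_t = sqrt(ᾱ_t) · x_0 + sqrt(1 − ᾱ_t) · ε. The following is a minimal sketch under stated assumptions, not the paper's implementation: the linear beta schedule, the step counts, and the names `linear_alpha_bar`, `truncated_start`, `truncated_sampling`, and `denoise_step` (a stand-in for a trained conditional denoiser) are all hypothetical.

```python
import math
import random

def linear_alpha_bar(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative products alpha_bar[t] for a linear beta schedule.

    The linear schedule is a common DDPM choice and is assumed here.
    """
    alpha_bar, prod = [], 1.0
    for i in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        prod *= 1.0 - beta
        alpha_bar.append(prod)
    return alpha_bar

def truncated_start(x_lq, t, alpha_bar, rng):
    """Diffuse the LQ image (a flat list of pixel values) to step t.

    Computes sqrt(alpha_bar[t]) * x_lq + sqrt(1 - alpha_bar[t]) * eps,
    so part of the LQ content survives instead of being replaced by pure noise.
    """
    signal = math.sqrt(alpha_bar[t])
    noise = math.sqrt(1.0 - alpha_bar[t])
    return [signal * v + noise * rng.gauss(0.0, 1.0) for v in x_lq]

def truncated_sampling(x_lq, t_start, alpha_bar, denoise_step, rng):
    """Run the reverse process from t_start down to 0 rather than from pure noise.

    denoise_step(x, t) is a hypothetical callable standing in for one
    reverse step of a trained diffusion model.
    """
    x = truncated_start(x_lq, t_start, alpha_bar, rng)
    for t in range(t_start, -1, -1):
        x = denoise_step(x, t)
    return x
```

Starting at an intermediate step also means fewer reverse steps are run than in full sampling from pure noise, since only steps `t_start` through 0 are executed.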

(2) Texture Enhancement Module (TEM):

This module polishes image texture by introducing an unconditional diffusion model. The model is completely independent of the low-quality images, which brings the restored results closer to real image data. The paper theoretically proves that an unconditional diffusion model trained purely on high-quality images helps the output image follow the correct distribution in pixel-level space. That is, the distribution of restored images has a lower FID after this model is applied than before, and is overall more similar to the distribution of high-quality images. Specifically, sampling is truncated at a time step so that identity information is retained while pixel-level texture is polished.
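For reference, FID (Fréchet Inception Distance) is conventionally computed between Gaussian fits to Inception features of two image sets. The formula below is the standard definition, not one quoted from the paper. The symbols are notation assumed here: \mu_r, \Sigma_r are the feature mean and covariance of real high-quality images, and \mu_g, \Sigma_g are those of the restored images.

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2\,(\Sigma_r \Sigma_g)^{1/2} \right)
```

A lower value means the two feature distributions are closer, which is the sense in which the restored images become more similar to high-quality images.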

The sampling inference steps of DiffBFR are shown in Figure 2, and the schematic diagram of the sampling inference process is shown in Figure 3

Figure 2: Sampling inference steps of the DiffBFR method

Figure 3: Schematic diagram of the sampling inference process of the DiffBFR method

Experimental results

Figure 4: Visual comparison of GAN-based and DPM-based BFR methods

Figure 5: Performance comparison of SOTA BFR methods

Figure 6: Visual comparison of BFR methods

Figure 7: Visual comparison of IRM and TEM in the model

Figure 8: Performance comparison of IRM and TEM in the model

Figure 9: IRM performance under different parameters

Figure 10: Performance comparison under different parameters

Figure 11: Parameter settings of each module of DiffBFR

Summary

This paper proposes DiffBFR, a blind face restoration model for degraded face images based on the diffusion model, to solve the problems of mode collapse during training and the disappearing long tail seen in previous GAN-based methods. By embedding prior knowledge into the diffusion model, it can generate clear, high-quality restored images from randomly and severely degraded face images. Specifically, the study proposes two modules, IRM and TEM, used for identity restoration and texture enhancement respectively. The superiority of the model is supported by theoretical derivation and experimental image demonstrations, along with qualitative and quantitative comparisons against existing state-of-the-art methods.

Research Team

This paper was jointly proposed by researchers from Meitu Imaging Research Institute (MT Lab) and the University of Chinese Academy of Sciences. MT Lab was established in 2010 as Meitu's team focusing on algorithm research, engineering development, and product implementation in computer vision, deep learning, augmented reality, and related fields. Since its establishment, the team has been committed to research in computer vision, and it began deploying deep learning in 2013 to provide technical support for Meitu's software and hardware products. It also provides targeted SaaS services for multiple vertical fields in the imaging industry and promotes the ecosystem of Meitu's artificial intelligence products through cutting-edge imaging technology. The team has taken part in competitions at top international conferences such as CVPR, ICCV, and ECCV, winning more than ten first-place and runner-up finishes, and has published more than 48 papers at top international academic conferences. MT Lab has long been committed to research and development in the imaging field and has accumulated rich technical reserves and implementation experience in pictures, videos, design, and digital humans.

This article is reproduced from 机器之心.
