Zhejiang University proposes new SOTA technology SIFU: only one picture can reconstruct high-quality 3D human body model

WBOY

Jan 18, 2024 pm 02:15 PM

In many fields such as AR, VR, 3D printing, scene construction, and film production, high-quality 3D models of clothed humans are very important.

Traditional methods of creating such models take a long time and require professional equipment and trained personnel.

By contrast, the images most people have at hand in daily life are photos taken with a phone camera or portraits found on the web.

A method that can accurately reconstruct a 3D human model from a single image can therefore significantly reduce costs and make it easier for individuals to create models on their own.

Comparison of the technical route of previous methods (left) and this method (right)

Previous deep learning models for 3D human body reconstruction usually follow three steps: extracting 2D features from the image, transferring the 2D features to 3D space, and using the 3D features for human body reconstruction.

However, these methods often neglect to introduce human body priors when converting 2D features into 3D space, resulting in insufficient feature extraction and various defects in the final reconstruction.
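The second of these steps, moving 2D features into 3D space, can be illustrated with a minimal sketch. It shows pixel-aligned sampling, one common approach in earlier implicit-function work: a 3D query point is projected onto the image plane and the 2D feature map is sampled there. The function names, the orthographic projection, and the [-1, 1] coordinate convention are assumptions made for illustration and are not taken from the article or from the SIFU code.

```python
def bilinear_sample(feature_map, x, y):
    """Sample a grid of feature vectors at continuous pixel coordinates (x, y).

    feature_map[row][col] is a list of floats (one feature vector per pixel).
    """
    h, w = len(feature_map), len(feature_map[0])
    # Clamp so the 2x2 neighbourhood stays inside the grid.
    x = min(max(x, 0.0), w - 1.0)
    y = min(max(y, 0.0), h - 1.0)
    x0, y0 = int(x), int(y)
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    tx, ty = x - x0, y - y0

    def lerp(a, b, t):
        return [(1 - t) * ai + t * bi for ai, bi in zip(a, b)]

    top = lerp(feature_map[y0][x0], feature_map[y0][x1], tx)
    bottom = lerp(feature_map[y1][x0], feature_map[y1][x1], tx)
    return lerp(top, bottom, ty)


def pixel_aligned_feature(feature_map, point):
    """Lift 2D features to a 3D query point (hypothetical helper).

    point = (x, y, z) with x and y in [-1, 1]. The point is projected
    orthographically (z is dropped), the feature map is sampled at the
    projected location, and z is appended as a depth value.
    """
    h, w = len(feature_map), len(feature_map[0])
    px = (point[0] + 1.0) * 0.5 * (w - 1)
    py = (point[1] + 1.0) * 0.5 * (h - 1)
    return bilinear_sample(feature_map, px, py) + [point[2]]
```

In this sketch every 3D point along the same camera ray receives the same image feature and differs only in its depth value, which is one way to see why such a transfer step benefits from an additional body prior.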

Comparison of the reconstruction effect of SIFU and other SOTA models

In addition, in the texture prediction stage, previous models relied only on knowledge learned from the training set and lacked prior knowledge of the real world, which often resulted in poor texture prediction in invisible areas.

SIFU introduces prior knowledge in the texture prediction stage to enhance the texture effect of invisible areas (back, etc.).

To address these problems, researchers from Zhejiang University's ReLER Laboratory proposed SIFU, a model that relies on a side-view conditioned implicit function to reconstruct a 3D human body model from a single image.

Paper address: https://arxiv.org/abs/2312.06704

Project address: https://github.com/River-Zhang/SIFU

The model improves geometric reconstruction by introducing side views of the human body as a prior condition when converting 2D features into 3D space. It also introduces a pre-trained diffusion model in the texture optimization stage to address poor texture in invisible areas.

Model structure

The model pipeline is as follows:

The model runs in two stages. The first stage uses the side-view conditioned implicit function to reconstruct the geometry (mesh) and coarse texture of the human body. The second stage uses a pre-trained diffusion model to refine the texture.

In the first stage, the authors design a Side-view Decoupling Transformer. After a global encoder extracts 2D features, the decoder introduces side views of the human body prior model SMPL-X.

This method successfully combines prior knowledge of the human body when converting 2D features into 3D space, resulting in a better reconstruction effect of the model.
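The article does not spell out how the side-view prior enters the decoder. One plausible reading, sketched below purely as an assumption, is cross-attention in which embeddings derived from the side-view prior act as queries over the encoded image features. The sketch omits learned projection matrices and multiple heads, and the names `side_queries` and `image_tokens` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(side_queries, image_tokens, image_values):
    """Scaled dot-product cross-attention without learned projections.

    side_queries: one vector per side-view prior token (assumed role).
    image_tokens: key vectors from the image encoder.
    image_values: value vectors from the image encoder.
    Returns one attended feature vector per query.
    """
    d = len(image_tokens[0])
    dim_v = len(image_values[0])
    outputs = []
    for q in side_queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
            for k in image_tokens
        ]
        weights = softmax(scores)
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, image_values))
             for j in range(dim_v)]
        )
    return outputs
```

Each query pulls from the image features the parts most similar to it, so a separate query per view direction would yield a separate feature set per direction, which matches the "decoupling" in the module's name.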

In the second stage, the authors propose a 3D Consistent Texture Refinement process. First, the regions of the body that are invisible in the input (sides and back) are differentiably rendered into a set of images from continuous viewing angles. Then a diffusion model, which has learned prior knowledge from massive data, edits these coarse texture images consistently to produce more refined results. Finally, the texture map of the 3D model is optimized with a loss computed between the images before and after refinement.
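The final optimization step can be sketched with a toy example. Here the texture is a flat list of texel values, "rendering" a view simply selects the texels visible from it, and the refined targets stand in for the diffusion-edited images. All of this is a simplifying assumption: the real pipeline uses a differentiable renderer and a diffusion model, neither of which is reproduced here, and the function names, learning rate, and step count are hypothetical.

```python
def render(texture, visible):
    """Toy renderer: a view shows the texels whose indices are in `visible`."""
    return [texture[i] for i in visible]


def refine_texture(texture, views, targets, lr=0.5, steps=200):
    """Gradient descent on texel values to match refined per-view targets.

    views:   list of index lists, one per viewing angle.
    targets: refined image for each view (same length as the index list).
    Loss is the sum over views of the mean squared error between the
    rendered and the refined image.
    """
    texture = list(texture)
    for _ in range(steps):
        grad = [0.0] * len(texture)
        for visible, target in zip(views, targets):
            image = render(texture, visible)
            for i, pred, tgt in zip(visible, image, target):
                # d/d(texel) of mean((pred - tgt)^2) over this view
                grad[i] += 2.0 * (pred - tgt) / len(visible)
        texture = [t - lr * g for t, g in zip(texture, grad)]
    return texture
```

A texel seen in several views settles on a compromise between their targets, which hints at why the refined images need to be consistent across viewing angles before they are used as supervision.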

Experimental part

Higher reconstruction accuracy

In the experiments, the authors tested their model on comprehensive and diverse test sets, including CAPE-NFP, CAPE-FP and THuman2.0, and compared it with previous SOTA single-image human reconstruction models published at major conferences. In quantitative tests, SIFU showed the best results in both geometric reconstruction and texture reconstruction.
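The article does not name the metrics used in these quantitative tests. As an illustration of how geometric accuracy is commonly measured in this area, the following sketch computes a symmetric Chamfer distance between two point sets sampled from a reconstructed and a ground-truth surface. The averaging convention is one of several in use and is an assumption here.

```python
import math


def chamfer_distance(points_a, points_b):
    """Symmetric Chamfer distance between two non-empty 3D point sets.

    For each point, find the distance to its nearest neighbour in the
    other set, average per direction, then average the two directions.
    Brute force, O(len(a) * len(b)); fine for small illustrative inputs.
    """
    def one_way(src, dst):
        return sum(min(math.dist(p, q) for q in dst) for p in src) / len(src)

    return 0.5 * (one_way(points_a, points_b) + one_way(points_b, points_a))
```

Lower values mean the two surfaces lie closer together, and identical point sets give a distance of zero.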

Quantitative evaluation of geometric reconstruction accuracy

Quantitative evaluation of texture reconstruction effect

Qualitative results using public pictures from the Internet as input

Stronger robustness

When previous models are applied to data outside the training set, the estimated human body prior model (SMPL/SMPL-X) is often not accurate enough, so the reconstruction results differ greatly from the input image, which makes these models hard to use in practice.

To address this, the authors specifically tested the robustness of the model. They added perturbations to the ground-truth prior model parameters to shift the pose, simulating the inaccurate SMPL-X estimates that occur in real scenes, and then evaluated reconstruction accuracy. The results show that SIFU still achieves the best reconstruction accuracy in this case.
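The perturbation step can be sketched as adding random noise to a pose parameter vector. The article does not state the noise distribution or its scale, so the Gaussian noise, the `sigma` value, and the function name below are assumptions chosen only to make the idea concrete.

```python
import random


def perturb_pose(pose, sigma=0.05, seed=None):
    """Return a copy of `pose` with independent Gaussian noise added.

    pose:  list of ground-truth pose parameters (floats).
    sigma: standard deviation of the noise (assumed value).
    seed:  optional seed so the same perturbation can be reproduced
           when comparing several models.
    """
    rng = random.Random(seed)
    return [p + rng.gauss(0.0, sigma) for p in pose]
```

Fixing the seed lets every compared model receive exactly the same perturbed prior, so differences in reconstruction accuracy come from the models rather than from the noise.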

Evaluation of model robustness when the human body prior model contains errors

On real-world pictures, SIFU still reconstructs better when the estimated human body prior model is inaccurate

Broader application scenarios

The high-precision and high-quality reconstruction effect of the SIFU model makes it suitable for a variety of application scenarios, including 3D printing, scene construction, texture editing, etc.

3D-printed human body model reconstructed by SIFU

SIFU used for 3D scene construction

With the help of public motion sequence data, the model reconstructed by SIFU can be animated

Summary

This article proposes a side-view conditioned implicit function and a 3D consistent texture editing method. They make up for the insufficient use of prior knowledge in previous work, both when converting 2D features to 3D space and during texture prediction. The result is a large improvement in the accuracy and quality of single-image human body reconstruction, which gives the model significant advantages in real-world applications and provides new ideas for future research in this field.

Reference:

https://arxiv.org/abs/2312.06704
