
Could it replace Deepfake? A look at how powerful NeRF, this year's hottest technology, really is


What, you don’t know NeRF yet?

As the hottest AI technique in computer vision this year, NeRF is already widely used and has a bright future ahead.

Creators on Bilibili ("Station B") have already found creative new uses for the technology, drawing plenty of praise.

So, what exactly is NeRF?

NeRF (Neural Radiance Fields) was first proposed in an award-winning paper at ECCV 2020 (it received a best paper honorable mention). It pushes implicit representation to a new level: supervised only by posed 2D images, it can represent complex three-dimensional scenes.

The paper made an immediate splash. Since then, NeRF has developed rapidly and has been applied in many directions, such as novel view synthesis and 3D reconstruction.

NeRF takes sparse multi-view images with known camera poses as training input and fits a neural radiance field model; from that model, sharp images can be rendered from any viewpoint, as shown in the figure below. It can be summarized briefly as using an MLP to implicitly learn a three-dimensional scene.

[Figure: novel views rendered by NeRF from a sparse set of posed input images]

Naturally, people compare NeRF with the equally popular Deepfake.

A recent article published by Metaphysic surveyed NeRF's evolution, challenges, and advantages, and predicted that NeRF will eventually replace Deepfake.

Most of the eye-catching coverage of deepfake technology refers to two open-source packages that have been popular since deepfakes entered the public eye in 2017: DeepFaceLab (DFL) and FaceSwap.


While both packages have large user bases and active developer communities, neither project has deviated significantly from the code originally published on GitHub.

Of course, the developers of DFL and FaceSwap have not been idle: it is now possible to train deepfake models using larger input images, although this requires more expensive GPUs.


In fact, the improvements in deepfake image quality highlighted by the media over the past three years are mainly due to end users.

They have accumulated hard-won experience in data gathering and in the best ways to train models (a single experiment can take weeks), and they have learned how to push the original 2017 code to its outer limits.

Some in the VFX and ML research communities are trying to break through the "hard limits" of the popular deepfake packages by extending their architectures so that machine learning models can be trained on images as large as 1024×1024.


That is twice the resolution DeepFaceLab or FaceSwap can practically handle today, and closer to the resolutions useful in film and television production.

Let’s learn about NeRF together~

Unveiling

NeRF (Neural Radiance Fields), which appeared in 2020, is a method for reconstructing objects and environments inside a neural network by combining photographs taken from multiple viewpoints.

It achieves state-of-the-art results for synthesizing views of complex scenes by optimizing an underlying continuous volumetric scene function from a sparse set of input views.

The algorithm represents a scene with a fully connected deep network whose input is a single continuous 5D coordinate (spatial position (x, y, z) plus viewing direction (θ, φ)) and whose output is the volume density at that location together with the view-dependent emitted radiance (color).

A view is synthesized by querying these 5D coordinates along each camera ray and using classic volume rendering techniques to composite the output colors and densities into an image.
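As a rough illustration of that compositing step, here is a minimal PyTorch sketch of how per-sample colors and densities can be accumulated along one ray (the function and tensor names are hypothetical, not taken from the official NeRF code):

```python
import torch

def composite_ray(rgb, sigma, t_vals):
    """Composite per-sample colors and densities along one ray.

    rgb:    (N, 3) colors predicted at the N sample points
    sigma:  (N,)   volume densities at the sample points
    t_vals: (N,)   distances of the samples along the ray
    """
    # Distances between adjacent samples; the last interval is treated as effectively infinite.
    deltas = torch.cat([t_vals[1:] - t_vals[:-1], torch.tensor([1e10])])
    # Alpha: probability that the ray terminates within each interval.
    alpha = 1.0 - torch.exp(-sigma * deltas)
    # Transmittance: probability that the ray reaches each sample unoccluded.
    trans = torch.cumprod(torch.cat([torch.ones(1), 1.0 - alpha + 1e-10]), dim=0)[:-1]
    weights = alpha * trans                        # contribution of each sample
    color = (weights[:, None] * rgb).sum(dim=0)    # final pixel color
    return color, weights
```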

Implementation process:

[Figure: the NeRF pipeline — sample 5D coordinates along camera rays, query the MLP, and volume-render the predicted colors and densities into an image]

First, the continuous scene is represented as a 5D vector-valued function whose input is a 3D position and a 2D viewing direction, and whose output is an emitted color c and a volume density σ.

In practice, the direction is expressed as a 3D Cartesian unit vector d. This continuous 5D scene representation is approximated with an MLP network whose weights are optimized.

Additionally, the representation is encouraged to be consistent across multiple views by restricting the network to predict volume density σ as a function of position x, while also allowing RGB color c to be predicted as a function of position and viewing direction.

To achieve this, the MLP first processes the input 3D coordinate x with 8 fully connected layers (ReLU activations, 256 channels per layer) and outputs σ together with a 256-dimensional feature vector.

This feature vector is then concatenated with the viewing direction of the camera ray and passed to an additional fully connected layer that outputs the view-dependent RGB color.
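Taken together, that description translates into something like the following simplified PyTorch module (a sketch under my own assumptions: the encoding sizes, layer names, and the omission of the paper's skip connection are mine, not the reference implementation):

```python
import torch
import torch.nn as nn

class TinyNeRF(nn.Module):
    """Simplified NeRF MLP: 8 x 256-unit ReLU layers on the encoded position,
    then a density head and a view-dependent color head."""

    def __init__(self, pos_dim=60, dir_dim=24, width=256):
        super().__init__()
        layers, in_dim = [], pos_dim
        for _ in range(8):                       # 8 fully connected layers on the position
            layers += [nn.Linear(in_dim, width), nn.ReLU()]
            in_dim = width
        self.trunk = nn.Sequential(*layers)
        self.sigma_head = nn.Linear(width, 1)    # volume density, from position only
        self.feature = nn.Linear(width, width)   # 256-dimensional feature vector
        self.color_head = nn.Sequential(         # extra layer that also sees the view direction
            nn.Linear(width + dir_dim, width // 2), nn.ReLU(),
            nn.Linear(width // 2, 3), nn.Sigmoid())

    def forward(self, pos_enc, dir_enc):
        h = self.trunk(pos_enc)
        sigma = torch.relu(self.sigma_head(h))   # keep density non-negative
        feat = self.feature(h)
        rgb = self.color_head(torch.cat([feat, dir_enc], dim=-1))
        return rgb, sigma
```

The rgb and sigma returned here are exactly the per-sample quantities that the compositing sketch above consumes.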

In addition, NeRF introduces two improvements to represent high-resolution complex scenes: the first is a positional encoding that helps the MLP represent high-frequency functions, and the second is a hierarchical sampling procedure that lets it sample this high-frequency representation efficiently.

As is well known, the positional encoding in the Transformer architecture supplies the discrete positions of tokens in a sequence as input to the network. NeRF instead uses positional encoding to map continuous input coordinates into a higher-dimensional space, which makes it easier for the MLP to approximate high-frequency functions.
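A minimal sketch of such a frequency encoding (the function name is mine; the paper uses L = 10 frequencies for positions and L = 4 for viewing directions, which is assumed below):

```python
import torch

def positional_encoding(x, num_freqs):
    """Map each coordinate to [sin(2^k * pi * x), cos(2^k * pi * x)] for k = 0..num_freqs-1,
    lifting continuous inputs into a higher-dimensional space."""
    feats = []
    for k in range(num_freqs):
        for fn in (torch.sin, torch.cos):
            feats.append(fn((2.0 ** k) * torch.pi * x))
    return torch.cat(feats, dim=-1)

# Example: encode a batch of 3D positions with 10 frequencies -> 60-dim features.
xyz = torch.rand(1024, 3)
encoded = positional_encoding(xyz, num_freqs=10)   # shape (1024, 60)
```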

[Figure: comparison of renderings with the full model and with the positional encoding removed]

As the figure shows, removing the positional encoding greatly reduces the model's ability to represent high-frequency geometry and texture, ultimately producing an over-smoothed appearance.

Because densely evaluating the neural radiance field network at N query points along every camera ray is inefficient, NeRF ultimately adopts a hierarchical representation that allocates samples in proportion to their expected contribution to the final rendering, improving rendering efficiency.

In short, NeRF no longer uses a single network to represent the scene; it optimizes two networks simultaneously, a "coarse" network and a "fine" network.
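The way the coarse network's output guides the fine network can be sketched as follows (a simplified, hypothetical helper: real implementations sample continuously within each interval via the inverse CDF, while this version simply reuses the coarse sample depths):

```python
import torch

def sample_fine(t_coarse, weights, num_fine):
    """Draw extra sample locations along a ray in proportion to the coarse
    network's compositing weights (inverse-transform sampling of the CDF)."""
    pdf = weights + 1e-5                       # avoid zero probabilities
    pdf = pdf / pdf.sum()
    cdf = torch.cumsum(pdf, dim=0)
    u = torch.rand(num_fine)                   # uniform draws in [0, 1)
    idx = torch.searchsorted(cdf, u)           # which coarse interval each draw lands in
    idx = idx.clamp(max=t_coarse.shape[0] - 1)
    return t_coarse[idx]                       # new sample depths for the fine network

# The fine network is then evaluated at the union of coarse and fine samples,
# and both networks are trained with a photometric loss against the input images.
```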

The future is promising

NeRF addresses a long-standing shortcoming by using an MLP to represent objects and scenes as continuous functions, and it produces better renderings than previous methods.

However, NeRF still faces many technical bottlenecks. For example, NeRF acceleration schemes tend to sacrifice other useful properties (such as flexibility) in order to achieve low latency, more interactive environments, and shorter training times.

So although NeRF is a key breakthrough, it will still take time to reach truly polished results.

Technology keeps progressing, and the future is still promising!


Statement: This article is reproduced from 51cto.com. In case of infringement, please contact admin@php.cn for removal.