Technology peripherals

'It's hard to distinguish between true and false'! Clever use of autonomous driving simulation data generated by NeRF

'It's hard to distinguish between true and false'! Clever use of autonomous driving simulation data generated by NeRF

Apr 01, 2024 am 11:31 AM

dataAutopilotemulator

Written before & the author’s personal understanding

Neural Radiation Field (NeRF) has become a tool to advance the prelude to the re-search of autonomous driving (AD), providing scalable closed-loop simulation and data enhancement capabilities. However, in order to trust the results obtained in the simulation, it is necessary to ensure that the AD system perceives the real data and the rendered data in the same way. Although the performance of rendering methods is improving, many scenes remain inherently challenging to faithfully reconstruct. To this end, we propose a new perspective to address the gap between real and simulated data. We not only focus on improving rendering fidelity, but explore simple yet effective methods to enhance the robustness of perceptual models to NeRF artifacts without affecting real data performance. Furthermore, we conduct the first large-scale investigation of the real-to-simulated data gap in AD settings using state-of-the-art neural rendering techniques. Specifically, our study evaluates object detectors and online mapping models on real and simulated data and investigates the effects of different pre-training strategies. Our results show significantly improved model accuracy on simulated data, even improving real-world performance in some cases. Finally, we delve into the real-to-simulated similarities, identifying FID and LPIPS as strong indicators.

In this article, we propose a new perspective to bridge the gap between intelligent driving systems and perception modules. Our goal is not to improve rendering quality but to make the perceptual model more robust to NeRF artifacts without degrading performance on real data. We believe that this direction is complementary to improving NeRF performance and is key to enabling virtual AV testing. As a first step in this direction, we show that even simple data augmentation techniques can have a strong impact on the model's robustness to NeRF artifacts.

We conduct the first extensive real2sim gap study on a large-scale AD dataset and evaluate the performance of multiple object detectors as well as online mapping models on real data and state-of-the-art (SOTA) neural rendering method data. . Our study includes the impact of different data augmentation techniques during training, and the fidelity of NeRF rendering during inference. We found that during model fine-tuning, these data showed the impact of the augmentation technique, and the fidelity of NeRF rendering even improved performance on real data in some cases. Finally, we study the correlation between implicit and common image reconstruction metrics in real2sim and gain insights into the significance of using NeRFs for CAD data simulators. We find that LPIPS and FID are powerful indicators of real2sim disparity, and further verify that our proposed enhancement reduces the sensitivity of contrast-poor visual compositionality.

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

Detailed explanation of method

To test and validate the AD functionality of the NeRF-driven simulation engine, they can use the data that has been collected to Explore new virtual scenes. However, in order for the results using such models to be trustworthy, the AD system must behave in the same way when processing the data and the actual data. In this work, we propose an alternative and complementary approach to adapt AD systems to make them less sensitive to differences between real and simulated data. In this way, we can tune the AD system to be less sensitive to differences between real and simulated data and thus better handle differences between real and simulated data.

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

As a first step in exploring how fine-tuning strategies can make perceptual models more robust to artifacts in rendered data, we used different fine-tuning strategies. Specifically, given an already trained model, we fine-tune the perceptual model using images that focus on improving the performance of rendered images while maintaining performance on real data, see Figure 2. In addition to reducing the real2sim gap, this may also reduce the requirements for sensor realism, pave the way for wider application of neural rendering methods, and reduce the computational requirements for the training and evaluation of description methods. Note that while we focus on perceptual models, our approach can be easily extended to end-to-end models as well.

Finally, we can imagine multiple ways to achieve the goal of making models more robust, such as drawing inspiration from the domain adaptation and multi-task learning literature. However, fine-tuning requires minimal model-specific adjustments, allowing us to easily study a range of models.

Image augmentations

Image augmentation is a commonly used method. The classic strategy for enhancing robustness against artifacts is to use image augmentation. Here, we choose enhancements to represent the various distortions present in the rendered image. More specifically, we add random Gaussian noise, convolve the image with a Gaussian blur kernel, and apply photometric distortion similar to that found in SimCLR. Finally, the image is downsampled and upsampled. Augments are applied sequentially, and each augmentation has a certain probability.

Fine-tuning with mixed-in rendered images

#NeRF是用來渲染3D場景的深度學習模型。在微調期間，模型可以適應另一種自然形式，即在微調期間包含這種數據。這有助於訓練NeRF模型的能力，以便可以在監控感知模型的相同資料集上訓練NeRF方法。然而，在大型資料集上訓練NeRF的成本可能高昂，其中一些可能需要進行3D物件偵測、語義分割或多種類別標籤等任務的標籤。此外，AD的NeRF常常也增加了中資料的順序要求。為了適應這些要求，標籤可能需要更多的特殊處理，例如3D物件偵測，語意分割或多種類別標籤等。

接下來，我們將中所選序列的影像分割為NeRF訓練集和保持集。感知模型的微調是在它們的整個訓練資料集D上完成的，對於在D中具有渲染對應關係的影像，我們使用具有機率p的渲染影像。這意味著用於微調的圖像沒有被NeRF模型看到。

Image-to-image translation

如前所述，渲染NeRF資料是一種昂貴的資料增強技術。此外，除了感知任務所需的資料外，它還需要順序資料和潛在的額外標記。也就是說，為了獲得可擴展的方法，我們理想地想要一種有效的策略來獲得單一影像的NeRF資料。為此，我們建議使用圖像對圖像的方法來學習生成類別NeRF圖像。給定真實影像，該模型將影像轉換到NeRF域，有效地引入了NeRF的典型偽影。這使我們能夠在有限的運算成本下，在微調過程中大幅增加類NeRF影像的數量。我們使用渲染圖像Dnerf及其對應的真實圖像來訓練圖像到圖像模型。不同增強策略的可視化範例見圖3。

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

結果

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

Its hard to distinguish between true and false! Clever use of autonomous driving simulation data generated by NeRF

##神經輻射場（NeRF ）已成為模擬自動駕駛（AD）數據的一種很有前途的途徑。然而，為了實用，必須了解AD系統在模擬資料上執行的行為是如何轉換為真實的資料。我們的大規模調查揭示了暴露於模擬影像和真實影像的感知模型之間的表現差距。

與早期專注於提升渲染品質的方法不同，本文研究如何改變感知模型，使其對NeRF模擬資料更具穩健性。我們表明，使用NeRF或類似NeRF的資料進行微調，在不犧牲真實資料效能的情況下，大大減少了物件偵測和線上映射方法的real2sim間隙。此外，我們還表明，在現有的列車分佈之外產生新的場景，例如模擬車道偏離，可以提高實際數據的效能。對NeRF社群內常用影像指標的研究表明，LPIPS和FID分數與感知表現表現出最強的相關性。這表明，與單純的重建品質相比，感知相似性對感知模型具有更大的意義。

總之，我們認為NeRF模擬資料對AD有價值，尤其是當使用我們提出的方法來增強感知模型的穩健性時。此外，NeRF資料不僅有助於在模擬資料上測試AD系統，而且有助於提高感知模型在真實資料上的表現。

#

The above is the detailed content of 'It's hard to distinguish between true and false'! Clever use of autonomous driving simulation data generated by NeRF. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete

Related Article

Tesla's Robovan Was The Hidden Gem In 2024's Robotaxi Teaser

Tesla's Robovan Was The Hidden Gem In 2024's Robotaxi TeaserApr 22, 2025 am 11:48 AM

Since 2008, I've championed the shared-ride van—initially dubbed the "robotjitney," later the "vansit"—as the future of urban transportation. I foresee these vehicles as the 21st century's next-generation transit solution, surpas

Sam's Club Bets On AI To Eliminate Receipt Checks And Enhance Retail

Sam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailApr 22, 2025 am 11:29 AM

Revolutionizing the Checkout Experience Sam's Club's innovative "Just Go" system builds on its existing AI-powered "Scan & Go" technology, allowing members to scan purchases via the Sam's Club app during their shopping trip.

Nvidia's AI Omniverse Expands At GTC 2025

Nvidia's AI Omniverse Expands At GTC 2025Apr 22, 2025 am 11:28 AM

Nvidia's Enhanced Predictability and New Product Lineup at GTC 2025 Nvidia, a key player in AI infrastructure, is focusing on increased predictability for its clients. This involves consistent product delivery, meeting performance expectations, and

Exploring the Capabilities of Google's Gemma 2 Models

Exploring the Capabilities of Google's Gemma 2 ModelsApr 22, 2025 am 11:26 AM

Google's Gemma 2: A Powerful, Efficient Language Model Google's Gemma family of language models, celebrated for efficiency and performance, has expanded with the arrival of Gemma 2. This latest release comprises two models: a 27-billion parameter ver

The Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics Vidhya

The Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaApr 22, 2025 am 11:21 AM

This Leading with Data episode features Dr. Kirk Borne, a leading data scientist, astrophysicist, and TEDx speaker. A renowned expert in big data, AI, and machine learning, Dr. Borne offers invaluable insights into the current state and future traje

AI For Runners And Athletes: We're Making Excellent Progress

AI For Runners And Athletes: We're Making Excellent ProgressApr 22, 2025 am 11:12 AM

There were some very insightful perspectives in this speech—background information about engineering that showed us why artificial intelligence is so good at supporting people’s physical exercise. I will outline a core idea from each contributor’s perspective to demonstrate three design aspects that are an important part of our exploration of the application of artificial intelligence in sports. Edge devices and raw personal data This idea about artificial intelligence actually contains two components—one related to where we place large language models and the other is related to the differences between our human language and the language that our vital signs “express” when measured in real time. Alexander Amini knows a lot about running and tennis, but he still

Jamie Engstrom On Technology, Talent And Transformation At Caterpillar

Jamie Engstrom On Technology, Talent And Transformation At CaterpillarApr 22, 2025 am 11:10 AM

Caterpillar's Chief Information Officer and Senior Vice President of IT, Jamie Engstrom, leads a global team of over 2,200 IT professionals across 28 countries. With 26 years at Caterpillar, including four and a half years in her current role, Engst

New Google Photos Update Makes Any Photo Pop With Ultra HDR Quality

New Google Photos Update Makes Any Photo Pop With Ultra HDR QualityApr 22, 2025 am 11:09 AM

Google Photos' New Ultra HDR Tool: A Quick Guide Enhance your photos with Google Photos' new Ultra HDR tool, transforming standard images into vibrant, high-dynamic-range masterpieces. Ideal for social media, this tool boosts the impact of any photo,

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks agoByDDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks agoByDDD

Where to find the Crane Control Keycard in Atomfall

3 weeks agoByDDD

Roblox: Dead Rails - How To Complete Every Challenge

4 weeks agoByDDD

Atomfall guide: item locations, quest guides, and tips

4 weeks agoByDDD

Hot Tools

Atom editor mac version download

Atom editor mac version download

The most popular open source editor

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools

PhpStorm Mac version

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

Hot Topics

Where is the login entrance for gmail email?

7652

15

CakePHP Tutorial

1393

52

What is the format of the account name of steam

91

11

win11 activation key permanent

73

19

nyt mini crossword answers

37

110