search
Homeweb3.0IVG: Integrating Human Values into Large Language Models at Inference Time

IVG: Integrating Human Values into Large Language Models at Inference Time

Oct 03, 2024 pm 03:16 PM
AIIntegrated Value Guidance Implicit and Explicit Value Functions Token-Wise Sampling Chunk-Level Beam Search

Researchers developed Inference-time alignment methods to integrate human values after fine-tuning LLMs using the implicit and explicit functions without changing the base model.

IVG: Integrating Human Values into Large Language Models at Inference Time

Integrating human values after training a model with Learning-based algorithms requires fine-tuning LLMs, which is computationally expensive and time-consuming. Moreover, it generates biased and undesirable responses by the user. A model that can efficiently adapt to user preferences in real time by integrating algorithms that can interfere at inference time is needed. This method will avoid retraining the models repeatedly for desired results by freezing the base model and reducing the computational cost of fine-tuning LLMs.

Researchers developed Inference-time alignment methods to integrate human values after fine-tuning LLMs using the implicit and explicit functions without changing the base model. Implicit functions are used for token generation, which conducts word-by-word evaluations and prefers the output with the highest probability. In contrast, explicit functions require a rigid structure to evaluate larger chunks of text and generate the following sequence of words with the highest probability while maintaining overall context. The explicit function is inflexible and computationally expensive, failing to address token-level optimization, while the implicit function faces interpretability issues and requires frequent forward passes, leading to low real-time efficiency.

To tackle the disadvantages of both functions, the proposed method, Integrated Value Guidance (IVG), combines the implicit function’s token-level optimization and the explicit function’s broader perspective. It was able to ward off adaptation challenges and trade-offs in alignment efficacy, leading to decreased performance discrepancies and making it easier to implement. These advantages facilitated better performance on tasks like controlled sentiment generation and summarization. IVG, combined with the smaller models like GPT-2, could compete with higher models.

IVG incorporates the two value functions, the implicit and explicit functions, to align the model with human values. First, token-wise sampling fine-tunes individual tokens to a specific sequence length, generating multiple sequences. Then, chunk-level beam search compares the probabilities of these sequences and selects the one with the highest probability. Although this method ensures that the output is more robust, the computational power increases during the inference time due to frequent forward passes, leading to slower responses.

Researchers have used two experimental set-ups to evaluate IVG: 1. Controlled sentiment generation and Summarization, and 2. Instruction-following. In the first one, the GPT-2 model family is used by leveraging synthetic datasets from a gold-reward model to generate positive movie reviews and summarise Reddit posts. In comparison, the second one requires an instruction-tuned model, AlpacaEval 2.0. It employs Tulu Guidance, which uses specific models for implicit function and trains a reward-based model for the explicit function, and Ultraguidance, which fine-tunes a model with Direct Preference Optimization (DPO) for both functions. GPT-4-turbo was used as a reference to assess responses in the second experiment, and IVG consistently performed well.

In addition to these two experiments, an ablation study proved that Chunk-Level Beam Search (CBS) had higher speed efficiency than Emulator Fine-Tuning (EFT), which uses the implicit function for fine-tuning. These results have proved that CBS is much better to use in practice.

In conclusion, Integrated Value Guidance (IVG) offers a novel and efficient approach to aligning large language models with human preferences purely at inference time, bypassing the complexities of traditional fine-tuning. By leveraging implicit and explicit value functions, IVG enhances performance in both token-wise sampling and chunk-level decoding, as demonstrated through significant improvements in sentiment generation, summarization, and instruction-following tasks. The results showed that IVG is a versatile method, providing strong empirical evidence of its ability to outclass existing approaches, making it a promising solution for fine-tuning large models in real-world applications.

Don’t Forget to join our 50k ML SubReddit

Want to get in front of 1 Million AI Readers? Work with us here

The above is the detailed content of IVG: Integrating Human Values into Large Language Models at Inference Time. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
As Trump's Pro-Coin Turn Shapes How Legislators Approach Digital Asset Policy, a New Bipartisan Push to Control Stablecoins is Running AgroundAs Trump's Pro-Coin Turn Shapes How Legislators Approach Digital Asset Policy, a New Bipartisan Push to Control Stablecoins is Running AgroundMay 16, 2025 am 11:42 AM

The path of the measure has been reinterpreted by President Trump's sudden crypto embrace. Originally a strident opponent of digital currencies, Trump today supports blockchain innovation

Eric Trump says father's energy policies will help cryptoEric Trump says father's energy policies will help cryptoMay 16, 2025 am 11:40 AM

Eric Trump said his latest venture into the crypto industry, American Bitcoin, aims to mine the digital currency for cheaper than its rivals

Ethereum Launches the 'Trillion Dollar Security Initiative” to Consolidate Its Leadership PositionEthereum Launches the 'Trillion Dollar Security Initiative” to Consolidate Its Leadership PositionMay 16, 2025 am 11:38 AM

As the crypto landscape evolves at a frantic pace, Ethereum is launching a strategic offensive to consolidate its leadership position.

The crypto rally took a long-overdue pause on Thursday as traders took some profits following weeks of relentless advance that lifted bitcoin BTC$ close to record prices.The crypto rally took a long-overdue pause on Thursday as traders took some profits following weeks of relentless advance that lifted bitcoin BTC$ close to record prices.May 16, 2025 am 11:36 AM

The consolidation occurred amid a slew of U.S. economic data releases. April retail sales missed expectations, producer prices rose less than forecast, jobless claims stayed on track

US President Donald Trump's son Eric assures the world's leading crypto conference that Washington would hoard 'a tremendous amount of bitcoin'US President Donald Trump's son Eric assures the world's leading crypto conference that Washington would hoard 'a tremendous amount of bitcoin'May 16, 2025 am 11:34 AM

US President Donald Trump's son Eric on Thursday assured the world's leading crypto conference that Washington would hoard "a tremendous amount of bitcoin"

Eric Trump Promises Washington Will Hoard 'a Tremendous Amount of Bitcoin'Eric Trump Promises Washington Will Hoard 'a Tremendous Amount of Bitcoin'May 16, 2025 am 11:32 AM

The US crypto industry has welcomed Trump's return to the White House, praising policies its says mark a clear departure from the deep skepticism of the previous Democratic administration toward digital currencies.

Stablecoins Are Solutions in Search of ProblemsStablecoins Are Solutions in Search of ProblemsMay 16, 2025 am 11:30 AM

Stablecoins, which have the potential to become widely used for payments, are trying to avoid that fate.

PayPal's Jose Fernandez da Ponte Predicts Banks Will Be the Key to Stablecoin SuccessPayPal's Jose Fernandez da Ponte Predicts Banks Will Be the Key to Stablecoin SuccessMay 16, 2025 am 11:28 AM

Seated amidst the futuristic ambiance of Toronto's Consensus 2025 conference, PayPal's voice in digital currencies, Jose Fernandez da Ponte

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

VSCode Windows 64-bit Download

VSCode Windows 64-bit Download

A free and powerful IDE editor launched by Microsoft

ZendStudio 13.5.1 Mac

ZendStudio 13.5.1 Mac

Powerful PHP integrated development environment