Tian Yuandong's team releases DOC, the second version of its long story generator: coherence is greatly improved and interestingness is up 20.7%!

Some time ago, Dr. Tian Yuandong's team presented Re3 (Recursive Reprompting and Revision), a story generation framework based on a large language model, at EMNLP 2022. Through prompt design alone, with no fine-tuning of the large model, it can generate consistent stories of up to 7,500 words.

Re3's authors recently released the second version of their long story generation framework, DOC (Detailed Outline Control). It uses a hierarchical outline to describe the story in more detail, and a fine-tuned OPT-350m model to keep the generated continuation more coherent. In human evaluations, DOC was rated as a more capable writer than the previous-generation Re3.

Paper link: https://arxiv.org/abs/2212.10077

Code link: https://github.com/yangkevin2/doc-story-generation

DOC consists of two complementary components:

1. The detailed outliner creates a more detailed, hierarchically structured outline, moving creative work from the main drafting process to the planning stage;

2. The detailed controller keeps the more detailed outline effective during generation by controlling story passages so that they stay consistent with the outline details.

In human evaluations of automatically generated stories, DOC achieved absolute gains of 22.5% in plot coherence, 28.2% in outline relevance, and 20.7% in interestingness over the previous Re3 baseline. Human evaluators also found DOC easier to control in an interactive generation setting.

The first author of the paper, Kevin Yang, is a fourth-year doctoral student at the University of California, Berkeley. His main research interest is controllable natural language text generation in structured settings, such as using structured methods for controllable generation to improve the consistency of long texts.

The second author, Dr. Tian Yuandong, is a researcher and senior manager at Meta Artificial Intelligence Research Institute. His research interests include deep reinforcement learning and its application in games, as well as theoretical analysis of deep learning models. He received his bachelor's and master's degrees from Shanghai Jiao Tong University in 2005 and 2008, and his doctorate from the Robotics Institute of Carnegie Mellon University in the United States in 2013.

DOC Framework

As natural language technology continues to develop, large language models' handling of short texts is approaching a bottleneck, and interest is growing in generating longer texts, for example thousands of words at once.

Compared with short text generation, long texts contain more content and more constraints. The model needs to maintain overall coherence and long-range factual consistency, and stay relevant to the premise or plan supplied by the user.

Compared with humans, story generation systems like Re3 still fall short in many respects: they cannot guarantee plot coherence over long distances, they show global inconsistencies, and the story content can deviate from the established plan.

To bridge this gap, the Detailed Outline Control (DOC) framework reuses Re3's high-level planning-drafting-revision structure and improves long-range coherence through two complementary approaches.

Detailed Outliner

First, the detailed outliner refines a brief initial outline into a more detailed, hierarchical outline. It is designed this way because a human author might likewise iteratively refine and expand a short initial outline before drafting a long document.

Rather than improvising new plot points, a writer might plan a coherent overarching plot in the high-level outline stage, using an expanded outline to provide more detailed guidance during the drafting process.
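The refinement idea can be sketched as a small tree structure. This is a minimal illustration, not the DOC implementation: `OutlineNode`, `refine`, `leaves`, and `toy_expand` are hypothetical names, and `toy_expand` stands in for the language-model call that would propose sub-events.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class OutlineNode:
    """One outline item; children hold the more detailed sub-items."""
    text: str
    children: List["OutlineNode"] = field(default_factory=list)


def refine(node: OutlineNode, expand: Callable[[str], List[str]], depth: int) -> None:
    """Recursively expand each item into sub-items, adding `depth` new levels."""
    if depth == 0:
        return
    node.children = [OutlineNode(text) for text in expand(node.text)]
    for child in node.children:
        refine(child, expand, depth - 1)


def leaves(node: OutlineNode) -> List[str]:
    """Return the most detailed items in story order."""
    if not node.children:
        return [node.text]
    return [leaf for child in node.children for leaf in leaves(child)]


def toy_expand(text: str) -> List[str]:
    # Hypothetical stand-in for a language-model call that proposes sub-events.
    return [f"{text}.a", f"{text}.b"]


root = OutlineNode("1")
refine(root, toy_expand, depth=2)
print(leaves(root))
```

In this sketch the leaf items are what would guide drafting, one item at a time, while the upper levels preserve the overarching plot.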

During the drafting stage, the researchers reused the outline relevance and text coherence rerankers from Re3's rewriting stage to detect when the passage for the current outline item is complete, and implemented early stopping based on a score threshold.
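A minimal sketch of threshold-based early stopping follows. It assumes a single scoring function; `draft_item`, `word_overlap`, the threshold value, and the canned passages are all hypothetical, and the word-overlap score is a toy substitute for the learned rerankers.

```python
import re
from typing import Callable, List


def word_overlap(passage: str, outline_item: str) -> float:
    """Toy score: fraction of outline-item words that appear in the passage."""
    item_words = set(re.findall(r"\w+", outline_item.lower()))
    passage_words = set(re.findall(r"\w+", passage.lower()))
    return len(item_words & passage_words) / max(len(item_words), 1)


def draft_item(
    outline_item: str,
    generate: Callable[[str], str],
    score: Callable[[str, str], float],
    threshold: float,
    max_passages: int = 5,
) -> List[str]:
    """Generate passages for one outline item until the score reaches the threshold."""
    passages: List[str] = []
    for _ in range(max_passages):
        passage = generate(" ".join(passages))  # context is the text drafted so far
        passages.append(passage)
        if score(passage, outline_item) >= threshold:
            break  # the item is judged complete, so move on to the next one
    return passages


# Canned passages stand in for language-model output (assumption for the demo).
canned = iter(["Anna packs her bag.", "Anna finds the old map.", "She sails north."])
passages = draft_item("Anna finds map", lambda context: next(canned), word_overlap, threshold=0.9)
print(passages)
```

The `max_passages` cap is a design choice in this sketch so that a passage that never reaches the threshold cannot stall drafting indefinitely.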

Each outline item is associated with a setting and the relevant characters, and is carefully filtered for relevance and coherence in context.

In the structured prompt, the model highlights the current setting and any change in setting, and retrieves character descriptions based on the characters detected in the outline item.
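One way such a prompt could be assembled is sketched below. The field labels, the `build_prompt` helper, and the name-matching character detection are assumptions for illustration; the article does not give DOC's actual prompt format.

```python
from typing import Dict, Optional


def build_prompt(
    outline_item: str,
    setting: str,
    previous_setting: Optional[str],
    character_descriptions: Dict[str, str],
    story_so_far: str,
) -> str:
    """Assemble a structured drafting prompt for one outline item."""
    lines = []
    # Retrieve descriptions only for characters named in the outline item.
    for name, description in character_descriptions.items():
        if name in outline_item:
            lines.append(f"{name}: {description}")
    # Highlight a change in setting explicitly; otherwise state the current one.
    if previous_setting is not None and previous_setting != setting:
        lines.append(f"The setting changes from {previous_setting} to {setting}.")
    else:
        lines.append(f"Setting: {setting}")
    lines.append(f"Upcoming event: {outline_item}")
    lines.append(f"Story so far: {story_so_far}")
    return "\n".join(lines)


prompt = build_prompt(
    outline_item="Anna finds the map",
    setting="the open sea",
    previous_setting="the harbor",
    character_descriptions={"Anna": "a cartographer", "Ben": "a rival captain"},
    story_so_far="Anna left home.",
)
print(prompt)
```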

In contrast, Re3 dynamically selects relevant characters for each passage during drafting and does not track setting information, which can lead to unexpected changes in the story's setting.

Detailed Controller

The second component, the detailed controller, controls passage generation based on the corresponding outline item in order to maintain fidelity to the detailed outline.

Because the detailed outline imposes many overlapping soft constraints, the detailed controller must exert sufficient control strength. At the same time, it must accommodate flexible natural language input and remain computationally efficient when generating with state-of-the-art large language models.

So the researchers implemented the detailed controller as a controller based on OPT-350m, and designed a contrastive training procedure that aligns summaries with passage prefixes.

Most importantly, the researchers also constructed many fluent hard negatives, to encourage generated passages not only to start off relevant to the topic but to stay relevant throughout.
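To see why hard negatives matter, consider a generic softmax contrastive loss. This is an illustrative stand-in, not the objective from the paper: `contrastive_loss` is a hypothetical helper, and the scores are made-up alignment scores between a passage prefix and candidate summaries.

```python
import math
from typing import List


def contrastive_loss(positive_score: float, negative_scores: List[float]) -> float:
    """Softmax cross-entropy that treats the matching summary as the correct class."""
    scores = [positive_score] + negative_scores
    max_score = max(scores)  # subtract the max for numerical stability
    log_norm = max_score + math.log(sum(math.exp(s - max_score) for s in scores))
    return log_norm - positive_score


# Easy negatives score far below the matching summary; hard negatives score close to it.
easy = contrastive_loss(2.0, [-3.0, -3.0])
hard = contrastive_loss(2.0, [1.5, 1.5])
print(easy < hard)
```

Under this loss, negatives that already score far below the positive contribute almost nothing, whereas fluent, near-miss negatives keep the loss high and so push the controller to separate them from the true summary.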

Experimental part

In the experiments, the input to the model is just a short English premise, usually 30-60 words, and the output is a complete story.

The researchers did not impose further rule-based constraints, because the definition of a "story" is not yet clear, let alone that of a "good story"; quality is judged mainly by human evaluation metrics.

There are three main indicators used in evaluation, which are more suitable for comparing paragraphs rather than complete stories:

1. Coherence, the percentage of paragraphs that human annotators judge to have a coherent plot;

2. Relevance, the percentage of paragraphs that are judged to conform to the corresponding outline entries;

3. Interestingness, the percentage of passages that are considered interesting.
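Each metric is a percentage of judged paragraphs, and an absolute gain is the difference between two such percentages in percentage points. A small sketch with made-up toy annotations (not results from the paper; `percentage_judged` is a hypothetical helper):

```python
from typing import List


def percentage_judged(labels: List[bool]) -> float:
    """Percentage of paragraphs that annotators marked as having the property."""
    return 100.0 * sum(labels) / len(labels)


# Toy annotations for illustration only: True means "judged coherent".
doc_labels = [True, True, True, False]
re3_labels = [True, False, False, True]

doc_pct = percentage_judged(doc_labels)
re3_pct = percentage_judged(re3_labels)
absolute_gain = doc_pct - re3_pct  # difference in percentage points
print(doc_pct, re3_pct, absolute_gain)
```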

The baseline models compared include Re3, ROLLING-OPT and ROLLING-GPT.

As the experimental results show, annotators judged the plots generated by DOC to be more coherent and more relevant to the outline than those of Re3, and the improvement over the ROLLING baselines is even larger.

The results also support the model design: plot coherence and outline relevance benefit from shifting creative work from drafting to planning, as well as from the improved control mechanism.

Surprisingly, the annotators also judged the DOC paragraphs to be significantly more interesting. The researchers attribute this to the more detailed (more event-based) outlines, and further ablation experiments support this hypothesis.

However, qualitative analysis also revealed that the model still has huge room for further improvement.

Unlike Re3, which sometimes strays almost completely off topic, DOC usually does not deviate significantly from the top-level outline. However, DOC often fails to follow the lower-level parts of the detailed outline.

Internal consistency remains problematic in both DOC and Re3, and occasional errors in the detailed outline can be particularly damaging, leading to cascading errors during drafting.

Additionally, outlines in DOC are often inconsistent in their level of detail, with some items too vague and others over-expanded.

Additionally, the settings and characters detected by the model can sometimes be incorrect or incomplete. The example below shows a heavily abridged story written by DOC according to the above outline.

Statement
This article is reproduced from 51CTO.COM. If there is any infringement, please contact admin@php.cn to have it removed.