To make videos with Clapper, you only need to be the director.
Ever since Sora was unveiled, the video field seems to have entered the era of generative AI. Yet to this day, OpenAI's official video generation tool is still not generally available, and people who can't wait have started looking for alternatives. In recent weeks, Clapper, an open-source AI video tool, has been attracting attention.
Unlike the video generators offered by many technology companies, Clapper is an open-source AI story visualization tool that launched as a prototype a year ago. It is not meant to replace traditional video editors, or modern AI editors that use 3D scenes as input. Clapper's philosophy is to bring together various generative AI technologies so that anyone can create videos with AI through an interactive, iterative, and intuitive process, with no external tools and no filmmaking or AI engineering skills required. In Clapper, you do not edit sequences of video and audio files directly; instead, you iterate on your story with the help of AI agents by adjusting high-level, abstract concepts such as characters, locations, weather, time period, style, and more.

Clapper's author, Julian Bilcke, is an AI front-end engineer at Hugging Face. He says that to keep working in this direction, he is also developing a director mode: the goal is to let users play the video full screen, sit back comfortably in the director's chair (or on the sofa), shout commands to the agent, and have the AI make the movie.
In recent days, Julian Bilcke has shipped new features such as using large language models to convert arbitrary text into a timeline. Clapper's popularity has grown accordingly: it already has more than 1,100 stars on GitHub.
- GitHub link: https://github.com/jbilcke-hf/clapper
- HuggingFace link: https://huggingface.co/spaces/jbilcke-hf/clapper/tree/main
- Trial URL: https://clapper.app/
Since it is an open-source tool, the main question is of course whether it is easy to use. Do you remember AI expert Andrej Karpathy's experience making an AI short video? Turning the first three sentences of "Pride and Prejudice" into an animation took this top researcher a full hour. Although there were only three sentences and three scenes, the workflow was far more complicated than the three sentences themselves. He first used Claude to generate a series of image prompts from the original text, then fed those prompts into a text-to-image model to generate the corresponding images, and then handed the images to a video model to animate them. The narration was assigned to ElevenLabs, and finally he assembled all the pieces in Veed Studio. After finishing, Karpathy tweeted a complaint: "Entrepreneurs, the opportunity is here! The market urgently needs an AI tool that integrates and simplifies these processes." Clapper is exactly such a one-stop platform integrating all of these features.
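To see why a one-stop platform matters, it helps to picture the glue work in the manual pipeline described above. The sketch below is purely illustrative: it is not Karpathy's actual code, and every function is a made-up placeholder standing in for a separate service that previously had to be operated by hand.

```typescript
// Hypothetical sketch of the manual multi-tool pipeline described above.
// Each stub stands in for a separate service the creator had to drive by
// hand (an LLM for prompts, a text-to-image model, an image-to-video
// model, a text-to-speech service, then a video editor).
// None of these are real APIs; they only illustrate the orchestration.

const llmWritePrompt = async (text: string) => `cinematic illustration of: ${text}`;
const textToImage    = async (prompt: string) => `image-for(${prompt})`;
const imageToVideo   = async (imageUrl: string) => `clip-for(${imageUrl})`;
const textToSpeech   = async (text: string) => `audio-for(${text})`;

interface Scene {
  text: string;      // one sentence of the source story
  clipUrl: string;   // animated shot
  audioUrl: string;  // narration
}

async function buildShortFilm(sentences: string[]): Promise<Scene[]> {
  const scenes: Scene[] = [];
  for (const text of sentences) {
    const prompt   = await llmWritePrompt(text);   // step 1: LLM writes an image prompt
    const imageUrl = await textToImage(prompt);    // step 2: generate a still frame
    const clipUrl  = await imageToVideo(imageUrl); // step 3: animate the frame
    const audioUrl = await textToSpeech(text);     // step 4: narrate the sentence
    scenes.push({ text, clipUrl, audioUrl });
  }
  return scenes; // step 5: hand the pieces to an editor to stitch together
}

buildShortFilm(["It is a truth universally acknowledged..."]).then(scenes =>
  console.log(scenes)
);
```

Five separate tools, each with its own interface, for three sentences of video; Clapper's pitch is to fold all of these steps behind a single timeline.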
Usually, making a short video involves the following steps: first you need a story and a script; then you draw storyboards from the script; then you shoot or source footage according to the storyboards, assemble it in editing software, add animation and special effects, and finally layer in narration, background music, or sound effects as needed. This is why the film and television industry developed its division of labor: screenwriting, directing, cinematography, editing, post-production, and dubbing. In Clapper, video production follows a different logic. Unlike in Premiere, CapCut, and other editing software, each track does not correspond to video or image footage but to a specific type of job.

(Figure: Clapper's tracks)

When it comes to making videos with AI, we are the client, and Clapper is like a crew made up of the best AI models in the industry. Clapper has a series of "top-tier" large models built in, such as GPT-4o and Claude 3.5 Sonnet; like the contractor's executive director, they are responsible for connecting your requirements to the corresponding "AI director."
As you can see from the figure above, the first track represents the storyboard and talks to Clapper's built-in large model, which calls a text-to-image model through an API and has the "AI storyboard artist" generate the corresponding images as the basis for the video frames. Through Clapper, you can access a range of such text-to-image models. Taking the sample project Clapper provides as an example, the following tracks correspond to the scene, narration, camera angle, background music, and sound effects. You can ask ElevenLabs or Fal.ai to generate wind blowing through ruins or gunfight explosions for this Western wasteland story.
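To make the "one track per job type" idea concrete, here is a rough sketch of what such a timeline could look like as a data structure. This is an assumption for illustration only, with made-up field names, not Clapper's actual internal format.

```typescript
// Illustrative only: a guess at how a "one track per job type" timeline
// could be modeled. Clapper's real data format may differ.

type TrackKind =
  | "storyboard"   // still frames from a text-to-image model
  | "video"        // animated shots
  | "dialogue"     // narration / character lines
  | "camera"       // camera angle directions
  | "music"        // background music
  | "sound";       // sound effects (wind, explosions, ...)

interface Segment {
  track: TrackKind;
  startTimeInMs: number;
  endTimeInMs: number;
  prompt: string;      // high-level description the AI works from
  assetUrl?: string;   // filled in once the corresponding model has run
}

// A few segments of the Western wasteland sample, expressed this way:
const timeline: Segment[] = [
  { track: "storyboard", startTimeInMs: 0, endTimeInMs: 4000,
    prompt: "ruined desert town at dusk, lone gunslinger" },
  { track: "sound", startTimeInMs: 0, endTimeInMs: 4000,
    prompt: "wind blowing through ruins" },
  { track: "dialogue", startTimeInMs: 1000, endTimeInMs: 3500,
    prompt: "narrator: the town had been empty for years" },
];

console.log(timeline.filter(segment => segment.track === "sound"));
```

The point of such a structure is that the user only ever touches the prompts; the generated assets are a by-product that can be regenerated at any time.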
Clapper also has a feature that may take a real step toward the dream of "making movies by talking." We can import a script directly into Clapper and carefully build up a character for the protagonist in the "Story" panel. Taking "The Wizard of Oz" as an example, we can not only give the characters more personalized descriptions but also upload pictures to define the visual appearance of the heroine, Dorothy. That means we can cast any actor in the world in the role; even if you want to see an 18-year-old DiCaprio playing Dorothy, you can do it. Clapper's controls are detailed enough that the characters' age and voice timbre, the furnishings of each scene, what furniture sits in Dorothy's room, and what the houses in their adventure destination, the "Emerald City," look like can all be adjusted in Clapper.
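Again as a purely illustrative sketch (the field names are invented, not Clapper's schema), the character settings described above amount to editing a small structured record rather than cutting footage.

```typescript
// Hypothetical character record, not Clapper's actual schema:
// the point is that you direct by editing descriptions, not video files.

interface CharacterSettings {
  name: string;
  description: string;     // personality and backstory
  age: number;             // can be adjusted at any time
  voiceTimbre: string;     // passed along to the speech model
  referenceImage?: string; // optional uploaded picture defining the look
}

const dorothy: CharacterSettings = {
  name: "Dorothy",
  description: "a determined farm girl swept away to a strange land",
  age: 18,
  voiceTimbre: "young, bright, slightly wistful",
  referenceImage: "uploads/dorothy-reference.png", // e.g. a young DiCaprio photo
};

// Recasting or aging the character is just a field change plus a regeneration:
dorothy.age = 30;
```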
Of course, you can also have the AI draw a few mood images first, which may further spark your inspiration and creativity.
However, although Clapper's features cover the full range of video-making needs, the results are still somewhat underwhelming. The characters' movements look a bit "ghostly" and defy the laws of physics, the overall video feels more like an animated slideshow, lacking transitions and continuity between shots, and the soundtrack has a distinctly AI flavor: melody-free and laced with noise.
It may take a long time for generative AI to reshape the video production process, but the emergence of Clapper may offer a new implementation idea to the major vendors still bolting AI features onto traditional video editing software.
Reference content:
- https://news.ycombinator.com/item?id=41221399
- https://x.com/aigclink/status/1818111874531205216