
Big release: 'brain-inspired science' may be the optimal solution to the computing-power consumption and context-length problems of artificial intelligence large language models!

WBOY | 2023-10-20 17:25

At a grand gathering of science and science fiction, science fiction suddenly shone into reality.

Recently, at an event on science fiction and the rise of AI held by the Shenzhen Institute of Advanced Technology, the Shenzhen University of Technology Education Foundation, and the Science and Fantasy Growth Fund, a Shenzhen team called Luxi Technology publicly released its artificial intelligence large language model, NLM (Neuromorphic Generative Pre-trained Language Model), for the first time: a large language model that is not based on the Transformer.

Unlike many large models at home and abroad, this team puts brain-inspired science and brain-like intelligence at its core, integrates characteristics of recurrent neural networks, and develops large language models inspired by the brain's efficient style of computation.


What is even more striking: at the same parameter scale, the model's computing power consumption is 1/22 that of the Transformer architecture. On context length, NLM also gives a compelling answer: the context window can grow without limit, unconstrained by the 2k limit of open-source LLMs or by the 32k and 100k limits of other models.

What is brain-inspired computing?

Brain-inspired computing is a computing model that imitates the structure and function of the human brain, simulating the brain's neural connectivity in its architecture, design principles, and information-processing methods. It goes beyond mimicking the surface characteristics of biological neural networks and instead reproduces their basic construction: processing and storing sequential information through large-scale interconnections of neurons and synapses.

Unlike traditional rule-based algorithms, brain-inspired computing relies on large numbers of interconnected neurons to learn and extract information autonomously, just as the human brain does. This approach allows computing systems to learn from experience, adapt to new situations, understand complex patterns, and make sophisticated decisions and predictions.

Thanks to their high adaptability and parallel processing capabilities, brain-inspired computing systems have shown high efficiency and accuracy in processing big data, image and speech recognition, natural language processing, and other fields. Not only can these systems quickly process complex and changing information, they also consume far less energy and computing resources than traditional computing architectures, because they do not require extensive pre-programming and data input.

In general, brain-inspired computing opens up a new computing paradigm. It transcends traditional artificial neural networks and moves towards advanced intelligent systems that can self-learn, self-organize, and even have a certain degree of self-awareness.

The advancement of large brain-inspired models

At the event, Dr. Zhou Peng of the Luxi team explained in detail how the brain-inspired large model is implemented.

The model is built on a new generation of neural networks, also known as brain-inspired neural networks, which overcome the shortcomings of the first two generations:

- The first-generation neural network (also known as the MLP, or multi-layer perceptron) transmits signals as 0s and 1s; it cannot handle overly complex tasks, but it also requires little computing power.

- The second-generation neural network, also known as the artificial neural network, turns the transmitted signal into a continuous value in the interval [0, 1]. It is expressive enough for complex tasks, but its computing cost soars accordingly.

- The third-generation neural network, also known as the brain-inspired (spiking) neural network, turns signals into pulse sequences, achieving sufficient expressiveness while keeping computing cost controllable. The pulse sequence is produced by mimicking the dynamics of biological neural structures. And because a sequence unfolds in time, the third-generation network can effectively integrate and output the temporal information carried in its inputs (see the sketch after this list).

- Compared with the previous two generations, it processes sequential information with a time dimension more effectively and therefore models the real world more faithfully.
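To make the "pulse sequence" idea concrete, here is a minimal sketch of a leaky integrate-and-fire (LIF) neuron, the textbook unit of third-generation networks. All names and constants are illustrative assumptions for exposition, not Luxi's implementation:

```python
import numpy as np

def lif_neuron(input_current, dt=1.0, tau=20.0, v_rest=0.0,
               v_threshold=1.0, v_reset=0.0):
    """Leaky integrate-and-fire neuron: converts a continuous input
    signal into a binary spike (pulse) train."""
    v = v_rest
    spikes = []
    for i_t in input_current:
        # Membrane potential leaks toward rest while integrating input.
        v += (dt / tau) * (-(v - v_rest) + i_t)
        if v >= v_threshold:   # threshold crossed: emit a spike...
            spikes.append(1)
            v = v_reset        # ...and reset the membrane potential
        else:
            spikes.append(0)
    return spikes

# A steady supra-threshold input yields a regular pulse train; the
# spike timing and rate, not a continuous value, carry the signal.
print(lif_neuron(np.full(100, 1.5)))
```

A stronger input drives the membrane potential over the threshold sooner, so information is encoded in when and how often the neuron fires rather than in a continuous activation value.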


The inference principle of large models built on brain-inspired algorithms is also completely different from the Transformer's. During inference, every time the Transformer model generates the next token, it considers all of the contextual information. The operation can be compared to a chat in which, every time we say a word, we must recall everything that happened during the day. This is also the main reason the computing cost of large models keeps climbing as their parameters continue to grow, as sketched below.
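To illustrate why per-token cost grows with context, here is a schematic single-head attention decoding step over a key/value cache; the shapes and names are illustrative assumptions, not details from the article:

```python
import numpy as np

def transformer_step(query, k_cache, v_cache):
    """One decoding step of single-head self-attention: the new
    token's query attends over EVERY cached position, so per-token
    work grows linearly with the context length L."""
    scores = k_cache @ query / np.sqrt(query.size)  # (L,) touches all L past tokens
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ v_cache                        # weighted mix of all past values

d = 64
k_cache = np.random.randn(10_000, d)  # 10k tokens of context...
v_cache = np.random.randn(10_000, d)  # ...all revisited on every single step
out = transformer_step(np.random.randn(d), k_cache, v_cache)
```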

By contrast, the brain-inspired model needs only its internal state and a single token when reasoning. This can be compared to blurting out the next word while speaking, without deliberately recalling everything said before, even though what we say remains intrinsically connected to prior experience. This mechanism is the key to NLM's sharply reduced computing overhead; it brings the model closer to the way the human brain operates and thereby improves its performance significantly.
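Here, by comparison, is a minimal sketch of a recurrent-style decoding step of the kind this description suggests; the matrices and dimensions are illustrative assumptions, not the NLM architecture:

```python
import numpy as np

def recurrent_step(state, token_embedding, W_state, W_in):
    """One decoding step of a recurrent (brain-like) model: only the
    fixed-size state and the current token are needed, so per-token
    work is constant regardless of how long the context is."""
    new_state = np.tanh(W_state @ state + W_in @ token_embedding)
    return new_state  # the whole history is compressed into this vector

d = 64
state = np.zeros(d)
W_state, W_in = np.random.randn(d, d), np.random.randn(d, d)
# The cost of this step is identical whether 10 or 10 million tokens came before.
state = recurrent_step(state, np.random.randn(d), W_state, W_in)
```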


Also because of these brain-inspired characteristics, limited context length is no longer a troubling problem. The NLM large model, built on third-generation neural networks, has no context-length bottleneck, because the computing power required to process the next token is unrelated to the context length. Publicly available large language models on the Transformer architecture reach only about 100k of context, and extending it further is not merely a matter of computing cost but a question of whether it can be done at all.
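Under the simplified single-layer assumptions of the two sketches above (hidden size d, context length L), a rough back-of-the-envelope comparison shows why one cost grows and the other does not:

```python
d = 4096                         # hidden size (illustrative)
recurrent_flops = 2 * d * d      # fixed-size state update: constant per token
for L in (2_000, 32_000, 100_000, 1_000_000):
    attention_flops = 2 * L * d  # scoring and mixing over L cached tokens
    print(f"L={L:>9,}: attention-to-recurrent cost ratio = "
          f"{attention_flops / recurrent_flops:.2f}")
```

The attention term scales linearly with L (and the key/value cache itself grows with L), while the recurrent term stays constant; that is the sense in which per-token compute is "not related to the context length".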

NLM's unlimited context opens the door to the imagination in large-language-model applications: studying complex financial reports, reading novels hundreds of thousands of words long, or using unlimited context to make a large model "understand you" more precisely can all become reality.

AI in the eyes of the Luxi team

At this event, Dr. Zhou Peng, founder and CTO of Luxi Technology, explained the team's current mission: to empower all things with wisdom.

In the era of artificial intelligence, AI needs to be as ubiquitous as the Internet and electricity already are. Although today's artificial intelligence is impressive in its capabilities, its operating costs place a huge burden on businesses and consumers. Under current technology, the vast majority of mobile phones, watches, tablets, and laptops cannot run generative large language models completely, systematically, efficiently, and with high quality, and the threshold for developing large-model applications has intimidated many outstanding developers who would otherwise take it up.

At the event, Luxi Technology showed the audience how the "NLM-GPT" large model, running in offline mode on an ordinary Android phone, completes common tasks in work and life, pushing the event to its climax.

- The phones used in the demonstration carry chip architectures common on the market, with performance similar to mainstream consumer Android models. With the phone in airplane mode and disconnected from the Internet, Luxi Technology demonstrated the "NLM-GPT" large model talking with users in real time, answering their questions, and completing instructions including poetry writing, recipe composition, knowledge retrieval, and document interpretation: tasks that are highly complex, demanding of phone hardware, and traditionally possible only with a network connection.

- Throughout the demonstration, the phone's energy consumption remained stable, with minimal impact on normal standby time and no impact on the phone's overall performance.

- The demonstration showed that the "NLM-GPT" large model has the potential to run in all scenarios on small consumer devices such as smartphones and tablets, with high efficiency, low power consumption, and zero network traffic. Empowered by "NLM-GPT", phones, watches, tablets, laptops, and other devices can understand people's true intentions more accurately and efficiently, and can complete the instructions and tasks people give them with higher quality across application scenarios such as work, study, social networking, and entertainment, greatly improving the efficiency and quality of social production and daily life.

Luxi Technology believes that generative large language models driven by brain-inspired technology will comprehensively extend human thinking, perception, and action across learning, work, and life, and will enhance humanity's collective wisdom. Empowered by brain-inspired technology, artificial intelligence will no longer be a new agent that replaces humans; it will become an efficient intelligent tool with which humans change the world and create a better future.

Just as the ancients trained hounds and falcons, the hunter's profession did not disappear because hounds and falcons appeared. On the contrary, hunters benefited: they harnessed powers that hounds and falcons possess and humans do not, obtained prey more efficiently, and so fed the growth of human communities and the development of human civilization.

In the future, applying artificial intelligence large language models in daily work and life will no longer be a complex, multi-step systems project. It will be as simple, natural, and smooth as showing the payment code at checkout, pressing the shutter to take a photo, or triple-tapping approval on a short video. The Luxi team will continue working in the field of brain-inspired computing, studying the brain, nature's most precious gift to mankind, in depth, and bringing brain-inspired intelligence into daily life.

Perhaps, in the near future, humans will have new artificial intelligence partners. No blood flows in their bodies, and their intelligence will not replace ours. With the support of brain-inspired technology, they will work with us to explore the mysteries of the universe, expand the boundaries of society, and create a better future.

Source: Life Daily



