search
HomeTechnology peripheralsAIChris Lattner, the father of LLVM: Why we need to rebuild AI infrastructure software

Chris Lattner, the father of LLVM: Why we need to rebuild AI infrastructure software

The vision of AI that people once imagined was very beautiful, but the current situation is not satisfactory. AI has not realized its original predictions in daily applications such as autonomous driving and new drug research and development. A common complaint is that the global technology giants have brought together a large number of the smartest brains, but they still focus more on precise advertising and credit scoring. And on "smart" speakers that are not very smart.

Theoretically, as long as there are correct algorithms and sufficient computing resources, AI can solve all problems represented by any available data. Now that data, algorithms and hardware resources are abundant enough, AI can be used to benefit society. All conditions are met. We have seen the broad application and initial effects of AI, but in fact, the technology is not applied in depth, and it is far from realizing the full potential of existing machine learning research.

Why does this happen? The facts are more profound than the latest research updates on AI by the world’s technology giants and media. ​Compiler expert Chris Lattner once pointed out that the singleness and fragmentation of AI systems and tools are the root cause of this problem. In order to solve this problem, in January 2022, compiler expert Chris Lattner announced that he would start a business overseas and co-founded Modular AI with Tim Davis. The goal is to rebuild the global ML infrastructure, including compilers and runtimes. , pay equal attention to heterogeneous computing, edge to data center, and focus on availability to improve developer efficiency. Currently, the Modular AI team has participated in building most of the world's production machine learning infrastructure from TensorFlow, TF Lite, XLA, TPU, Android ML, Apple ML, MLIR, etc., and has deployed production workloads to billions of users and equipment.

Recently, Modular AI announced the completion of a US$30 million seed round of financing, led by Google Venture. In the latest official blog post published by Chris Lattner and others, they issued "Three Questions for the Soul": AI is so important, why is the software so miserable? Why haven’t tech giants solved the AI ​​puzzle? How to solve this problem? Of course, they also gave answers. The OneFlow community compiled and organized the original text.

1 AI is so important, why is the software so miserable?

AI software was originally designed for full-stack researchers, engineers, and architects building AI technology. It was never defined as a product. Therefore,

AI software has defect.

This kind of software is built by big tech companies to solve their own problems, and other enterprises use it on a "trickle down" infrastructure. The result is that only the largest and most commercially impactful AI applications are built and deployed in practice, and even then only if the needs of the enterprise are highly aligned with the internal needs of large technology companies. accomplish.

Why is this? Because the current AI software is very simple and has heavy research attributes, it is mainly used to meet the development plans of technology giants (the developers of these software). These softwares were created by researchers to do research, and the rapid development of AI leaves researchers no time to stop and rebuild.

Instead, over time we have added more and more complexity, making it difficult for the industry to maintain and scale fragmented custom tool chains that are used in research and production, training and There are differences between deployments, servers and edges.

Artificial intelligence systems have now become a vast ocean of incompatible technologies, and only those comprehensive technology giants have the ability to use AI to achieve their goals.

2 Why haven’t technology giants solved AI problems?

AI research and developers work together to make deploying AI a success, and tech giants use their vast computing and financial resources to advance their products and core business priorities, including their own clouds, phones, Social networking and artificial intelligence hardware.

Although they have made outstanding contributions to the field, from a business perspective, it is impossible for them to promote AI to the whole world (covering all hardware, cloud and ML frameworks), and the rest of the world cannot Expect them to do so. However, this unfortunate fact limits the rest of the world’s ability to use this technology to solve problems outside of the focus areas of big tech companies, including some of the biggest socioeconomic and environmental issues facing the world. But this is not the future we want.

Although giants have made huge contributions to the development of artificial intelligence, for artificial intelligence to fully realize its potential, an independent company is needed. This company does not have to prioritize its own hardware, cloud infrastructure, Mobile phone development or own research; at the same time, we need a neutral company to do what is best for the interests of global users and businesses. We need to incorporate what we learn from the rapid growth of AI software into next-generation technologies to provide usable solutions and common standards for the types of problems faced by all organizations.

Today, the most pressing issue facing small and medium-sized technology companies is how to break through the limitations of capacity, cost, time and talent to put AI into production.

Due to opportunity cost considerations, it is difficult for their innovative technologies to be promoted to the market, and the product experience is poor, which will ultimately have a negative impact on their development. For society as a whole, this means that we still have to wait quite a while before AI can solve some of the world's biggest problems.

We don’t have time to wait for the tech giants to roll out trickle-down AI software. AI can change the world, but only if fragmentation is resolved and the global AI developer community doesn’t have to struggle with high-quality infrastructure.

3 Who will solve this problem? How to solve?

Modular is building a next-generation AI developer platform that will be more practical, faster and more flexible.

Our platform unifies the front-ends of popular AI frameworks through common interfaces, and enhances access and portability to various hardware back-ends and cloud environments. We're rebuilding our core developer workflow tools to be more expressive, usable, debuggable, reliable, scalable, and performant. Our tools can be easily deployed into existing workflows, allowing users to seamlessly complete their work without refactoring or rewriting code, and achieve improvements in productivity and performance at a lower cost. We will accelerate the exploration of the value of AI and bring it to the market as soon as possible to benefit the majority of users.

When AI can penetrate into various applications in a more subtle way, its potential will be fully demonstrated - by then, you will no longer have to define your application around AI. Our platform is built from modular, composable infrastructure components that can be rearranged and extended to implement a variety of use cases. At the same time, experts in various fields can innovate through our platform even without understanding how the entire system works. We’ve seen firsthand how a modular approach can unlock new use cases that we hadn’t thought of in the past.

In order to truly repair AI infrastructure, we must not only solve "hard technical" problems (such as compilers for heterogeneous computing technologies), but also establish a seamless end-to-end developer workflow.

4 From the "AI Research Era" to the "AI Production Era"

Our success means that developers around the world will have access to truly usable, portable and scalable AI software.

In the new world, developers who lack sufficient budgets or top talent can also work as efficiently as global technology giants; the efficiency and total cost of ownership (TCO) of AI hardware will be improved Optimization; enterprises can easily plug in custom ASICs to suit their use cases; deploy to the edge as easily as deploying to servers; enterprises can use whichever AI framework best suits their needs; AI programs can seamlessly scale on hardware, Deploying the latest AI research into production couldn't be easier.

We will see: the development of the AI ​​industry is no longer limited by the timetable determined by the technology giants according to their own needs; the development of the AI ​​industry will be faster and more concentrated; innovation will be at all levels of the stack Booming, with developers focused on bringing new innovations to market in their areas of expertise and building a better future for all of us; the industry is developing at a rapid pace, leading us from the "AI Research Era" to the "AI Production Era" ".

The above is the detailed content of Chris Lattner, the father of LLVM: Why we need to rebuild AI infrastructure software. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
Are You At Risk Of AI Agency Decay? Take The Test To Find OutAre You At Risk Of AI Agency Decay? Take The Test To Find OutApr 21, 2025 am 11:31 AM

This article explores the growing concern of "AI agency decay"—the gradual decline in our ability to think and decide independently. This is especially crucial for business leaders navigating the increasingly automated world while retainin

How to Build an AI Agent from Scratch? - Analytics VidhyaHow to Build an AI Agent from Scratch? - Analytics VidhyaApr 21, 2025 am 11:30 AM

Ever wondered how AI agents like Siri and Alexa work? These intelligent systems are becoming more important in our daily lives. This article introduces the ReAct pattern, a method that enhances AI agents by combining reasoning an

Revisiting The Humanities In The Age Of AIRevisiting The Humanities In The Age Of AIApr 21, 2025 am 11:28 AM

"I think AI tools are changing the learning opportunities for college students. We believe in developing students in core courses, but more and more people also want to get a perspective of computational and statistical thinking," said University of Chicago President Paul Alivisatos in an interview with Deloitte Nitin Mittal at the Davos Forum in January. He believes that people will have to become creators and co-creators of AI, which means that learning and other aspects need to adapt to some major changes. Digital intelligence and critical thinking Professor Alexa Joubin of George Washington University described artificial intelligence as a “heuristic tool” in the humanities and explores how it changes

Understanding LangChain Agent FrameworkUnderstanding LangChain Agent FrameworkApr 21, 2025 am 11:25 AM

LangChain is a powerful toolkit for building sophisticated AI applications. Its agent architecture is particularly noteworthy, allowing developers to create intelligent systems capable of independent reasoning, decision-making, and action. This expl

What are the Radial Basis Functions Neural Networks?What are the Radial Basis Functions Neural Networks?Apr 21, 2025 am 11:13 AM

Radial Basis Function Neural Networks (RBFNNs): A Comprehensive Guide Radial Basis Function Neural Networks (RBFNNs) are a powerful type of neural network architecture that leverages radial basis functions for activation. Their unique structure make

The Meshing Of Minds And Machines Has ArrivedThe Meshing Of Minds And Machines Has ArrivedApr 21, 2025 am 11:11 AM

Brain-computer interfaces (BCIs) directly link the brain to external devices, translating brain impulses into actions without physical movement. This technology utilizes implanted sensors to capture brain signals, converting them into digital comman

Insights on spaCy, Prodigy and Generative AI from Ines MontaniInsights on spaCy, Prodigy and Generative AI from Ines MontaniApr 21, 2025 am 11:01 AM

This "Leading with Data" episode features Ines Montani, co-founder and CEO of Explosion AI, and co-developer of spaCy and Prodigy. Ines offers expert insights into the evolution of these tools, Explosion's unique business model, and the tr

A Guide to Building Agentic RAG Systems with LangGraphA Guide to Building Agentic RAG Systems with LangGraphApr 21, 2025 am 11:00 AM

This article explores Retrieval Augmented Generation (RAG) systems and how AI agents can enhance their capabilities. Traditional RAG systems, while useful for leveraging custom enterprise data, suffer from limitations such as a lack of real-time dat

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

SecLists

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

SAP NetWeaver Server Adapter for Eclipse

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.