search
HomeTechnology peripheralsAIYear-end review: The ten hottest data science and machine learning startups in 2022

As businesses deal with ever-increasing amounts of data, both generated internally within the organization and collected from external sources, finding effective ways to analyze and “manipulate” this data for competitive advantage becomes increasingly important. The more challenging it is.

This has also driven the need for new tools and technologies in the field of data science and machine learning. According to the "Fortune Business Insights" report, the global machine learning market will reach US$15.44 billion in 2021 and will reach US$21.17 billion this year. It is expected to grow to US$209.91 billion by 2029, with a compound annual growth rate of 38.8%. .

At the same time, according to a report by Allied Market Research, the global market size of data science platforms in 2020 was US$4.7 billion and is expected to reach US$79.7 billion by 2030, with a compound annual growth rate of 33.6%.

“Data science” and “machine learning” can sometimes feel a little confusing or even used interchangeably. These are actually two different concepts, but they are related because data science practices are key to machine learning projects.

According to the definition of the Master's in Data Science website, data science is a field of study that uses scientific methods to extract meaning and insights from data, including developing data analysis strategies, preparing data for analysis, developing data visualizations, and building data models .

According to the "Fortune Business Insights" report, machine learning is a subsection of the broader field of artificial intelligence and refers to the use of data analysis to teach computers how to learn (that is, to imitate the way humans learn) Use algorithms and data-based models.

The demand for data science and machine learning tools has spawned a number of startups developing cutting-edge technologies in the field of data science or machine learning. Let’s take a look at 10 of them:

  • Aporia
  • Baseten
  • Deci
  • Galileo
  • Neuton
  • Pinecone
  • Predibase
  • Snorkel AI
  • Vectice
  • Verta

Aporia

Aporia develops a full-stack, highly customizable machine learning observable A comprehensive platform that data science and machine learning can use to monitor, debug, interpret, and improve machine learning models and data.

Aporia was founded in 2020 and received an additional $25 million in Series A funding in March 2022, 10 months after receiving $5 million in seed funding.

Aporia will use the funding to triple the size of its workforce by early 2023, while also expanding its U.S. presence and expanding the range of use cases its technology covers.

Baseten

Baseten was officially released in April this year. It provides products that can speed up the process from machine learning model development to production-level applications.

According to Baseten, the technology, which has been in internal beta since the summer of 2021, can automate many of the skills required to bring machine learning models into production, helping data science and machine learning teams integrate machine learning into In the business process, no back-end, front-end or MLOps knowledge is required.

Baseten was founded in 2019 by CEO Tuhin Srivastava, CTO Amir Haghighat and chief scientist Philip Howes, who all previously worked at e-commerce platform developer Gumroad. Baseten raised $12 million in Series A funding in April this year, following an earlier $8 million in seed funding.

Deci

Deci has developed a deep learning development platform for building the next generation of artificial intelligence and deep learning applications. Deci’s technology is designed to help address the “AI efficiency gap,” where computer hardware is unable to keep up with the demands of machine learning models that are growing in size and complexity.

Deci’s platform takes production into account early in the development lifecycle, helping data scientists bridge this gap and reduce the time and cost of resolving issues when deploying models in production. According to Deci, the platform incorporates Deci’s proprietary AutoNAC (Automatic Neural Architecture Construction) technology to provide a “more efficient development paradigm” to help AI developers leverage hardware-aware “neural architecture search” to build deep learning models. to meet specific production demand targets.

Deci was founded in 2019 and raised US$25 million in Series B financing led by Insight Partners in July this year. Just 7 months ago, Deci had just received US$2,100 in Series A financing. Ten thousand U.S. dollars.

Galileo

Galileo has developed a machine learning data intelligence platform for unstructured data, allowing data scientists to inspect, discover and fix critical machine learning errors throughout the machine learning lifecycle.

In early November this year, Galileo launched the Galileo Community Edition, a free version of the platform, allowing data scientists engaged in natural language processing to build models more quickly using higher-quality training data.

Galileo came out of stealth mode when it received US$5.1 million in seed funding in May this year, and then on November 1, it received US$18 million in Series A financing led by Battery Ventures. Galileo's co-founders include CEO Vikram Chatterji, who was the head of cloud AI project management at Google; Atindriyo Sanyal, a former software engineer at Apple and Uber; and Yash Sheth, a former software engineer on Google's speech recognition system engineer.

Neuton

Founded in 2021, Neuton develops an automated no-code “tinyML” platform and other tools for developing tiny machine learning models that can be embedded in microcontrollers to enable the edge Devices become smart.

Neuton's technology is being used in a wide range of applications, including predictive maintenance of compressor water pumps, preventing grid overload, room occupancy detection, handwriting recognition on handheld devices, transmission failure prediction and water pollution monitoring equipment.

Pinecone

The vector database and search technology developed by Pinecone mainly provide support for artificial intelligence and machine learning applications. In October 2021, Pinecone launched Pinecone 2.0, taking the software from research laboratories to production applications.

Pinecone was founded in 2019 and was officially launched last year. It received US$10 million in seed round financing in January 2021 and US$28 million in Series A financing in March this year.

In October this year, Pinecone expanded its machine learning search infrastructure portfolio with the launch of a new “vector search” solution that combines semantic and keyword search capabilities.

Gartner named Pinecone a "Cool Vendor" in the field of artificial intelligence and machine learning data in 2021.

Predibase

In May, Predibase came out of stealth mode with a low-code machine learning platform that allows data scientists and non-experts to quickly develop applications with “best-in-class” machine learning infrastructure. Machine learning model. The software is currently in beta use at many Fortune 500 companies.

Predibase offers its technology as an alternative to traditional AutoML to develop machine learning models to solve real-world problems. The platform uses declarative machine learning, which Predibase says allows users to specify machine learning models as "configurations," or simple files, telling the system what the user wants and letting the system figure out the best way to meet that need.

Predibase was co-founded by CEO Piero Molino, Chief Technology Officer Travis Addair, Chief Product Officer Devvret Rishi, and Stanford University Associate Professor Chris Re. Both Molino and Addair once worked at Uber. While at Uber, the two developed the Ludwig open source framework for deep learning models, and the Horovod open source framework for extending and distributing deep learning model training to massive data (Predibase is built on Ludwig and Horovod.)

In May of this year, Predibase received US$16.5 million in seed and series A financing led by Greylock.

Snorkel AI

Snorkel was founded in 2019 and originated from the Stanford University Artificial Intelligence Laboratory, where the company’s five founders worked on solving the problem of lack of labeled training data for machine learning development Methods.

Snorkel developed Snorkel Flow, a data-centric system that became fully commercially available in March this year, by using programmatic tagging, a key step in data preparation and machine learning model development and training. ) to accelerate the development of artificial intelligence and machine learning.

Snorkel's valuation reached $1 billion in August 2021, when the startup secured $85 million in Series C funding, using the funds to grow its engineering and sales teams and accelerate its platform development.

Vectice

Vectice develops an automated data science knowledge capture and sharing solution. Vectice's technology automatically captures the assets data science teams create for projects, including datasets, code, models, notebooks, runs, and illustrations, and generates documentation throughout the project lifecycle, from business requirements to production deployment.

Vectice’s software is said to be designed to help enterprises manage transparency, governance and alignment with AI and machine learning projects and deliver consistent project results.

Vectice, founded in 2020 by CEO Cyril Brignone and CTO Gregory Haardt, raised $12.6 million in Series A funding in January this year, bringing total funding to $15.6 million.

Verta

Verta develops artificial intelligence/machine learning model management and operation software that data science and machine learning teams use to deploy, deploy, and machine learning models throughout the entire artificial intelligence and machine learning model lifecycle. Operations, management, and monitoring inherently complex models.

In August, Verta enhanced the enterprise capabilities of its MLOps platform, including adding a native integration ecosystem, additional capabilities around enterprise security, privacy and access control, and model risk management.

Verta was founded in 2018 and officially released in 2020. This year, it was named a "Cool Vendor" in the core AI technology field by Gartner.

The above is the detailed content of Year-end review: The ten hottest data science and machine learning startups in 2022. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
Sam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailSam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailApr 22, 2025 am 11:29 AM

Revolutionizing the Checkout Experience Sam's Club's innovative "Just Go" system builds on its existing AI-powered "Scan & Go" technology, allowing members to scan purchases via the Sam's Club app during their shopping trip.

Nvidia's AI Omniverse Expands At GTC 2025Nvidia's AI Omniverse Expands At GTC 2025Apr 22, 2025 am 11:28 AM

Nvidia's Enhanced Predictability and New Product Lineup at GTC 2025 Nvidia, a key player in AI infrastructure, is focusing on increased predictability for its clients. This involves consistent product delivery, meeting performance expectations, and

Exploring the Capabilities of Google's Gemma 2 ModelsExploring the Capabilities of Google's Gemma 2 ModelsApr 22, 2025 am 11:26 AM

Google's Gemma 2: A Powerful, Efficient Language Model Google's Gemma family of language models, celebrated for efficiency and performance, has expanded with the arrival of Gemma 2. This latest release comprises two models: a 27-billion parameter ver

The Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaThe Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaApr 22, 2025 am 11:21 AM

This Leading with Data episode features Dr. Kirk Borne, a leading data scientist, astrophysicist, and TEDx speaker. A renowned expert in big data, AI, and machine learning, Dr. Borne offers invaluable insights into the current state and future traje

AI For Runners And Athletes: We're Making Excellent ProgressAI For Runners And Athletes: We're Making Excellent ProgressApr 22, 2025 am 11:12 AM

There were some very insightful perspectives in this speech—background information about engineering that showed us why artificial intelligence is so good at supporting people’s physical exercise. I will outline a core idea from each contributor’s perspective to demonstrate three design aspects that are an important part of our exploration of the application of artificial intelligence in sports. Edge devices and raw personal data This idea about artificial intelligence actually contains two components—one related to where we place large language models and the other is related to the differences between our human language and the language that our vital signs “express” when measured in real time. Alexander Amini knows a lot about running and tennis, but he still

Jamie Engstrom On Technology, Talent And Transformation At CaterpillarJamie Engstrom On Technology, Talent And Transformation At CaterpillarApr 22, 2025 am 11:10 AM

Caterpillar's Chief Information Officer and Senior Vice President of IT, Jamie Engstrom, leads a global team of over 2,200 IT professionals across 28 countries. With 26 years at Caterpillar, including four and a half years in her current role, Engst

New Google Photos Update Makes Any Photo Pop With Ultra HDR QualityNew Google Photos Update Makes Any Photo Pop With Ultra HDR QualityApr 22, 2025 am 11:09 AM

Google Photos' New Ultra HDR Tool: A Quick Guide Enhance your photos with Google Photos' new Ultra HDR tool, transforming standard images into vibrant, high-dynamic-range masterpieces. Ideal for social media, this tool boosts the impact of any photo,

What are the TCL Commands in SQL? - Analytics VidhyaWhat are the TCL Commands in SQL? - Analytics VidhyaApr 22, 2025 am 11:07 AM

Introduction Transaction Control Language (TCL) commands are essential in SQL for managing changes made by Data Manipulation Language (DML) statements. These commands allow database administrators and users to control transaction processes, thereby

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Atom editor mac version download

Atom editor mac version download

The most popular open source editor