You Yang's team wins an AAAI 2023 Distinguished Paper Award: training a model 72 times faster on a single V100
This article is reprinted with authorization from QbitAI (WeChat public account ID: QbitAI). Please contact the source for reprint permission.
You Yang, who holds a Ph.D. from UC Berkeley and is a Presidential Young Professor at the National University of Singapore, has just announced some news: his team has won an AAAI 2023 Distinguished Paper Award!
The award-winning work speeds up model training by a factor of 72.
Even netizens marveled after reading the paper: "From 12 hours to 10 minutes, niu!" (The original comment is a Chinese pun on "niu", literally "cow", slang for impressive, and the professor's surname, You.)
During his studies, Dr. You Yang set world records for ImageNet and BERT training speed, and the algorithms he designed are widely used by technology giants such as Google, Microsoft, Intel, and NVIDIA.
Now, a year and a half after returning to China to found his own company, Luchen Technology, what kind of algorithm did he and his team come up with to win such an honor at a top AI conference?
In this work, You Yang's team proposed an optimization strategy called CowClip, which effectively accelerates large-batch training of CTR prediction models.
CTR (click-through rate) prediction models are a core algorithm in personalized recommendation scenarios. They typically need to learn from user feedback (clicks, favorites, purchases, and so on), and the volume of such data generated online every day is enormous. Speeding up the training of CTR prediction models is therefore crucial.
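For readers unfamiliar with the setup, below is a minimal, illustrative CTR model in PyTorch: sparse categorical features (user id, item id, and so on) are looked up in embedding tables and fed to a small MLP that outputs a click probability. The field names, table sizes, and layer widths are made-up assumptions for illustration; real models such as DeepFM are considerably more elaborate.

```python
import torch
import torch.nn as nn

class TinyCTRModel(nn.Module):
    """Illustrative CTR model: embed sparse feature ids, predict a click probability."""

    def __init__(self, num_ids_per_field=(10_000, 50_000, 1_000), embed_dim=16):
        super().__init__()
        # one embedding table per categorical field (e.g. user id, item id, category)
        self.embeddings = nn.ModuleList(
            nn.Embedding(n, embed_dim) for n in num_ids_per_field
        )
        self.mlp = nn.Sequential(
            nn.Linear(embed_dim * len(num_ids_per_field), 64),
            nn.ReLU(),
            nn.Linear(64, 1),
        )

    def forward(self, field_ids):
        # field_ids: (batch, num_fields) integer tensor of feature ids
        vecs = [emb(field_ids[:, i]) for i, emb in enumerate(self.embeddings)]
        logits = self.mlp(torch.cat(vecs, dim=1))
        return torch.sigmoid(logits).squeeze(-1)  # predicted click-through rate
```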
A common way to speed up training is to enlarge the batch size, but if the batch is too large, model accuracy tends to drop. Through mathematical analysis, the team showed that the learning rate for infrequent features should not be scaled up when the batch size is enlarged. With their proposed CowClip, the batch size can then be expanded simply and effectively.
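The sketch below illustrates the general idea of per-id gradient clipping for the embedding table, as a rough interpretation of the approach described above: each feature id's gradient is clipped relative to the norm of that id's current embedding weights, so rare ids are not over-updated at large batch sizes. The function name, hyperparameters (clip_ratio, zeta), and the exact clipping formula are assumptions for illustration, not the paper's precise algorithm.

```python
import torch

def clip_embedding_grads(embedding_weight, embedding_grad, clip_ratio=1.0, zeta=1e-4):
    """Sketch of CowClip-style per-row clipping for an embedding table's gradients."""
    # per-row L2 norms of weights and gradients, shape: (num_ids, 1)
    w_norm = embedding_weight.norm(dim=1, keepdim=True)
    g_norm = embedding_grad.norm(dim=1, keepdim=True)
    # per-row clip threshold: proportional to the weight norm, floored at zeta
    threshold = clip_ratio * torch.clamp(w_norm, min=zeta)
    # scale factor <= 1 wherever the gradient norm exceeds the threshold
    scale = torch.clamp(threshold / (g_norm + 1e-12), max=1.0)
    return embedding_grad * scale
```

In such a setup, the clipped gradients would be written back to the embedding table before the optimizer step, and, in line with the finding above, the embedding (infrequent-feature) learning rate would be kept unscaled while only the dense layers' learning rate grows with the batch size.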
In tests on four CTR prediction models and two datasets, the team successfully expanded the original batch size by 128 times without any loss of accuracy.
In particular, on DeepFM, CowClip achieves an AUC improvement of more than 0.1% while expanding the batch size from 1K to 128K.
On a single V100 GPU, training time drops from the original 12 hours to just 10 minutes, a 72-fold speedup.
Currently, the project code is open source. The team says the algorithm is also suitable for tasks such as NLP.
The paper's first author is Zheng Zangwei, a doctoral student of You Yang. He received his bachelor's degree from the elite computer science program at Nanjing University and is pursuing his Ph.D. at the National University of Singapore. His research interests include machine learning, computer vision, and high-performance computing.