


You Yang's team obtained new results in the AAAI 2023 Outstanding Paper Award, using a single V100 to train the model 72 times faster
This article is reprinted with the authorization of AI New Media Qubit (public account ID: QbitAI). Please contact the source for reprinting.
Just now, Young Professor You Yang, Ph.D. from UC Berkeley and President of the National University of Singapore, released the latest news——
won the AAAI 2023Outstanding Paper Award(Distinguished Paper)!
The research results increase the training speed of the model by 72 times at one time.
Even netizens sighed after reading the paper:
From 12 hours to 10 minutes, tender cow(you cow)ah!
Dr. You Yang once set the world record for ImageNet and BERT training speed during his studies.
The algorithms he designed are also widely used in technology giants such as Google, Microsoft, Intel, and NVIDIA.
Now, he has returned to China to start his own businessLuchen Technology After a year and a half, what kind of algorithm did he and his team come up with to win such an honor at the top AI conference?
Training time from 12 hours to 10 minutes
In this study, You Yang’s team proposed an optimization strategyCowClip, which can accelerate the development of the CTR prediction model Batch training.
CTR(click-through rate) The prediction model is a commonly used algorithm in personalized recommendation scenarios.
It usually needs to learn user feedback (clicks, collections, purchases, etc.), and the amount of data generated online every day is unprecedentedly huge.
Therefore, it is crucial to speed up the training of the CTR prediction model.
Generally speaking, batch training is used to increase the training speed, but if the batch size is too large, the accuracy of the model will be reduced.
Through mathematical analysis, the team proved that the learning rate for infrequent features (learning rate for infrequent features) should not be scaled when expanding the batch.
With their proposed CowClip, the batch size can be easily and effectively expanded.
The team successfully expanded the original batch size by testing on 4 CTR prediction models and 2 data sets128 Times, without causing any loss of accuracy.
Especially on DeepFM, CowClip achieves more than 0.1% improvement in AUC by expanding the batch size from 1K to 128K.
And on a single V100 GPU, the training time is shortened from the original 12 hours to just 10 minutes, and the training speed is 72 times.
Currently, the project code is open source. The team says the algorithm is also suitable for tasks such as NLP.
Team Introduction
The first author of this article is You Yang’s doctoral student Zheng Zangwei. He graduated from the Computer Elite Class of Nanjing University with a bachelor’s degree and a Ph.D. from the National University of Singapore.
His research directions include machine learning, computer vision and high-performance computing.
The above is the detailed content of You Yang's team obtained new results in the AAAI 2023 Outstanding Paper Award, using a single V100 to train the model 72 times faster. For more information, please follow other related articles on the PHP Chinese website!

Harnessing the Power of Data Visualization with Microsoft Power BI Charts In today's data-driven world, effectively communicating complex information to non-technical audiences is crucial. Data visualization bridges this gap, transforming raw data i

Expert Systems: A Deep Dive into AI's Decision-Making Power Imagine having access to expert advice on anything, from medical diagnoses to financial planning. That's the power of expert systems in artificial intelligence. These systems mimic the pro

First of all, it’s apparent that this is happening quickly. Various companies are talking about the proportions of their code that are currently written by AI, and these are increasing at a rapid clip. There’s a lot of job displacement already around

The film industry, alongside all creative sectors, from digital marketing to social media, stands at a technological crossroad. As artificial intelligence begins to reshape every aspect of visual storytelling and change the landscape of entertainment

ISRO's Free AI/ML Online Course: A Gateway to Geospatial Technology Innovation The Indian Space Research Organisation (ISRO), through its Indian Institute of Remote Sensing (IIRS), is offering a fantastic opportunity for students and professionals to

Local Search Algorithms: A Comprehensive Guide Planning a large-scale event requires efficient workload distribution. When traditional approaches fail, local search algorithms offer a powerful solution. This article explores hill climbing and simul

The release includes three distinct models, GPT-4.1, GPT-4.1 mini and GPT-4.1 nano, signaling a move toward task-specific optimizations within the large language model landscape. These models are not immediately replacing user-facing interfaces like

Chip giant Nvidia said on Monday it will start manufacturing AI supercomputers— machines that can process copious amounts of data and run complex algorithms— entirely within the U.S. for the first time. The announcement comes after President Trump si


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Dreamweaver Mac version
Visual web development tools

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

SublimeText3 Chinese version
Chinese version, very easy to use

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool