search
HomeTechnology peripheralsAITsinghua University releases CurML, the first open source library for course learning

In the development process of machine learning, human learning methods often inspire the design of various algorithms. As an important paradigm of human learning, learning through courses has been borrowed from machine learning to form a research direction called Curriculum Learning.

Generally speaking, human education is completed through highly organized courses. Each course or subject will start with simple content and gradually present students with more complex concepts. For example, before accepting the concepts of calculus in college, a student should first learn arithmetic in elementary school, functions in middle school, and derivatives in high school. However, unlike human education, the training of traditional machine learning models involves randomly inputting data samples into the model, ignoring the different complexities between data samples and the current learning status of the model. Therefore, curriculum learning was proposed in the field of machine learning precisely to imitate human learning from easy to difficult, provide better training strategies for the model, and thereby improve the performance of the model.

Tsinghua University releases CurML, the first open source library for course learning

Course Learning Concept Map

Currently Curriculum learning has been widely used in various tasks of machine learning, including image classification, target detection, semantic segmentation, machine translation, audio recognition, audio enhancement, video question and answer, etc. It is also used in supervised, unsupervised and semi-supervised learning and Reinforcement learning and other scenarios have received a lot of attention and research.

As the applications and scenarios of course learning become increasingly rich, it is particularly necessary to conduct a detailed sorting and summary in this field, so as to promote in-depth exploration by researchers and improve the application experience of users. .

Therefore, based on the accumulation and foundation of publishing a number of academic papers on course learning, the Media and Network Big Data Laboratory led by Professor Zhu Wenwu of Tsinghua University, laboratory member Wang Xin IEEE TPAMI published a review paper on curriculum learning, and the laboratory further released the world's first open source library for curriculum learning, CurML (Curriculum Machine Learning).

Professor Zhu Wenwu and Assistant Researcher Wang Xin’s curriculum learning research work includes curriculum meta-learning method applied to urban interest location recommendation, curriculum decoupling product recommendation based on noisy multiple feedback information , Shared parameter neural architecture search based on course learning, and combinatorial optimization problem solving based on course difficulty adaptation, etc. Research results have been published in high-level international machine learning conferences such as SIGKDD, NeurIPS, and ACM MM.

Tsinghua University releases CurML, the first open source library for course learning

##Framework diagram of some research results

The course learning review paper comprehensively reviews all aspects of course learning, such as the emergence, definition, theory and application, designs a unified course learning framework, and divides course learning algorithms into two major categories and multiple categories based on the core components within the framework. This subcategory distinguishes the differences and connections between curriculum learning and other machine learning concepts, and points out the challenges faced by this field and possible future research directions.

Tsinghua University releases CurML, the first open source library for course learning

Course Learning Method Classification

Course The learning open source library CurML is a support platform for course learning algorithms. It has integrated more than ten course learning algorithms and supports both noisy and non-noisy application scenarios, making it easier for researchers and users to reproduce, evaluate, compare and select course learning. algorithm.

The main module of CurML is CL Trainer, which consists of two sub-modules Model Trainer and CL Algorithm. The two interact through five interface functions to realize a machine for course learning guidance. learning process.

Tsinghua University releases CurML, the first open source library for course learning

CurML framework diagram

Main module: CL Trainer

This module is the main part of the entire open source library. By calling this module, users can implement the course learning algorithm with just a few lines of code. After given the data set, model and hyperparameters, the module will train for a certain period of time and output the trained model parameters and test results of the task. This module is mainly designed to meet the requirements of ease of use, so it is highly encapsulated and provided to users who want to use the course learning algorithm but do not care about the specific implementation details.

Sub-module 1: Model Trainer

This module is used to complete the general machine learning process, such as training An image classifier or a language model. At the same time, it reserves positions for five interface functions for interacting with the second sub-module CL Algorithm, and also supports custom input functions.

Sub-module 2: CL Algorithm

This module encapsulates all course learning algorithms supported by CurML. As shown in the following table:

Tsinghua University releases CurML, the first open source library for course learning

The module is implemented through five interface functions, which are used to obtain data and model information from the machine learning process. and the learning strategy to guide the model, as shown in the figure below.

Tsinghua University releases CurML, the first open source library for course learning

##CurML flow chart

Interface function: data_prepare

This function is used to provide data set information from the Model Trainer module to the CL Algorithm module. Many course learning algorithms require an overall understanding of the data set in order to better judge the difficulty of the data sample, so this interface function is necessary.

Interface function: model_prepare

This function is very similar to data_prepare, the difference is that it does not transfer data set information It is information related to model training, such as model architecture, parameter optimizer, learning rate adjuster, etc. Many course learning algorithms guide machine learning by adjusting these elements.

Interface function: data_curriculum

This function is used to calculate the difficulty of the data sample, and based on the difficulty of the data and the current Model state provides the model with appropriate data, and most courses have similar ideas.

Interface function: model_curriculum

This function is used to update the model and adjust the model’s accuracy obtained from the data sample. The amount of information indirectly guides the learning of the model. Currently, the number of such algorithms is still small, but CurML also supports the implementation of such algorithms.

Interface function: loss_curriculum

This function is used to reweight the loss function value, and the indirect adjustment is different The impact of data on the model. This type of algorithm is more common in course learning, because the weighting of the loss value is essentially a soft sampling of the data.

Through a summary of more than ten course learning methods in recent years, different types of course learning algorithms can be unified and implemented using the above modules and interface parameters, so that they can be used in fair scenarios and Evaluate, compare and select course learning algorithms under the task.

Future Outlook

CurML’s R&D team stated that they will continue to update this open source library in the future to provide further support for the development and application of course learning.

Related links:

  • CurML open source code library link: https://github.com/THUMNLab/CurML
  • CurML open source software paper link: https://dl.acm.org/doi/pdf/10.1145/3503161.3548549
  • Course learning summary paper link: https://ieeexplore.ieee.org/abstract/document/9392296/
  • Course meta-learning paper link: https://dl.acm.org/doi/abs/10.1145/ 3447548.3467132
  • Course decoupling learning paper link: https://proceedings.neurips.cc/paper/2021/file/e242660df1b69b74dcc7fde711f924ff-Paper.pdf
  • Course neural architecture search paper link: https://dl.acm.org/doi/abs/10.1145/3503161.3548271
  • ##Course difficulty adaptive paper link: https:// ojs.aaai.org/index.php/AAAI/article/download/20899/version/19196/20658

The above is the detailed content of Tsinghua University releases CurML, the first open source library for course learning. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
You Must Build Workplace AI Behind A Veil Of IgnoranceYou Must Build Workplace AI Behind A Veil Of IgnoranceApr 29, 2025 am 11:15 AM

In John Rawls' seminal 1971 book The Theory of Justice, he proposed a thought experiment that we should take as the core of today's AI design and use decision-making: the veil of ignorance. This philosophy provides a simple tool for understanding equity and also provides a blueprint for leaders to use this understanding to design and implement AI equitably. Imagine that you are making rules for a new society. But there is a premise: you don’t know in advance what role you will play in this society. You may end up being rich or poor, healthy or disabled, belonging to a majority or marginal minority. Operating under this "veil of ignorance" prevents rule makers from making decisions that benefit themselves. On the contrary, people will be more motivated to formulate public

Decisions, Decisions… Next Steps For Practical Applied AIDecisions, Decisions… Next Steps For Practical Applied AIApr 29, 2025 am 11:14 AM

Numerous companies specialize in robotic process automation (RPA), offering bots to automate repetitive tasks—UiPath, Automation Anywhere, Blue Prism, and others. Meanwhile, process mining, orchestration, and intelligent document processing speciali

The Agents Are Coming – More On What We Will Do Next To AI PartnersThe Agents Are Coming – More On What We Will Do Next To AI PartnersApr 29, 2025 am 11:13 AM

The future of AI is moving beyond simple word prediction and conversational simulation; AI agents are emerging, capable of independent action and task completion. This shift is already evident in tools like Anthropic's Claude. AI Agents: Research a

Why Empathy Is More Important Than Control For Leaders In An AI-Driven FutureWhy Empathy Is More Important Than Control For Leaders In An AI-Driven FutureApr 29, 2025 am 11:12 AM

Rapid technological advancements necessitate a forward-looking perspective on the future of work. What happens when AI transcends mere productivity enhancement and begins shaping our societal structures? Topher McDougal's upcoming book, Gaia Wakes:

AI For Product Classification: Can Machines Master Tax Law?AI For Product Classification: Can Machines Master Tax Law?Apr 29, 2025 am 11:11 AM

Product classification, often involving complex codes like "HS 8471.30" from systems such as the Harmonized System (HS), is crucial for international trade and domestic sales. These codes ensure correct tax application, impacting every inv

Could Data Center Demand Spark A Climate Tech Rebound?Could Data Center Demand Spark A Climate Tech Rebound?Apr 29, 2025 am 11:10 AM

The future of energy consumption in data centers and climate technology investment This article explores the surge in energy consumption in AI-driven data centers and its impact on climate change, and analyzes innovative solutions and policy recommendations to address this challenge. Challenges of energy demand: Large and ultra-large-scale data centers consume huge power, comparable to the sum of hundreds of thousands of ordinary North American families, and emerging AI ultra-large-scale centers consume dozens of times more power than this. In the first eight months of 2024, Microsoft, Meta, Google and Amazon have invested approximately US$125 billion in the construction and operation of AI data centers (JP Morgan, 2024) (Table 1). Growing energy demand is both a challenge and an opportunity. According to Canary Media, the looming electricity

AI And Hollywood's Next Golden AgeAI And Hollywood's Next Golden AgeApr 29, 2025 am 11:09 AM

Generative AI is revolutionizing film and television production. Luma's Ray 2 model, as well as Runway's Gen-4, OpenAI's Sora, Google's Veo and other new models, are improving the quality of generated videos at an unprecedented speed. These models can easily create complex special effects and realistic scenes, even short video clips and camera-perceived motion effects have been achieved. While the manipulation and consistency of these tools still need to be improved, the speed of progress is amazing. Generative video is becoming an independent medium. Some models are good at animation production, while others are good at live-action images. It is worth noting that Adobe's Firefly and Moonvalley's Ma

Is ChatGPT Slowly Becoming AI's Biggest Yes-Man?Is ChatGPT Slowly Becoming AI's Biggest Yes-Man?Apr 29, 2025 am 11:08 AM

ChatGPT user experience declines: is it a model degradation or user expectations? Recently, a large number of ChatGPT paid users have complained about their performance degradation, which has attracted widespread attention. Users reported slower responses to models, shorter answers, lack of help, and even more hallucinations. Some users expressed dissatisfaction on social media, pointing out that ChatGPT has become “too flattering” and tends to verify user views rather than provide critical feedback. This not only affects the user experience, but also brings actual losses to corporate customers, such as reduced productivity and waste of computing resources. Evidence of performance degradation Many users have reported significant degradation in ChatGPT performance, especially in older models such as GPT-4 (which will soon be discontinued from service at the end of this month). this

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Atom editor mac version download

Atom editor mac version download

The most popular open source editor

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools