search
HomeTechnology peripheralsAITsinghua University releases CurML, the first open source library for course learning

In the development process of machine learning, human learning methods often inspire the design of various algorithms. As an important paradigm of human learning, learning through courses has been borrowed from machine learning to form a research direction called Curriculum Learning.

Generally speaking, human education is completed through highly organized courses. Each course or subject will start with simple content and gradually present students with more complex concepts. For example, before accepting the concepts of calculus in college, a student should first learn arithmetic in elementary school, functions in middle school, and derivatives in high school. However, unlike human education, the training of traditional machine learning models involves randomly inputting data samples into the model, ignoring the different complexities between data samples and the current learning status of the model. Therefore, curriculum learning was proposed in the field of machine learning precisely to imitate human learning from easy to difficult, provide better training strategies for the model, and thereby improve the performance of the model.

Tsinghua University releases CurML, the first open source library for course learning

Course Learning Concept Map

Currently Curriculum learning has been widely used in various tasks of machine learning, including image classification, target detection, semantic segmentation, machine translation, audio recognition, audio enhancement, video question and answer, etc. It is also used in supervised, unsupervised and semi-supervised learning and Reinforcement learning and other scenarios have received a lot of attention and research.

As the applications and scenarios of course learning become increasingly rich, it is particularly necessary to conduct a detailed sorting and summary in this field, so as to promote in-depth exploration by researchers and improve the application experience of users. .

Therefore, based on the accumulation and foundation of publishing a number of academic papers on course learning, the Media and Network Big Data Laboratory led by Professor Zhu Wenwu of Tsinghua University, laboratory member Wang Xin IEEE TPAMI published a review paper on curriculum learning, and the laboratory further released the world's first open source library for curriculum learning, CurML (Curriculum Machine Learning).

Professor Zhu Wenwu and Assistant Researcher Wang Xin’s curriculum learning research work includes curriculum meta-learning method applied to urban interest location recommendation, curriculum decoupling product recommendation based on noisy multiple feedback information , Shared parameter neural architecture search based on course learning, and combinatorial optimization problem solving based on course difficulty adaptation, etc. Research results have been published in high-level international machine learning conferences such as SIGKDD, NeurIPS, and ACM MM.

Tsinghua University releases CurML, the first open source library for course learning

##Framework diagram of some research results

The course learning review paper comprehensively reviews all aspects of course learning, such as the emergence, definition, theory and application, designs a unified course learning framework, and divides course learning algorithms into two major categories and multiple categories based on the core components within the framework. This subcategory distinguishes the differences and connections between curriculum learning and other machine learning concepts, and points out the challenges faced by this field and possible future research directions.

Tsinghua University releases CurML, the first open source library for course learning

Course Learning Method Classification

Course The learning open source library CurML is a support platform for course learning algorithms. It has integrated more than ten course learning algorithms and supports both noisy and non-noisy application scenarios, making it easier for researchers and users to reproduce, evaluate, compare and select course learning. algorithm.

The main module of CurML is CL Trainer, which consists of two sub-modules Model Trainer and CL Algorithm. The two interact through five interface functions to realize a machine for course learning guidance. learning process.

Tsinghua University releases CurML, the first open source library for course learning

CurML framework diagram

Main module: CL Trainer

This module is the main part of the entire open source library. By calling this module, users can implement the course learning algorithm with just a few lines of code. After given the data set, model and hyperparameters, the module will train for a certain period of time and output the trained model parameters and test results of the task. This module is mainly designed to meet the requirements of ease of use, so it is highly encapsulated and provided to users who want to use the course learning algorithm but do not care about the specific implementation details.

Sub-module 1: Model Trainer

This module is used to complete the general machine learning process, such as training An image classifier or a language model. At the same time, it reserves positions for five interface functions for interacting with the second sub-module CL Algorithm, and also supports custom input functions.

Sub-module 2: CL Algorithm

This module encapsulates all course learning algorithms supported by CurML. As shown in the following table:

Tsinghua University releases CurML, the first open source library for course learning

The module is implemented through five interface functions, which are used to obtain data and model information from the machine learning process. and the learning strategy to guide the model, as shown in the figure below.

Tsinghua University releases CurML, the first open source library for course learning

##CurML flow chart

Interface function: data_prepare

This function is used to provide data set information from the Model Trainer module to the CL Algorithm module. Many course learning algorithms require an overall understanding of the data set in order to better judge the difficulty of the data sample, so this interface function is necessary.

Interface function: model_prepare

This function is very similar to data_prepare, the difference is that it does not transfer data set information It is information related to model training, such as model architecture, parameter optimizer, learning rate adjuster, etc. Many course learning algorithms guide machine learning by adjusting these elements.

Interface function: data_curriculum

This function is used to calculate the difficulty of the data sample, and based on the difficulty of the data and the current Model state provides the model with appropriate data, and most courses have similar ideas.

Interface function: model_curriculum

This function is used to update the model and adjust the model’s accuracy obtained from the data sample. The amount of information indirectly guides the learning of the model. Currently, the number of such algorithms is still small, but CurML also supports the implementation of such algorithms.

Interface function: loss_curriculum

This function is used to reweight the loss function value, and the indirect adjustment is different The impact of data on the model. This type of algorithm is more common in course learning, because the weighting of the loss value is essentially a soft sampling of the data.

Through a summary of more than ten course learning methods in recent years, different types of course learning algorithms can be unified and implemented using the above modules and interface parameters, so that they can be used in fair scenarios and Evaluate, compare and select course learning algorithms under the task.

Future Outlook

CurML’s R&D team stated that they will continue to update this open source library in the future to provide further support for the development and application of course learning.

Related links:

  • CurML open source code library link: https://github.com/THUMNLab/CurML
  • CurML open source software paper link: https://dl.acm.org/doi/pdf/10.1145/3503161.3548549
  • Course learning summary paper link: https://ieeexplore.ieee.org/abstract/document/9392296/
  • Course meta-learning paper link: https://dl.acm.org/doi/abs/10.1145/ 3447548.3467132
  • Course decoupling learning paper link: https://proceedings.neurips.cc/paper/2021/file/e242660df1b69b74dcc7fde711f924ff-Paper.pdf
  • Course neural architecture search paper link: https://dl.acm.org/doi/abs/10.1145/3503161.3548271
  • ##Course difficulty adaptive paper link: https:// ojs.aaai.org/index.php/AAAI/article/download/20899/version/19196/20658

The above is the detailed content of Tsinghua University releases CurML, the first open source library for course learning. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
YOLOv6又快又准的目标检测框架已经开源了YOLOv6又快又准的目标检测框架已经开源了May 09, 2023 pm 02:52 PM

作者:楚怡、凯衡等近日,美团视觉智能部研发了一款致力于工业应用的目标检测框架YOLOv6,能够同时专注于检测的精度和推理效率。在研发过程中,视觉智能部不断进行了探索和优化,同时吸取借鉴了学术界和工业界的一些前沿进展和科研成果。在目标检测权威数据集COCO上的实验结果显示,YOLOv6在检测精度和速度方面均超越其他同体量的算法,同时支持多种不同平台的部署,极大简化工程部署时的适配工作。特此开源,希望能帮助到更多的同学。1.概述YOLOv6是美团视觉智能部研发的一款目标检测框架,致力于工业应用。

MLC LLM:开源AI聊天机器人,支持离线运行,适用于集成显卡电脑和iPhone。MLC LLM:开源AI聊天机器人,支持离线运行,适用于集成显卡电脑和iPhone。May 06, 2023 pm 03:46 PM

5月2日消息,目前大多数AI聊天机器人都需要连接到云端进行处理,即使可以本地运行的也配置要求极高。那么是否有轻量化的、无需联网的聊天机器人呢?一个名为MLCLLM的全新开源项目已在GitHub上线,完全本地运行无需联网,甚至集显老电脑、苹果iPhone手机都能运行。MLCLLM项目介绍称:“MLCLLM是一种通用解决方案,它允许将任何语言模型本地部署在一组不同的硬件后端和本地应用程序上,此外还有一个高效的框架,供每个人进一步优化自己用例的模型性能。一切都在本地运行,无需服务器支持,并通过手机和笔

基于开源的 ChatGPT Web UI 项目,快速构建属于自己的 ChatGPT 站点基于开源的 ChatGPT Web UI 项目,快速构建属于自己的 ChatGPT 站点Apr 15, 2023 pm 07:43 PM

作为一个技术博主,了不起比较喜欢各种折腾,之前给大家介绍过ChatGPT​接入微信,钉钉和知识星球(如果没看过的可以翻翻前面的文章),最近再看开源项目的时候,发现了一个ChatGPTWebUI项目。想着刚好之前没有将ChatGPT​接入过WebUI,有了这个开源项目可以拿来使用,真是不错,下面是实操的安装步骤,分享给大家。安装官方在Github​的项目文档上提供了很多中的安装方式,包括手动安装,docker​部署,以及远程部署等方法,了不起在选择部署方式的时候,一开始为了简单想着

仅需1% Embedding参数,硬件成本降低十倍,开源方案单GPU训练超大推荐模型仅需1% Embedding参数,硬件成本降低十倍,开源方案单GPU训练超大推荐模型Apr 12, 2023 pm 03:46 PM

深度推荐模型(DLRMs)已经成为深度学习在互联网公司应用的最重要技术场景,如视频推荐、购物搜索、广告推送等流量变现业务,极大改善了用户体验和业务商业价值。但海量的用户和业务数据,频繁地迭代更新需求,以及高昂的训练成本,都对 DLRM 训练提出了严峻挑战。在 DLRM 中,需要先在嵌入表(EmbeddingBags)中进行查表(lookup),再完成下游计算。嵌入表常常贡献 DLRM 中 99% 以上的内存需求,却只贡献 1% 的计算量。借助于 GPU 片上高速内存(High Bandwidth

用图像对齐所有模态,Meta开源多感官AI基础模型,实现大一统用图像对齐所有模态,Meta开源多感官AI基础模型,实现大一统May 11, 2023 pm 07:25 PM

在人类的感官中,一张图片可以将很多体验融合到一起,比如一张海滩图片可以让我们想起海浪的声音、沙子的质地、拂面而来的微风,甚至可以激发创作一首诗的灵感。图像的这种「绑定」(binding)属性通过与自身相关的任何感官体验对齐,为学习视觉特征提供了大量监督来源。理想情况下,对于单个联合嵌入空间,视觉特征应该通过对齐所有感官来学习。然而这需要通过同一组图像来获取所有感官类型和组合的配对数据,显然不可行。最近,很多方法学习与文本、音频等对齐的图像特征。这些方法使用单对模态或者最多几种视觉模态。最终嵌入仅

首个大众可用PyTorch版AlphaFold2复现,哥大开源,star量破千首个大众可用PyTorch版AlphaFold2复现,哥大开源,star量破千Apr 13, 2023 am 09:58 AM

刚刚,哥伦比亚大学系统生物学助理教授 Mohammed AlQuraishi 在推特上宣布,他们从头训练了一个名为 OpenFold 的模型,该模型是 AlphaFold2 的可训练 PyTorch 复现版本。Mohammed AlQuraishi 还表示,这是第一个大众可用的 AlphaFold2 复现。AlphaFold2 可以周期性地以原子精度预测蛋白质结构,在技术上利用多序列对齐和深度学习算法设计,并结合关于蛋白质结构的物理和生物学知识提升了预测效果。它实现了 2/3 蛋白质结构预测的卓

Stable Diffusion-XL开启公测,让你摆脱繁琐的长prompt!Stable Diffusion-XL开启公测,让你摆脱繁琐的长prompt!Apr 23, 2023 am 10:16 AM

自从Midjourney发布v5之后,在生成图像的人物真实程度、手指细节等方面都有了显著改善,并且在prompt理解的准确性、审美多样性和语言理解方面也都取得了进步。相比之下,StableDiffusion虽然免费、开源,但每次都要写一大长串的prompt,想生成高质量的图像全靠多次抽卡。最近StabilityAI的官宣,正在研发的StableDiffusionXL开始面向公众测试,目前可以在Clipdrop平台免费试用。试用链接:​https://clipdrop.co/stable-diff

基于PyTorch、易上手,细粒度图像识别深度学习工具库Hawkeye开源基于PyTorch、易上手,细粒度图像识别深度学习工具库Hawkeye开源Apr 12, 2023 pm 08:43 PM

细粒度图像识别 [1] 是视觉感知学习的重要研究课题,在智能新经济和工业互联网等方面具有巨大应用价值,且在诸多现实场景已有广泛应用…… 鉴于当前领域内尚缺乏该方面的深度学习开源工具库,南京理工大学魏秀参教授团队用时近一年时间,开发、打磨、完成了 Hawkeye——细粒度图像识别深度学习开源工具库,供相关领域研究人员和工程师参考使用。本文是对 Hawkeye 的详细介绍。1.什么是 Hawkeye 库Hawkeye 是一个基于 PyTorch 的细粒度图像识别深度学习工具库,专为相关领域研究人员和

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Tools

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!