search
Classify:AIIt Industry

Hand-tearing Llama3 layer 1: Implementing llama3 from scratch

Release:2024-06-01 17:45:42
Hand-tearing Llama3 layer 1: Implementing llama3 from scratch

Simple and universal: the visual basic network accelerates lossless training by up to 3 times, Tsinghua EfficientTrain++ was selected for TPAMI 2024

Release:2024-06-01 17:41:29
Simple and universal: the visual basic network accelerates lossless training by up to 3 times, Tsinghua EfficientTrain++ was selected for TPAMI 2024

Comprehensively surpassing DPO: Chen Danqi's team proposed simple preference optimization SimPO, and also refined the strongest 8B open source model

Release:2024-06-01 16:41:36
Comprehensively surpassing DPO: Chen Danqi's team proposed simple preference optimization SimPO, and also refined the strongest 8B open source model

The author of ControlNet's new work is a hit: P photos can be changed into backgrounds without asking for help, and AI lighting is perfectly integrated

Release:2024-06-01 16:23:10
The author of ControlNet's new work is a hit: P photos can be changed into backgrounds without asking for help, and AI lighting is perfectly integrated

A new milestone in controllable nuclear fusion, AI achieves fully automatic optimization of dual tokamak 3D field for the first time, published in Nature sub-issue

Release:2024-06-01 15:57:53
A new milestone in controllable nuclear fusion, AI achieves fully automatic optimization of dual tokamak 3D field for the first time, published in Nature sub-issue

Palm Reading Technology joins hands with Amazon Cloud Technology to reshape the reading experience with the power of generative AI

Release:2024-06-01 15:02:07
Palm Reading Technology joins hands with Amazon Cloud Technology to reshape the reading experience with the power of generative AI

Li Feifei reveals the entrepreneurial direction of 'spatial intelligence': visualization turns into insight, seeing becomes understanding, and understanding leads to action

Release:2024-06-01 14:55:34
Li Feifei reveals the entrepreneurial direction of 'spatial intelligence': visualization turns into insight, seeing becomes understanding, and understanding leads to action

Tencent Hunyuan upgrades model matrix, launching 256k long text model on the cloud​

Release:2024-06-01 13:46:36
Tencent Hunyuan upgrades model matrix, launching 256k long text model on the cloud​

Working with Amazon Cloud Technology, Beijing Lingao Technology helps enterprises seamlessly combine large models and data​

Release:2024-06-01 12:48:21
Working with Amazon Cloud Technology, Beijing Lingao Technology helps enterprises seamlessly combine large models and data​

The $1 million prize from the Clay Institute will go to AI. The rules of the mathematics world have changed drastically. How will mathematicians deal with 'massive conjectures' in the future?

Release:2024-06-01 11:02:46
The $1 million prize from the Clay Institute will go to AI. The rules of the mathematics world have changed drastically. How will mathematicians deal with 'massive conjectures' in the future?

This article will take you to understand SHAP: model explanation for machine learning

Release:2024-06-01 10:58:13
This article will take you to understand SHAP: model explanation for machine learning

At CCIG2024, Hehe Information document analysis technology solves the 'famine' problem of large model corpus

Release:2024-05-31 22:28:49
At CCIG2024, Hehe Information document analysis technology solves the 'famine' problem of large model corpus

Tencent Cloud AI Code Assistant is fully open to the public

Release:2024-05-31 20:08:24
Tencent Cloud AI Code Assistant is fully open to the public

Is Flash Attention stable? Meta and Harvard found that their model weight deviations fluctuated by orders of magnitude

Release:2024-05-30 13:24:53
Is Flash Attention stable? Meta and Harvard found that their model weight deviations fluctuated by orders of magnitude

One article takes you through data models: conceptual model, logical model and physical model

Release:2024-05-30 12:00:35
One article takes you through data models: conceptual model, logical model and physical model