search
Classify:AIIt Industry

From bare metal to a large model with 70 billion parameters, here is a tutorial and ready-to-use scripts

Release:2024-07-24 20:13:31
From bare metal to a large model with 70 billion parameters, here is a tutorial and ready-to-use scripts

How to create an open source model that can defeat GPT-4o? Regarding Llama 3.1 405B, Meta is written in this paper

Release:2024-07-24 18:42:03
How to create an open source model that can defeat GPT-4o? Regarding Llama 3.1 405B, Meta is written in this paper

The performance is 11 times stronger. Georgia Tech and Tsinghua teams used AI to assist in discovering new energy storage materials, published in Nature sub-journal

Release:2024-07-24 17:42:52
The performance is 11 times stronger. Georgia Tech and Tsinghua teams used AI to assist in discovering new energy storage materials, published in Nature sub-journal

Neural networks also have spatial awareness! Learn to create maps in Minecraft, published in Nature sub-magazine

Release:2024-07-24 09:38:12
Neural networks also have spatial awareness! Learn to create maps in Minecraft, published in Nature sub-magazine

The first open source model to surpass GPT4o level! Llama 3.1 leaked: 405 billion parameters, download links and model cards are available

Release:2024-07-23 20:51:33
The first open source model to surpass GPT4o level! Llama 3.1 leaked: 405 billion parameters, download links and model cards are available

ECCV 2024|BlazeBVD, a general method for blind video de-flickering, is here, jointly proposed by Meitu and the National University of Science and Technology of China

Release:2024-07-23 15:13:34
ECCV 2024|BlazeBVD, a general method for blind video de-flickering, is here, jointly proposed by Meitu and the National University of Science and Technology of China

The embodied intelligent robot company invested by Xiaomi and the welding giant officially announced strategic cooperation

Release:2024-07-23 14:50:54
The embodied intelligent robot company invested by Xiaomi and the welding giant officially announced strategic cooperation

Unlimited video generation, planning and decision-making, diffusion forced integration of next token prediction and full sequence diffusion

Release:2024-07-23 14:05:21
Unlimited video generation, planning and decision-making, diffusion forced integration of next token prediction and full sequence diffusion

After 'Alibaba Star', Alibaba Taotian restarted the recruitment of top technical talents, with an annual salary of one million as standard

Release:2024-07-22 21:20:23
After 'Alibaba Star', Alibaba Taotian restarted the recruitment of top technical talents, with an annual salary of one million as standard

ICML 2024 Oral | Is DPO more suitable for LLM than PPO? Tsinghua Wuyi team's latest revelation

Release:2024-07-22 18:41:23
ICML 2024 Oral | Is DPO more suitable for LLM than PPO? Tsinghua Wuyi team's latest revelation

New standard for AI imaging, only 1% of original data can achieve the best performance, general medical basic model published in Nature sub-journal

Release:2024-07-22 17:38:00
New standard for AI imaging, only 1% of original data can achieve the best performance, general medical basic model published in Nature sub-journal

ECCV 2024 | To improve the performance of GPT-4V and Gemini detection tasks, you need this prompt paradigm

Release:2024-07-22 17:28:30
ECCV 2024 | To improve the performance of GPT-4V and Gemini detection tasks, you need this prompt paradigm

KDD 2024|Hong Kong Rhubarb Chao team deeply analyzes the 'unknown boundary' of large models in the field of graph machine learning

Release:2024-07-22 16:54:34
KDD 2024|Hong Kong Rhubarb Chao team deeply analyzes the 'unknown boundary' of large models in the field of graph machine learning

The University of Science and Technology of China and Huawei Noah proposed Entropy Law to reveal the relationship between large model performance, data compression rate and training loss.

Release:2024-07-22 16:39:35
The University of Science and Technology of China and Huawei Noah proposed Entropy Law to reveal the relationship between large model performance, data compression rate and training loss.

The weights, codes, and data sets are all open source, and the performance exceeds Mistral-7B. Apple's small model is here

Release:2024-07-22 16:18:40
The weights, codes, and data sets are all open source, and the performance exceeds Mistral-7B. Apple's small model is here