


Capability alignment, long text, Claude 3, this time we will talk about the key technical paths of large models
The large text model has reached a new level. Claude 3 surpasses GPT-4 and Gemini 1.0 Ultra, which was launched less than a month ago, in multiple dimensions such as mathematics, programming, multi-language understanding, and vision. "Rapidly changing" is no longer enough to describe the current development trend of large model technology. In order to better share the latest progress in large model technology, in 2024, this site, Zhangjiang Science and Technology Investment, Zhangjiang Incubator, and WAIC Circle jointly launched the "Large Model Technology Workshop" series of activities, inviting frontline experts from industry, academia, and research to bring cutting-edge observations and insights . On the afternoon of March 22, on the 3rd floor of Building A, Kehai Building, No. 800 Naxian Road, Zhangjiang, Shanghai, the theme was "Claude 3 The heat wave is coming, let’s talk about the key technical paths of text large models", from Fudan University, Waveform Intelligence, Amazon Cloud technology scholars and technical experts will conduct in-depth sharing and exchanges. Professional audiences who are concerned about the progress of large models are welcome to join the event and communicate and discuss together.
Speaker:
- Gui Yu
Associate researcher at Fudan University Natural Language Processing Laboratory
Research field:
- Pre-trained model
- Human-like alignment
- Agent interaction
Academic achievements:
- Published more than 50 papers in high-level international academic journals and conferences
- Host multiple talent projects (National Natural Science Foundation of China) , Computer Society, Artificial Intelligence Society)
-
Awards won:
- Qian Weichang First Prize in Chinese Information Processing Science and Technology Award
- NeurIPS2023 Large Model Alignment Track Best Paper Award
- COLING2018 Best Paper Nomination Award
- NLPCC2019 Outstanding Paper Award
- CIPS Excellent Paper Award
- ACM Excellent Paper Award
-
Selected:
- China Association for Science and Technology Youth Talent Promotion Project
- Shanghai Morning Star Program
- World Artificial Intelligence Conference Yunfan Award "Bright Star"
Speech title: Training and inference solution for large models of ultra-long text creative writing
Speaker:
Zhou Wangchunshu, CTO of Waveform Intelligence.
- Graduated from the Sino-French Engineering College of Beihang University with a bachelor's degree and a master's degree
- Ph.D. studied at ETH Zurich, studying under Ryan Cotterell & Mrinmaya Sachan
- Dropped out of school in April 2023 and founded AIWaves, serving as the company's Cofounder & CTO
-
The research directions mainly include:
- LLM training & prompting
- language agents
- long/creative text generation
- efficient methods for NLP
- multi-modal LLMs
- commonsense reasoning etc.
- Received Baidu Scholarship in 2022
- Worked as an intern at MSRA/Byte AI Lab/AI2 and other institutions, and served as a research scientist at Bytedance AI Lab
- Zhou Wang Chunshu has worked in machine learning and research fields such as NeurIPS/ICML/ICLR/ACL/EMNLP/NAACL He has published more than 30 articles in natural language processing conferences, and serves as a reviewer for these conferences and as the Action Editor/Area Chair of ARR/*ACL.
Speech title: Claude 3 technical analysis and scenario demonstration
Speaker:
Lin Ye, senior solution architect of Amazon Cloud Technology. Good at C++/C#/Java/PHP/Python/JS and other development languages, and has continuously developed a Github repo from single digits to 3000. He has built a shared bicycle APP that supports 10 million users, participated in the development of a number of well-known car company APPs, and won the Zhejiang ACM Award in 2005. Now he focuses on the development of enterprise cloud native architecture and GenAI, and is committed to applying his capabilities to enterprises. Business scene.
Event Registration
Registration for the "Large Model Technology Workshop Phase 1" has been opened. Scan the QR code below or click "Read Original" at the bottom to go directly to the event registration page.
The above is the detailed content of Capability alignment, long text, Claude 3, this time we will talk about the key technical paths of large models. For more information, please follow other related articles on the PHP Chinese website!

Introduction Suppose there is a farmer who daily observes the progress of crops in several weeks. He looks at the growth rates and begins to ponder about how much more taller his plants could grow in another few weeks. From th

Soft AI — defined as AI systems designed to perform specific, narrow tasks using approximate reasoning, pattern recognition, and flexible decision-making — seeks to mimic human-like thinking by embracing ambiguity. But what does this mean for busine

The answer is clear—just as cloud computing required a shift toward cloud-native security tools, AI demands a new breed of security solutions designed specifically for AI's unique needs. The Rise of Cloud Computing and Security Lessons Learned In th

Entrepreneurs and using AI and Generative AI to make their businesses better. At the same time, it is important to remember generative AI, like all technologies, is an amplifier – making the good great and the mediocre, worse. A rigorous 2024 study o

Unlock the Power of Embedding Models: A Deep Dive into Andrew Ng's New Course Imagine a future where machines understand and respond to your questions with perfect accuracy. This isn't science fiction; thanks to advancements in AI, it's becoming a r

Large Language Models (LLMs) and the Inevitable Problem of Hallucinations You've likely used AI models like ChatGPT, Claude, and Gemini. These are all examples of Large Language Models (LLMs), powerful AI systems trained on massive text datasets to

Recent research has shown that AI Overviews can cause a whopping 15-64% decline in organic traffic, based on industry and search type. This radical change is causing marketers to reconsider their whole strategy regarding digital visibility. The New

A recent report from Elon University’s Imagining The Digital Future Center surveyed nearly 300 global technology experts. The resulting report, ‘Being Human in 2035’, concluded that most are concerned that the deepening adoption of AI systems over t


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

ZendStudio 13.5.1 Mac
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 English version
Recommended: Win version, supports code prompts!

SublimeText3 Linux new version
SublimeText3 Linux latest version