Safety of Large Language Models - Qiuzhen College, Tsinghua University

Open Course

Safety of Large Language Models

Source: 04-12

Time: 09:50-12:15

Venue: A3-4-312

Organizer: /

Speaker: /

Date: 2025-04-16 ~ 2025-04-25

Schedule

Weekday	Time	Venue	Online	ID	Password
Mon,Wed,Fri	09:50-12:15	A3-4-312	Zoom	815 762 8413	BIMSA

Introduction

This course introduces students to the core principles and challenges surrounding the safe and responsible development of large-scale neural language models. It is designed for graduate students and technical professionals with prior experience in machine learning and natural language processing.

The first part of the course covers the basics of LLMs, including architectural foundations and training procedures. The second part goes deeper, exploring vulnerabilities such as hallucinations and adversarial attacks, as well as recent advances in aligning LLMs with human intent and values.

List of Lectures

1. Introduction to Transformer Models and LLMs

2. Training of LLMs: From Pretraining to Fine-tuning

3. Hallucination Detection in LLMs

4. Adversarial Attacks on Language Models

5. Alternatives to Transformers: LLMs and State-space models

Lecturer Intro

Alexey Zaytsev has deep expertise in machine learning and the processing of sequential data. He publishes at top venues, including KDD, ACM Multimedia, and AISTATS. Industrial applications of his results are in service at companies including Airbus, Porsche, and Saudi Aramco.

Topology of large language models data representations
Speaker: Serguei Barannikov (BIMSA, IMJ-PRG)
Time: 14:00 - 16:00, 2024-12-05
Venue: A3-1-301
ZOOM: 230 432 7880
PW: BIMSA
Organizers: Mingming Sun, Yaqing Wang
Abstract: The rapid advancement of large language models (LLMs) has made distinguishing between human and AI-generated text increasingly challenging. The talk examines the topological structures within LLM data representations, focusing on their ...
Fundamentals of Natural Language Processing
Prerequisite: Computer Science, Machine Learning, Python
Abstract: Natural Language Processing (NLP) is an important research area in Artificial Intelligence. NLP mainly studies how to use computer technology to process linguistic texts. The specific research problems in NLP include recognition, classification, extraction, transformation and generation of lexical, syntactic, semantic and pragmatic i...
