清华主页 EN
导航菜单

Safety of Large Language Models

来源: 04-12

时间:09:50-12:15

地点:A3-4-312

组织者:/

主讲人:/

Date2025-04-16 ~ 2025-04-25

Schedule

Weekday Time Venue Online ID Password

Mon,Wed,Fri

09:50-12:15

A3-4-312

Zoom

815 762 8413

BIMSA

Introduction

This course introduces students to the core principles and challenges surrounding large-scale neural language models' safe and responsible development. It is designed for graduate students and technical professionals with prior experience in machine learning and natural language processing.

The course will explore the basics of LLMs, including architectural foundations, training procedures. The second part of the course goes deeper with exploring vulnerabilities such as hallucinations and adversarial attacks, and recent advances in aligning LLMs with human intent and values.

List of Lectures

1. Introduction to Transformer Models and LLMs

2. Training of LLMs: From Pretraining to Fine-tuning

3. Hallucination Detection in LLMs

4. Adversarial Attacks on Language Models

5. Alternatives to Transformers: LLMs and State-space models

Lecturer Intro

Alexey Zaytsev has deep expertise in machine learning and processing of sequential data. He publishes at top venues, including KDD, ACM Multimedia and AISTATS. Industrial applications of his results are now in service at companies Airbus, Porsche and Saudi Aramco among others.

返回顶部
相关文章
  • Topology of large language models data representations

    Speaker: Serguei Barannikov BIMSA, IMJ-PRGTime: 14:00 - 16:00, 2024-12-05Venue: A3-1-301ZOOM: 230 432 7880PW: BIMSAOrganizers: Mingming Sun, Yaqing WangAbstractThe rapid advancement of large language models (LLMs) has made distinguishing between human and AI-generated text increasingly challenging. The talk examines the topological structures within LLM data representations, focusing on their ...

  • Fundamentals of Natural Language Processing

    PrerequisiteComputer Science, Machine Learning, PythonAbstractNatural Language Processing (NLP) is an important research area in Artificial Intelligence. NLP mainly studys how to use computer technology to process linguistic texts. The specific research problems in NLP includes recognition, classification, extraction, transformation and generation of lexical, syntactic, semantic and pragmatic i...