Safety of Large Language Models

Time：09:50-12:15

Venue：A3-4-312

Organizer：/

Speaker：/

Date2025-04-16 ~ 2025-04-25

Schedule

Weekday	Time	Venue	Online	ID	Password
Mon,Wed,Fri	09:50-12:15	A3-4-312	Zoom	815 762 8413	BIMSA

Introduction

This course introduces students to the core principles and challenges surrounding large-scale neural language models' safe and responsible development. It is designed for graduate students and technical professionals with prior experience in machine learning and natural language processing.

The course will explore the basics of LLMs, including architectural foundations, training procedures. The second part of the course goes deeper with exploring vulnerabilities such as hallucinations and adversarial attacks, and recent advances in aligning LLMs with human intent and values.

List of Lectures

1. Introduction to Transformer Models and LLMs

2. Training of LLMs: From Pretraining to Fine-tuning

3. Hallucination Detection in LLMs

4. Adversarial Attacks on Language Models

5. Alternatives to Transformers: LLMs and State-space models

Lecturer Intro

Alexey Zaytsev has deep expertise in machine learning and processing of sequential data. He publishes at top venues, including KDD, ACM Multimedia and AISTATS. Industrial applications of his results are now in service at companies Airbus, Porsche and Saudi Aramco among others.

DATEApril 12, 2025

Related News

0
Fundamentals of Natural Language Processing
PrerequisiteComputer Science, Machine Learning, PythonAbstractNatural Language Processing (NLP) is an important research area in Artificial Intelligence. NLP mainly studys how to use computer technology to process linguistic texts. The specific research problems in NLP includes recognition, classification, extraction, transformation and generation of lexical, syntactic, semantic and pragmatic i...
1
Large-N Matrix models and Spectral Curves
IntroductionMatrix models are ubiquitous both in Mathematics and in Theoretical Physics. The aim of this course is to introduce the main approaches to U(N)-invariant random matrix models: Coulomb gas, orthogonal polynomials, loop equations, topological recursion, tau-functions and integrable hierarchies. Most of these techniques are based on the fact that the eigenvalues of the random matrix be...