This is the repository of the course on LLMs based on the Transformer architecture targeted at people with experience in Python, Machine Learning, and Deep Learning but little or no experience with Transformers and LLMS.
The course focuses on modern LLMs and multimodal models built on the Transformer architecture. A separate module covers the use of Transformers for non-text modalities (time series, tabular data, and others) and modern multimodal architectures. The course also introduces the fundamentals of building AI agents based on LLMs. As a result, you will gain comprehensive knowledge of how modern LLMs, multimodal models, RAG systems, and basic LLM agents operate both in theory and in practice.
The course emphasizes not only theoretical understanding but also practical skills for applying these models and building systems around them. Special attention is given to modern LLM architectures, relevant frameworks such as Transformers, LLM, Ollama, LangChain, and others, as well as to the efficiency of LLM-based system design.
By the end of the course, you will have a coherent understanding of the principles and practical use of LLMs, multimodal models, RAG systems, and basic LLM agents — with a strong emphasis on practical application, proper evaluation, and computational efficiency.
The course features
- Transformer model
- Transformer-based Encoders
- Classification with Transformers
- Transformer-based decoders
- Towards ChatGPT
- Efficient Transformers
- RAG with Transformers
- Introduction to AI Agents
- Uncertainty estimation for Transformers
- Uncertainty quantification generation tasks
- Multilingual language models
- Multimodal dialogue models
- Transformers for tabular data
- Transformers for event sequences
Raw versions of the editable course materials (e.g., pptx presentations) can be found here.