Since the release of OpenAI’s o1 and DeepSeek’s R1 models, interest in the reasoning capabilities of LLMs has increased. This half-semester (7-week) course will cover some of the main ingredients that go into enhancing an LLM’s reasoning capability. We will also focus on some recent theory papers that try to understand this fascinating emerging area from a mathematical perspective.
Prior exposure to LLMs and learning theory will help but is not required. However, a high level of mathematical maturity will be needed to fully benefit from this course. The topics list below is tentative and subject to change.
Time & Days: TuTh 2:30PM - 4:00PM
Location: 2060 SKB
Half-semester course dates: Aug 25, 2025 - Oct 10, 2025
J&M = Speech and Language Processing (3rd ed. draft), Jurafsky and Martin