• Skip to primary navigation
  • Skip to content
  • Skip to primary sidebar

Vidya Muthukumar

  • Home
  • Biography
  • Research
  • Publications
  • Teaching/Advising
  • Service
  • Music

ECE 8803: Online Decision Making in Machine Learning (Fall 2023)

Times: Tuesday and Thursday, 2 – 3:15 pm

Location: Molecular Sciences and Engineering building, room 1224

Instructor: Vidya K Muthukumar (vmuthukumar8@gatech.edu)

Office Hours: Tuesday and Thursday 4:45-5:30 pm (after lecture)

Prerequisites: undergraduate probability (ECE3077 or equivalent), undergraduate linear algebra (MATH 2551 or equivalent). Mathematical maturity and familiarity with proof-based arguments will be assumed.

Brief description: In many applications of machine learning (ML), data is collected sequentially; moreover, decisions can impact performance both in the present and the future. This class will deal with the design of ML algorithms for real- time decision making, including reinforcement learning. Classical applications in engineering and modern applications in the ML pipeline will both be discussed, but the focus of the course will be foundational — on understanding design principles and the inner workings of algorithms for online decision-making.

Upon successful completion of this course, students will be able to:

  • Understand and explain the basic design principles of any online algorithm under diverse assumptions on the environment and reward feedback mechanism.
  • Understand how these principles relate to classical concepts in information theory, signal processing, communications, and control theory.
  • Assess the efficacy of an online algorithm for an engineering/machine learning application based on its performance guarantees, tractability of implementation, scalability and assumptions made on the environment.
  • Appreciate how online algorithms relate to other aspects of the machine learning pipeline.

Grading/Format: The course will be graded as follows:

  • Homeworks (top 4/5): 45%
  • Midterm (take-home, date TBD): 25%
  • Course project: 30%

Piazza/Canvas: The primary mode of interactive communication in this course will be Piazza. Please sign up at the course page, and monitor Piazza for announcements regarding lecture, homeworks, midterm and project. As is standard, we will also use Canvas to keep track of assignments and share resources related to the class.

Resources and schedule

Lecture schedule:

DateTopicResources
22 AugLogistics and introductionIntroductory slides
24 AugReview session on probability and basics of MLProbability review
Basics of ML review
29 AugBasics of prediction of an adversarial sequenceLecture note
31 AugThe multiplicative weights algorithmLecture note
5 SepThe multiplicative weights algorithm and decision-making using expert adviceLecture note
7 SepNo-regret through perturbationLecture note
12 SepNo-regret through perturbationLecture note
14 SepNo-regret through perturbation, introduction to online linear optimizationLecture note
19 SepOnline linear optimizationLecture note
21 SepOnline convex optimization and stochastic optimizationLecture note
26 SepOverview of adaptive methods in online learningLecture note
28 SepOnline learning and zero-sum game theoryLecture note
3 OctIntroduction to limited-information feedbackLecture note
5 OctLimited-information feedback and UCBLecture note
12 OctWrapping up UCB; informal discussion of lower boundLecture note
19 OctThompson sampling algorithm, Part 1Lecture note
26 Oct Thompson sampling algorithm, Part 2Lecture note
31 OctStructured bandits: Linear and Gaussian processesLecture note
2 NovContextual bandits and adversarial banditsLecture note
7 NovDynamic programming and optimal control Lecture note
9 NovTabular RL with a generative modelLecture note
14 NovModel-based exploration in tabular RLLecture note
16 NovValue iteration and Q-learningLecture note
21 NovPolicy-based methodsLecture note
28 Nov An overview of RL theory with function approximationLecture note
30 Nov Guest lecture (Bo Dai): RL and function approximation in practice 
5 DecLAST DAY OF CLASS: Poster presentations OR misc topics (TBD)

Homework schedule:

Submission due date and self-grade upload deadline are both 11:59 ET. Submission and self-grade upload will be done via Canvas.

 Rough set of topicsUpload dateDue dateSelf-grade due date
Homework 0 (optional)Review of probability and linear algebra22 Aug29 AugN/A

Primary Sidebar

Vidya Muthukumar

Assistant Professor
Schools of ECE and ISyE
Georgia Institute of Technology
vmuthukumar8 [at] gatech [dot] edu
Coda S1139
Google Scholar

Education

Ph.D. Electrical Engineering and Computer Sciences (2020), University of California Berkeley

B.Tech. (with honors), Electrical Engineering (2014), Indian Institute of Technology Madras

Copyright © 2025 · eleven40 Pro on Genesis Framework · WordPress · Log in

  • Home
  • Biography
  • Research
  • Publications
  • Teaching/Advising
  • Service
  • Music