• Skip to primary navigation
  • Skip to content
  • Skip to primary sidebar

Vidya Muthukumar

  • Home
  • Biography
  • Research
  • Publications
  • Teaching/Advising
  • Service
  • Music

ECE 8803: Online Decision Making in Machine Learning (Spring 2025)

Times: Tuesday and Thursday, 3:30-4:45 pm

Location: MoSE building, room 1224

Instructor: Vidya K Muthukumar (vmuthukumar8@gatech.edu)

Office Hours: Tuesday and Thursday 4:45-5:30 pm (after class), location TBD

Prerequisites: undergraduate probability (ECE3077 or equivalent), undergraduate linear algebra (MATH 2551 or equivalent). Mathematical maturity and familiarity with proof-based arguments will be assumed.

Brief description: In many applications of machine learning (ML), data is collected sequentially; moreover, decisions can impact performance both in the present and the future. This class will deal with the design of ML algorithms for real- time decision making, including reinforcement learning. Classical applications in engineering and modern applications in the ML pipeline will both be discussed, but the focus of the course will be foundational — on understanding design principles and the inner workings of algorithms for online decision-making.

Upon successful completion of this course, students will be able to:

  • Understand and explain the basic design principles of any online algorithm under diverse assumptions on the environment and reward feedback mechanism.
  • Understand how these principles relate to classical concepts in information theory, signal processing, communications, and control theory.
  • Assess the efficacy of an online algorithm for an engineering/machine learning application based on its performance guarantees, tractability of implementation, scalability and assumptions made on the environment.
  • Appreciate how online algorithms relate to other aspects of the machine learning pipeline.

Grading/Format: The course will be graded as follows:

  • Homeworks (top 4/5): 45%
  • Midterm (take-home, tentative dates March 13-14): 25%
  • Course project: 30%

Piazza/Canvas: The primary mode of interactive communication in this course will be Piazza. Please sign up at the course page, and monitor Piazza for announcements regarding lecture, homeworks, midterm and project. As is standard, we will also use Canvas to keep track of assignments and share resources related to the class.

Resources and schedule

Lecture schedule (tentative, subject to change)

DateTopicResources
7 JanLogistics and introduction
9 JanReview session on probability and basics of MLProbability review notes
ML review notes
14 JanBasics of prediction of an adversarial sequenceLecture note
16 JanThe multiplicative weights algorithmLecture note
21 JanDecision-making using expert advice; application to linear programsLecture note
23 JanNo-regret through perturbationLecture note
28 JanNo-regret through perturbationLecture note
30 JanNo-regret through perturbation, introduction to online linear optimizationLecture note
4 FebOnline linear optimizationLecture note
6 FebOnline convex optimization and stochastic optimizationLecture note
11 FebOverview of adaptive methods in online learningLecture note
13 FebOnline learning and zero-sum game theoryLecture note
Extra note
18 FebIntroduction to limited-information feedbackLecture note
20 FebLimited-information feedback and UCBLecture note
25 FebWrapping up UCB; lower boundsLecture note
27 FebThompson sampling algorithmLecture note
Extra note
4 MarNo class (instructor conflict)
6 MarStructured bandits: Linear and Gaussian processesLecture note
11 MarContextual bandits and adversarial banditsLecture note
13-14 MarTake-home midterm
17-21 MarNo class (spring break)
25 MarDynamic programming and optimal controlLecture note
27 MarTabular RL with a generative modelLecture note
1 AprModel-based exploration in tabular RLLecture note
3 AprValue iteration and Q-learningLecture note
8 AprPolicy-based methodsLecture note
10 AprRL with function approximation, theoryLecture note
15 AprRL with function approximation, practice 
17 AprNo class (instructor travel)
22 AprLAST DAY OF CLASS: Poster presentationsCoda 9th floor atrium

Homework schedule:

Submission due date and self-grade upload deadline are both 11:59 ET. Submission and self-grade upload will be done via Canvas.

 Rough set of topicsUpload dateDue dateSelf-grade due date
Homework 0 (optional)Review of probability and linear algebra7 Jan14 JanN/A
Homework 1Basics of online prediction17 Jan4 Feb17 Feb
Homework 2Online optimization6 Feb21 Feb28 Feb

Primary Sidebar

Vidya Muthukumar

Assistant Professor
Schools of ECE and ISyE
Georgia Institute of Technology
vmuthukumar8 [at] gatech [dot] edu
Coda S1139
Google Scholar

Education

Ph.D. Electrical Engineering and Computer Sciences (2020), University of California Berkeley

B.Tech. (with honors), Electrical Engineering (2014), Indian Institute of Technology Madras

Copyright © 2025 · eleven40 Pro on Genesis Framework · WordPress · Log in

  • Home
  • Biography
  • Research
  • Publications
  • Teaching/Advising
  • Service
  • Music