• Skip to primary navigation
  • Skip to content
  • Skip to primary sidebar

Vidya Muthukumar

  • Home
  • Biography
  • Research
  • Publications
  • Teaching/Advising
  • Service
  • Music

ISyE 4803: Online Learning and Decision Making (Spring 2024)


Times:
Tuesday and Thursday, 9:30-10:45 am

Location: Groseclose 119

Instructor: Vidya K Muthukumar (vmuthukumar8@gatech.edu)

GTA: Krishna V Acharya (krishna.acharya@gatech.edu)

MTA: Lokranjan Lakshmikanthan (llakshmi3@gatech.edu)

Office Hours: Instructor OH Tuesdays 11 am-12 pm, Groseclose 336

GTA OH TBD

MTA OH TBD

Prerequisites: ISYE 3133 (Optimization) and ISYE 2027 (Probability). Mathematical maturity and familiarity with working with abstract mathematical notation will be assumed. A few elementary proof-based arguments will also be covered.

Brief description: At the heart of most machine learning applications today – like advertisement placement, movie recommendation, and node prediction in evolving networks – is an optimization engine trying to provide the best decision with the information observed thus far in time, i.e. the problem of online learning. To solve these problems, one must make online, real-time decisions and continuously improve the performance with the arrival of data and feedback from previous decisions. The course aims to provide a foundation for the development of such online methods and for their analysis. We will discuss fundamental principles for learning from an unknown environment, limited feedback, and learning with dynamic, long-term consequences.

Upon successful completion of this course, students will be able to:

  • Understand where online learning is applicable in many real-world scenarios.
  • Develop algorithms that combine partial information as best as possible to make online decisions.
  • Understand how exploration of decision space and exploitation from historic data must be prioritized to be able to reach optimal decisions.
  • Understand the dynamic-programming principle of sequential decision-making when decisions have long-term consequences, and appreciate principles for learning in such long-term environments.

Grading/Format: The course will be graded as follows:

  • Assignment 0: 2% bonus (released Jan 9, due Jan 16, covers course prerequisites)
  • Homeworks (4 in total): 50%
  • Two in-class midterms (tentative dates: Feb 13 and March 26): 30% in total
  • Final exam (April 29, 8-10:50 am): 20%

Lecture medium and study resources: All lectures will be in person and on the board (projected through tablet). I will record lectures and make the recording and accompanying handwritten notes available a few hours after lecture. These recordings are intended to be an accompaniment to in-person attendance rather than a substitute.
In addition to this, I also will provide supplemental type-written notes.

Piazza/Canvas: The primary mode of interactive communication in this course will be Piazza. Please sign up at the course page, and monitor Piazza for announcements regarding lecture, homeworks and exams. As is standard, we will also use Canvas to keep track of assignments and share resources related to the class.

Resources and schedule

Lecture schedule (tentative, subject to change)

DateTopicAdditional resources guide (internally available on Canvas/Piazza)
9 JanLogistics and introductionLecture recording and slides
11 JanReview session on prerequisitesLecture recording and notes,
Chapter 1 of supplementary material
16 JanIntroduction to prediction of an adversarial binary sequence and the halving algorithmLecture recording and notes,
Chapter 2 and Chapter 3 of supplementary material
18 JanBinary sequence prediction: The halving algorithm and the weighted majority algorithmLecture recording and notes, Chapter 3 of supplementary material
23 JanBinary sequence prediction: The randomized weighted majority algorithmLecture recording and notes, Chapter 3 of supplementary material
25 JanPrediction with expert advice: General loss functionsLecture recording and notes, Chapter 4 of supplementary material
30 JanPrediction with expert advice: General loss functions (continued)Lecture recording and notes, Chapter 4 of supplementary material
1 FebApplication: solving linear programs, continuedLecture recording and notes, Chapter 5 of supplementary material
6 FebApplication: solving linear programs (continued)Lecture recording and notes, Chapter 5 of supplementary material
8 FebReview: Online sequence predictionLecture recording and notes
13 FebIn-class midterm 1
15 FebMidterm 1 discussionLecture recording and notes, midterm solutions
20 FebNO CLASS (instructor travel)
22 FebNO CLASS (instructor travel)
27 FebLimited-information feedback (bandits): Introduction and heuristicsLecture recording and notes, Chapter 6 of supplementary material
29 FebBandits, pure-greedy and epsilon-greedy algorithmsLecture recording and notes, Chapter 6 of supplementary material
5 MarchBandits, UCBLecture recording and notes, Chapter 7 of supplementary material
7 MarchBandits, UCBLecture recording and notes, Chapter 7 of supplementary material
12 MarchBandits, UCBLecture recording and notes, Chapter 7 of supplementary material
14 MarchBandits, Thompson samplingLecture recording and notes, Chapter 7 of supplementary material
19 MarchNO CLASS (Spring break)
21 MarchNO CLASS (Spring break)
26 MarchReview: BanditsLecture recording and notes
28 MarchIn-class midterm 2
2 AprilBandits, Thompson sampling and wrap-up
4 AprilDynamic programming and optimal control
9 AprilDynamic programming and optimal control, continued
11 AprilA birds’ eye view of reinforcement learning
16 AprilFairness and ethical considerations in decision making
18 AprilLAST DAY OF CLASS: Review and summary of current research directions in online learning
29 AprilFinal exam (8-10:50 am)

Assignment schedule (tentative, subject to change)

AssignmentRelease dateDue dateScope of homework
09 Jan16 JanPrerequisites
118 Jan7 FebBinary and general loss sequence prediction
215 Feb1 MarchGeneral loss sequence prediction and LPs
35 March5 AprilLimited-information feedback (bandit) algorithms
44 April19 AprilRecommender systems and dynamic programming

Primary Sidebar

Vidya Muthukumar

Assistant Professor
Schools of ECE and ISyE
Georgia Institute of Technology
vmuthukumar8 [at] gatech [dot] edu
Coda S1139
Google Scholar

Education

Ph.D. Electrical Engineering and Computer Sciences (2020), University of California Berkeley

B.Tech. (with honors), Electrical Engineering (2014), Indian Institute of Technology Madras

Copyright © 2025 · eleven40 Pro on Genesis Framework · WordPress · Log in

  • Home
  • Biography
  • Research
  • Publications
  • Teaching/Advising
  • Service
  • Music