Math 152: Data Mining

Contact Information and Office Hours:

  • Instructor: Thang Huynh
  • Email: tlh007@ucsd.edu
  • Office Hours: MWF 12:00pm - 1:00 pm at AP&M 6341
  • Lecture Times: MWF 8:00-8:50am at CENTR 105

Teaching Assistants:

  1. Dun Qiu:
    • Email: duqiu@ucsd.edu
    • Office Hours: 11am - 1pm on Thursday at AP&M 2000B
  2. Xindong Tang

Syllabus – Click Here


Calendar – Click Here


Piazza – Click Here


Catalog Description:

We will cover among other topics (tentative): sampling, finding frequent items, counting distinct elements, general frequency moment estimation, finding frequent item sets, dimensionality reduction, and matrix approximation.

Textbooks: There is no course textbook. We will primarily be following Edo Liberty’s course notes, and Jelani Nelson’s. I will post a reference for each lecture.


Lecture Notes:

  1. Week 1:

  2. Week 2:

  3. Week 3 and 4:

  4. Week 5 and 6:

  5. Week 7 and 8:

  6. Week 9 and 10:


Homework - Click Here


Exams

  • Final: June 14. It is a cumulative exam. Click here for an old exam and solution

  • Midterm 2: May 24. Probabilistic Inequalities: Markov’s, Chebyshev’s, and Chernoff’s (Week 3-4); Data Stream (Week 5-6); SVD (Week 7-8) Click here for an old exam

  • Midterm 1: April 26. Cover Linear Algebra (Week 1); Basic probability (Week 2-3); Probabilistic Inequalities: Markov’s, Chebyshev’s, and Chernoff’s (Week 3-4). Click here for an old exam


Course Resources

Syllabus You are responsible for knowing the information and policies in the syllabus.
Homework


Final Exam Responsibilities An outline of the responsibilities of faculty and students with regard to final exams.

Avatar
Thang Huynh
S.E.W. Visiting Assistant Professor