Skip to content

Latest commit

 

History

History
228 lines (163 loc) · 13.2 KB

index.md

File metadata and controls

228 lines (163 loc) · 13.2 KB
layout
home
  • TOC {:toc}

Personnel and Logistics

Lectures (A1): Tuesdays and Thursdays, 3:30pm -- 4:45pm
Location: SAR 101 (635 Commonwealth Ave)

Discussion Session A2: Fridays, 10:10am -- 11:00am
Location: MUG 203 (771 Commonwealth Ave)

Discussion Session A3: Fridays, 11:15am -- 12:05pm
Location: CDS 365 (665 Commonwealth Ave)

Discussion Session A4: Fridays, 12:20pm -- 1:10pm
Location: FLR 122 (808 Commonwealth Ave)

Discussion Session A5: Fridays, 1:25pm -- 2:15pm
Location: MUG 203 (771 Commonwealth Ave)

Instructor: Thomas Gardos

  • Office: CDS 1623
  • Office Hours: Tuesdays and Thursdays, 1:30 -- 3:30pm
  • email: tgardos <at> bu <dot> edu

Teaching Assistant: Xavier Thomas

  • Office Hours: TBD
  • Office Hours TBD
  • email: xthomas <at> bu <dot> edu

Course Assistant: Hemangi Suthar

  • email: hemangis <at> bu <dot> edu

Course Description

In this course students will gain an understanding of the fundamentals in deep learning and then apply those concepts in exercises and applications in python. We’ll start with the origins of artificial neural networks, learn about loss functions, understand gradient descent, back propagation and various training optimization techniques. Students will be familiar with canonical network architecture such as multi-layer perceptions, convolutional neural networks, recursive neural networks, LSTMs and GRU, attention and transformers. Through explanations, examples and exercises students will build intuition on how deep learning algorithms work and how they are implemented in popular deep learning frameworks such as PyTorch. Students will be able to define, train and evaluate deep learning models as well as adapt deep learning frameworks to new functionality. Students will gain exposure to pre-trained large language models and other foundation models and the concepts of few-shot learning and reasoning. Finally, students will be able to apply many of the techniques they learned in a final class project.

Learning Outcomes

Upon successful completion of this course, students will be able to:

  1. Grasp foundational theories and practices in the deep learning arena.
  2. Design and implement various neural network architectures using Python and PyTorch.
  3. Employ regularization, optimization, and advanced training techniques to enhance model performance.
  4. Analyze real-world datasets, applying suitable deep learning techniques to derive actionable insights.
  5. Understand the benefits and drawbacks of various neural architectures in specific contexts.
  6. Understand the pros and cons of pre-trained large language and other foundation models and how best to employ them
  7. Complete a data-centric project, showcasing end-to-end deep learning implementation.

Prerequisites

Python Programming – Should be proficient in python and associated data science packages, or studiously working towards proficiency. See for example Scientific Python Lectures for lessons on python language and relevant packages, or The Python Tutorial for a tutorial on the core language.

Packages such as NumPy NumPy - Learn and SciPy (SciPy User Guide have tutorials and documentation as well.

The more proficient you are, the more effective you will be at the assignments and projects. We will dedicate some discussion sessions to ensure your environment is setup correctly and review some of the basics as well as answer any questions.

Math Proficiency – In order to understand the foundational concepts, it is important to have proficiency in a number of areas of mathematics. These include linear algebra, first year calculus and trigonometry as well as some concepts from Real Analysis. We will cover these concepts in the class and some recitation sessions, but refreshing or building your foundation will help.

Reference Material

The primary textbook for this course will be Understanding Deep Learning, by Simon Prince . The book is available online as a preprint and should be available in print from MIT Press in early 2024.

We’ll also reference Introduction to Linear Algebra, Sixth Edition (2023), by Gilbert Strang.

Lecture notes, consisting primarily of Jupyter notebooks will be posted online as well.

Given the fast moving nature of this area, we’ll also be citing many articles available online as well as other online reference materials in each lecture. As part of the class, we will guide the students in constructing their own bibliography and give tips on how to efficiently and effectively read research papers.

Computing Environment

Students are of course free to use their own personal computer, but you will also have access to Boston University’s Shared Computing Cluster and GPUs. Access instructions will be provided. For more information, see:

Learning Management Software

To be added.

  • [Piazza]({{ site.data.urls["piazza"].url }})
  • [Gradescope]({{ site.data.urls["gradescope"].url }})

Course Requirements

Homeworks

  • There will be approximately weekly assignments (usually posted on Fridays)to reinforce the key concepts and python/pytorch coding patterns.
  • Due dates will be 7 days after the assignment is posted.
  • A 24 hour late due date is provided for each assignment although with a 10% penalty, e.g $$P_l = 0.9 P_o$$
  • You'll also have 7 days after the assignment is graded to revise and resubmit for up to 75% of the lost points, e.g. $$P_o + 0.75 (P_o - P_r)$$
  • If you were originally late, you can still revise and resubmit but the score will be $$0.9 (P_o + 0.75 (P_o - P_r))$$

where

  • $$P_o$$ is the original score of the assignment on the due date
  • $$P_r$$ is the revised score after revising and resubmitting within 7 days of graded assignment posting
  • $$P_l$$ is the late score of the assignment after late penalty is applied

Attendance/Polls

There will be (at least one) in-class Piazza poll in each lecture to:

  • help facilitate in-class discussion
  • serve as quick knowledge check

It:

  • will be opened in class for a few minutes and then closed immediately after,
  • doubles as attendance proxy,
  • will be graded pass/fail -- basically submit to pass

You must be in class to complete the poll.

You get four (4) absences with no excuse required as well as valid excused absences, e.g. illness. Please email instructor, preferably in advance of class.

Midterm Challenge

  • We will assign a mid-term deep learning training contest
  • You must work individually and you will have 2 weeks to complete the assignment
  • You will be given a dataset and a task to complete
  • There will be a leaderboard and extra credit for the top 10 leaders

Final Project

  • A final project where you will apply deep learning methods to a problem or application of interest to you. See the [Project]({{ site.baseurl }}{% link project.md %}) page for more information.

Course Assessment

  • Final Project: 45%
  • Mid-term Project/Competition: 25%
  • Homeworks: 25%
  • Class Participation/Attendance/Polls: 5%

Final Project

See the [Project]({{ site.baseurl }}{% link project.md %}) page for more information.

Student Code of Conduct

All students are expected to abide by University conduct policies as detailed in the following links:

Academic Honesty

You may discuss homework assignments with classmates, but you are solely responsible for what you turn in. Collaboration in the form of discussion is allowed, but all forms of cheating (copying parts of a classmate’s assignment, plagiarism from books or old posted solutions) are NOT allowed. We – both teaching staff and students – are expected to abide by the guidelines and rules of the [Academic Code of Conduct](which is at http://www.bu.edu/academics/policies/academic-conduct-code/).

Graduate students must also be aware of and abide by the GRS Academic Conduct code.

You can probably, if you try hard enough, find solutions for homework problems online. Given the nature of the Internet, this is inevitable. Let me make a couple of comments about that:

  1. If you are looking online for an answer because you don’t know how to start thinking about a problem, talk to the TA or instructor, who may be able to give you pointers to get you started. Piazza is great for this – you can usually get an answer in an hour if not a few minutes.
  2. If you are looking online for an answer because you want to see if your solution is correct, ask yourself if there is some way to verify the solution yourself. Usually, there is. You will understand what you have done much better if you do that. So ... it would be better to simply submit what you have at the deadline (without going online to cheat) and plan to allocate more time for homeworks in the future.

Generative AI Assistance (GAIA) Policy

In general, we follow the policy outlined in the CDS GAIA Policy.

Extracting and paraphrasing from the student responsibilities of that policy. Where there is conflicting information between the CDS policy and below, the policy below should take precedence.

Students shall:

  1. Give credit to AI tools whenever used, even if only to generate ideas rather than usable text, illustrations or code.
  2. When using AI tools on written assignments, unless prohibited, add an appendix showing
    1. the entire exchange, highlighting the most relevant sections;
    2. a description of precisely which AI tools were used (e.g. ChatGPT private subscription version or DALL-E free version),
    3. an explanation of how the AI tools were used (e.g. to generate ideas, turns of phrase, elements of text, long stretches of text, lines of argument, pieces of evidence, maps of conceptual territory, illustrations of key concepts, etc.);
    4. an account of why AI tools were used (e.g. to save time, to surmount writer’s block, to stimulate thinking, to handle mounting stress, to clarify prose, to translate text, to experiment for fun, etc.).
    5. Optional but recommended: Employ AI detection tools and originality checks prior to submission, ensuring that their submitted work is not mistakenly flagged.
  3. When using AI tools on coding assignments, unless prohibited
    1. Add the prompt text and tool used as comments before the generated code. Clarify whether the code was used as is, or modified somewhat, moderately or significantly.
  4. Not use AI tools during in-class examinations, or assignments, unless explicitly permitted and instructed.
  5. Use AI tools wisely and intelligently, aiming to deepen understanding of subject matter and to support learning.

As these generative assistive tools become widely deployed and pervasive, we believe they will become integral to most people's workflow. However, for foundational concepts, as are taught in this course, it is in your best interest and worth it to struggle some in creating your answers and solutions. It is just as important to learn what doesn't work, and which paths are dead ends, as it is to learn what does work. When you are posed with new and unique problems, that intuition you develop will be vital in choosing directions. More pragmatically, some of the most coveted jobs at the most selective companies require technical interviews where they expect you to know these foundational concepts without assistance.

And finally, to reiterate, it is vitally important, and a core part of academic integrity, to cite when you are using Generative AI Assistive technologies. Arguably, not citing and risking plagiarism is worse than taking short cuts and using and then citing GAIA.

Accommodations for Students with Disabilities

If you have a disability and have an accommodations letter from the Disability & Access Services office, I encourage you to discuss your accommodations and needs with me as early in the semester as possible. I will work with you to ensure that accommodations are provided as appropriate. If you suspect that you may have a disability and would benefit from accommodations but are not yet registered with BU Disability & Access Services, I encourage you to find more information at https://www.bu.edu/disability/.

This syllabus provides a general plan for the course; deviations may be necessary depending on the progress of the class.