Jay Taylor's notes
back to listing indexStanford University CS224d: Deep Learning for Natural Language Processing
[web search]
Original source (cs224d.stanford.edu)
Clipped on: 2017-04-22
Please see cs224n.stanford.edu for the current (Winter 2017) version of this class.
Course Description
Natural language processing (NLP) is one of the most important technologies of the information age. Understanding complex language utterances is also a crucial part of artificial intelligence. Applications of NLP are everywhere because people communicate most everything in language: web search, advertisement, emails, customer service, language translation, radiology reports, etc. There are a large variety of underlying tasks and machine learning models powering NLP applications. Recently, deep learning approaches have obtained very high performance across many different NLP tasks. These models can often be trained with a single end-to-end model and do not require traditional, task-specific feature engineering. In this spring quarter course students will learn to implement, train, debug, visualize and invent their own neural network models. The course provides a deep excursion into cutting-edge research in deep learning applied to NLP. The final project will involve training a complex recurrent neural network and applying it to a large scale NLP problem. On the model side we will cover word vector representations, window-based neural networks, recurrent neural networks, long-short-term-memory models, recursive neural networks, convolutional neural networks as well as some very novel models involving a memory component. Through lectures and programming assignments students will learn the necessary engineering tricks for making neural networks work on practical problems.
Previous Years Project Reports |
---|
2015 Reports |
2016 Reports |
Class Time and Location
Spring quarter (March - June, 2016)Lecture: Tuesday, Thursday 3:00-4:20
Location: Gates B1
Office Hours
Richard: Tue 4:30-6:30pm, Huang Basement(for research and project discussions)
TAs:
David: Mon 6:00-8:00pm, Huang 138
Bharath: Teus 1:00-3:00pm, Huang Basement
James: Wed, 5:30-7:30pm, Gates B26
Sameep: Thur, 12:45-2:45pm, Gates B21
Naveen: Fri, 1:00-3:00pm, Huang Basement
Qiaojing: Sun, 4:00-6:00pm, Gates B24
Grading Policy
Assignment #1: 15%Assignment #2: 15%
Assignment #3: 15%
Midterm: 15%
Final Project: 40%
Course Discussions
Stanford students: Piazza (for Stanford students)Online discussions: Reddit Group (for non-Stanford students)
Our Twitter account: @CS224d
Assignment Details
See the Assignment Page for more details on how to hand in your assignments.Course Project Details
See the Project Page for more details on the course project.Prerequisites
- Proficiency in Python
All class assignments will be in Python (and use numpy). There is a tutorial here for those who aren't as familiar with Python. If you have a lot of programming experience but in a different language (e.g. C/C++/Matlab/Javascript) you will probably be fine. - College Calculus, Linear Algebra (e.g. MATH 19 or 41, MATH 51)
You should be comfortable taking derivatives and understanding matrix vector operations and notation. - Basic Probability and Statistics (e.g. CS 109 or other stats course)
You should know basics of probabilities, gaussian distributions, mean, standard deviation, etc. - Equivalent knowledge of CS229 (Machine Learning)
We will be formulating cost functions, taking derivatives and performing optimization with gradient descent.
Recommended
- Knowledge of natural language processing (CS224N or CS224U)
We will discuss a lot of different tasks and you will appreciate the power of deep learning techniques even more if you know how much work had been done on these tasks and how related models have solved them. - Convex optimization
You may find some of the optimization tricks more intuitive with this background. - Knowledge of convolutional neural networks (CS231n)
The first problem set will probably be easier for you. We cannot assume you took this class so there will be ~3 lectures that overlap in content. You can use that time to dive deeper into some aspects.
FAQ
Is this the first time this class is offered?
This is the second offering of this course. The class is designed to introduce students to deep learning for natural language processing. We will place a particular emphasis on Neural Networks, which are a class of deep learning models that have recently obtained improvements in many different NLP tasks.
Can I follow along from the outside?
We'd be happy if you join us! We plan to make the course materials widely available: The assignments, course notes and slides will be available online. We may provide videos. We won't be able to give you course credit.
Can I take this course on credit/no cred basis?
Yes. Credit will be given to those who would have otherwise earned a C- or above.
Can I audit or sit in?
In general we are very open to sitting-in guests if you are a member of the Stanford community (registered student, staff, and/or faculty). Out of courtesy, we would appreciate that you first email us or talk to the instructor after the first class you attend.
Can I work in groups for the Final Project?
Yes, in groups of up to two people.
I have a question about the class. What is the best way to reach the course staff?
Stanford students please use an internal class forum on Piazza so that other students may benefit from your questions and our answers. If you have a personal matter, email us at the class mailing list Will be added shortly .
Can I combine the Final Project with another course?
Yes, you may. There are a couple of courses concurrently offered with CS224d that are natural choices, such as CS224u (Natural Language Understanding, by Prof. Chris Potts and Bill MacCartney). If you are taking a related class, please speak to the instructors to receive permission to combine the Final Project assignments.
As an SCPD student, how do I make up for poster presentation component?
For the final poster presentation you can submit a video via youtube about your project.
As an SCPD student, how do I take the midterm?
For the midterm, we can use standard SCPD procedures of having your manager or somebody at your company monitor you during the exam.
Will there be virtual office hours for SCPD students
All office hours will be accesible on google hangouts. The link to the hangout is available on piazza
Webdesign by Andrej Karpathy