Lectures are on Tuesdays and Thursdays from 3:30 PM - 4:45 PM. The course website is on GitHub (https://github.com/chaneyn/CEE690-02). Class announcements will be made via Sakai (CEE690.02.F20).
Professor Nathaniel W. Chaney (Nate)
Email: nathaniel.chaney@duke.edu
Office: FCIEMAS 2463
Office hours: Thursdays after class via Zoom
Laura Torres
Email: lpt14@duke.edu
Office hours: Mondays 8-10am via Zoom
Environmental Spatial Data Analysis (ESDA) provides an introduction on how to leverage large volumes of spatial environmental data using primarily Python. The topics that will be covered include an overview of basic spatial statistics, spatial interpolation, kriging, conditional simulation, terrain analysis, dimensionality reduction, and spatial prediction. Existing software packages in Python will be introduced and used to explore the listed topics.
Although there are no class prerequisistes, a strong foundation in programming will make this class much easier. Please contact Nate if you have concerns.
There are no required textbooks. Reading will be provided via journal articles, online materials, and tutorials.
The course grade is based on three items:
- Homework: 40%
- Participation: 20%
- Final Project: 40%
There will be 4 homework assignments. Each assignment will be provided and completed within a corresponding Jupyter notebook. Completed assignments will be submitted via a private GitHub repository that each student will have for the course; assignments submitted via any other method will not be accepted. Each assignment must be submitted before class on the day listed on the schedule below. Late homeworks will not be accepted.
- Students should follow along the lecture on their personal jupyter lab Docker container that they will use for the course.
- Each student will present twice. The first presentation will involve describing a dataset and the second will be present a journal article.
Collaboration in completing assignments is permitted. However, each student must write up their assignment independently. We will be checking for identical homeworks.
The final project can be done in groups of 2 or individually. The expectations for the project will increase with the group size. It will involve the following components:
- Initial Proposal (October 22nd via email)
- 3 pages max, single-spaced, 12 point font size, 1 inch margin
- Contains: Title, introduction, objectives, data, methodology, and timeline of tasks
- Oral presentation (November 17th and 19th in class)
- 12 minute oral presentation, 3 minutes for questions
- Everyone needs to be present for each presentation
- Final report (November 24th via email)
- 10 pages max, single-spaced, 12 point font size, 1 inch margin
- Contains: Title, introduction, data, methods, results, discussion, and conclusion
Note that the schedule is subject to change.
Date | Topic | New Software | Assignments | Article |
---|---|---|---|---|
08/18 | Introduction | Jupyter/GitHub/Bash | - | - |
08/20 | Python overview | Python | - | Lin, J., 2012 |
08/25 | Multi-dimensional arrays I | NumPy | - | Lu et al., 2018 (Owen Daly) |
08/27 | Visualizing data | Matplotlib | - | Rougier et al., 2014 (Keqi He) |
09/01 | Data storage | Pickle/H5py/NetCDF/GeoTiff | - | Extance, 2016 (Laura Torres) |
09/03 | Probability/Statistics I | Scipy | - | Holmes, 2018 (Celine Robinson) |
09/08 | Probability/Statistics II | - | HW #1 due | Walther and Moore, 2005 (Sarah Scott) |
09/10 | Bayesian Statistics | - | - | Prathvikumar, 2019 (Gary Jiang) |
09/15 | Map projections I | Cartopy | - | Lapaine, 2017 (Lijia Gao) |
09/17 | Map projections II | GDAL | - | Asay, 2020 and Simmon, 2017 (Huda Aslam) |
09/22 | Multi-dimensional arrays II | CDO/Xarray | - | Hoyer and Hamman, 2017 (Cary Shindell) |
09/24 | Vector Data | OGR/Shapely/GeoPandas | - | Kreveld, 2006 (Rundong Ji) |
09/29 | Cluster Analysis I | Scikit-Learn | - | Mishra, 2017 (Peiran Wang) |
10/01 | Cluster Analysis II | - | - | - |
10/06 | Dimensionality Reduction | - | HW #2 due | - |
10/08 | Decision Trees | - | - | Homer et al., 2004 (Huda Aslam) |
10/13 | No class | - | - | - |
10/15 | Random Forests | - | - | Kaminska, J., 2018 (Sarah Scott) |
10/20 | Boosting | - | - | Cai, J., et al., 2020 (Celine Robinson) |
10/22 | Simple Kriging | - | Proposal due | Wong, D., et al., 2004 (Gary Jiang) |
10/27 | Ordinary Kriging | - | - | Pouladi, N., et al., 2019 (Lijia Gao) |
10/29 | Semivariogram | - | - | Hengl, T., et al., 2007 (Owen Daly) |
11/03 | Regression Kriging | - | HW #3 due | - |
11/05 | Terrain Analysis I | - | - | - |
11/10 | Terrain Analysis II | - | - | Moore, I., et al., 1991 (Keqi He) |
11/12 | Scaling up code | Numba/Mpi4py/Dask | - | Bakharia, A., 2018 and Grover, P., 2018 (Cary Shindell) |
11/17 | Oral Presentations | - | - | - |
11/19 | Oral Presentations | - | - | - |
11/24 | Written report due | - | HW #4 due | - |