17 Final Project

  • Start: Tuesday, December 14

17.1 Office Hours

Staff and Link Day Time
Zoom with Dave Tuesday 10:00 AM - 12:00 PM
Zoom with Dave Wednesday 5:00 PM - 7:00 PM
Discussion Forum Any! Any!

17.2 Deadline

Your submission must be submitted to Canvas by Thursday, December 16, 2021 at 11:59 PM, Champaign time. As this is very near the end of the final exam week, any late submission received without an absence letter from an emergency dean will incur a significant penalty. At minimum, a late submission will lose 20% of the points possible for the project.

17.3 Submission

You should submit a single .zip file named your-app-name.zip where your-app-name is whatever you name your app. This .zip file should contain an RStudio Project in a folder named your-app-name. This folder should contain the following, which will be described in this document:

  • your-app-name.Rproj - a required RStudio project file.
  • app.R - a required file that defines the Shiny application.
  • about.Rmd - a required file containing information describing the application.
  • R/ - a required directory that contains .R files that contain additional functions.
  • data/ - an (optional) directory that contains .csv or other files necessary to import data.

17.4 Data

For this project, you can use any data! The following thread on Ed should be used to discuss possible dataset and their appropriateness:

The following are some potential data sources which we will update as we find more through conversations on Ed:

  • wadefagen’s Useful Datasets - A number of “useful” datasets, especially related to UIUC. Of particular interest may be the GPA dataset that we have previously explored.
  • Illinois Shield Testing Data - Data on Illinois’ COVID testing. Data is available through a link to Box at the bottom of the page.
  • OPEN Powerlifting - Data on results from powerlifting meets. CSV files can be obtained on the Data page. As the data is extremely large, consider pre-processing.
  • nflfastR - A package to efficiently scrape NFL play-by-play data. See related packages as well.
  • ffverse - A collection of fantasy football R packages.
  • baseballr - A package for scraping various baseball data from websites such as FanGraphs.com, Baseball-Reference.com, and baseballsavant.com.
  • 538 Data - Data used in 538 articles.
  • Kaggle Datasets - Data from the ML practice platform Kaggle. Be aware of the licences of the various dataset. (And try to consider data that can most readily be legally shared.)
  • sportsdataverse - Collect of packages for obtaining sports data in R.
  • cfbfastR - An R package for working with CFB data
  • CDC Data - Catalog of CDC data, which currently is heavily populated with COVID data.

We recommend that you attempt to find a dataset ASAP and verify that you can load it into are and perform basic data manipulations.

17.5 Requirements

In short, you must create a Shiny application with at minimum three inputs and two outputs. The files and how to submit them is described above. The grading details below indicate additional details about the application and the necessary files.

17.6 Grading

Your project will be graded based on the following criteria. As each criteria is very specific, partial credit will only be used in very rare situations. (Assume no partial credit is possible.) Note that we are not grading on quality or creativity of analysis, only functionality and correctness of the application. (However, you should still work towards these things!) The total points available for the project is the sum of the points listed.

  • [4] After unzipping submission, all required files exist.
  • [5] After opening app.R, the applications runs without error using the “Run App” feature inside RStudio.
  • [1] Data used by the application should either be loaded directly from the web, or stored in the data/ directory and loaded accordingly. (We strongly recommend the latter so that your application will not break if a link becomes broken later.)
  • [1] The UI is well organized. Hint: leverage sidebarLayout(), fluidRow(), or other UI layout features documented on the R Shiny cheat sheet.
  • [1] The UI contains a navigation bar with at least two tabs.
  • [3] The application contains three input elements.
  • [1] Of the minimum three input elements, two must be unique.
  • [1] Of the minimum three input elements, the choices of one of the inputs must be conditionally dependent on another input.
  • [2] The application contains at least two output elements, one graphical and one tabular.
  • [1] The graphical output element is created using ggplot2.
  • [1] The graphical output element reacts to at least one of the input elements.
  • [1] The application uses at least one reactive element that is user defined using reactive().
  • [1] The application performs some data manipulation using dplyr.
  • [1] The tabular output element reacts to at least one of the input elements.
  • [1] The R/ directory should contain at least one .R file which contains at least one function.
  • [1] The function described above is used in the application.
  • [1] One of the tabs in the navigation bar links to the rendered output from about.Rmd.
  • [1] The about page has four sections: Author, Purpose, Data, and References.
  • [1] The author section of the about page should contain the name and email of the author, as a bulleted list.
  • [1] The purpose section of the about page should contain a very brief (think one paragraph) description of what the application is meant to do and what it could be used for.
  • [1] The data section of the about page should describe the source of the data. It should also describe the data in enough detail such that a user of the application understands the data enough to use the application.
  • [1] The references section of the about page should contain a list of references used, which at minimum includes the source of the data.
  • [3] When using the application, no errors or crashes occur.