
Programming Assignment (PA) Schedule

Documents and Dates

  • PA0: Setting up AWS and Dask
  • PA1: Data Exploration with Dask
  • PA2: Feature Engineering and Model Selection with Spark
    • Release Date: 05/16/2023
    • Statement and Grading Rubric
    • Discussion by TA: to be posted on Canvas 05/23/2023
    • Due Date: 06/09/2023

Instructions and Caveats

  • Team composition:
    • You can work on projects in teams of 1 to 3 individuals.
    • Submit your team decision via a Google Form we provide before PA0’s release. One submission suffices per team.
    • Team decisions cannot be changed.
    • The TAs will then confirm your team memberships and team IDs.
  • Academic integrity:
    • It is okay to discuss the assignment with your peers at a conceptual level. It is also okay to post conceptual or high-level questions, logistical questions, and useful references on Piazza. But do not share any code across teams and do not post any of your solution code for discussion. A team’s code submission must be entirely their own.
    • Do not search for code posted online by other students or from prior editions of the course. We will use advanced program analysis tools to compare your code submissions. These go well beyond basic string or syntactic comparisons to catch plagiarism.
    • If plagiarism is detected in your code or if any other form of academic integrity violation is identified, you will receive a zero for that component of your score and your grade will be lowered substantially. I will also notify the University authorities so that appropriate disciplinary action can be taken, up to and including expulsion from the University.
  • There are no late days for the programming assignments, so plan your work accordingly!