Homework 5: Big Data, Big Responsibalities
Due: Tuesday October 8 2019, 9:00PM.
Handin
- Download your file(s) and make sure the follow the naming convention listed in the assignment handout. Hand in your work on Gradescope. Look at this guide for directions.
- You may submit as many times as you want. Only your latest submission will be graded. This means that if you submit after the deadline, you will be using a late day – so do NOT submit after the deadline unless you plan on using late days.
- Do not put your name anywhere in any of the handin files. Doing so will result in a deduction.
- The README template can be found here
After completing the homework, you will submit:
If you are using late days, make sure to make a note of that in your readme. Remember, you may only use a maximum of 3 late days per assignment. If the assignment is late (and you do NOT have anymore late days) no credit will be given.
Helpful Things
Documentation
The Assignment
No coding for the homework this week! Good luck with design checks :)
README.txt
Fill out your README as usual that will contain the ethics assignment.
Ethics Assignment
Additional responses required for this week will pertain to the ethics assignment.
Data Perspectives: Algorithmic Bias
Beyond teaching you some technical skills around computing and data, CS111 also wants to help you think about some of the broader societal issues surrounding data and computing. To this end, we will have a few short reading assignments that ask you to reflect on implications and possible principles of working with computing and data. This is the second such assignment.
We estimate this assignment will take less than an hour.
Topic: Big Data and Surveillance
Read this MIT Technology Review article that describes several issues researchers should consider when handling big data sets. Then, read this New York Times article that explores how law enforcement agencies use location data collected by Google for a new investigative technique. If necessary, use this guide to access the New York Times through a free academic pass.
You’ll put your answers to the following questions in the README, clearly marking which answers belong to which questions. Your answer to each question should be a few sentences long—aim for thoughtfulness over length! Your answers should be clear and concise and should demonstrate serious reflection, but there are no right or wrong answers.
Answer any three of the four questions below:
- Choose an example of Big Data analysis that you’ve read about in a class, seen in the news, heard in a statistic, or even performed yourself. Explain how one of the characteristics of big data that Crawford and boyd identify could affect how you interpret the result of the analysis.
- The MIT Technology Review article highlights several concerns about the use of Big Data. Propose two solutions to address these issues. One solution should address the concern on an individual level (i.e. what should a data scientist or researcher do differently). The other solution should address the issue on a broader, structural level (think about laws, cultural beliefs, and corporate policies surrounding Big Data).
- The new investigative technique described in the New York Times article can help solve crimes, but it can also implicate innocent people and represent a violation of privacy. Do you believe this technique should be used? Explain why.
- Consider the “if you build it, they will come” principle and the Washington State prosecutor’s belief that “if companies are collecting data, law enforcement should be able to obtain a court order to use it.” Should companies change their data collection and storage practices given this information? Do you believe it is the responsibility of tech companies, the legal system, or individuals to protect personal data?
Grading
We will check that your responses:
- are related to the readings
- demonstrate evidence of critical thinking and reflection
- are well written, rather than being full of spelling and grammar mistakes.
Brown University CSCI 0111 (Fall 2019)
Do you have feedback? Fill out this form.
Homework 5: Big Data, Big Responsibalities
Due: Tuesday October 8 2019, 9:00PM.
Handin
After completing the homework, you will submit:
README.txt
If you are using late days, make sure to make a note of that in your readme. Remember, you may only use a maximum of 3 late days per assignment. If the assignment is late (and you do NOT have anymore late days) no credit will be given.
Helpful Things
Documentation
The Assignment
No coding for the homework this week! Good luck with design checks :)
README.txt
Fill out your README as usual that will contain the ethics assignment.
Ethics Assignment
Additional responses required for this week will pertain to the ethics assignment.
Data Perspectives: Algorithmic Bias
Beyond teaching you some technical skills around computing and data, CS111 also wants to help you think about some of the broader societal issues surrounding data and computing. To this end, we will have a few short reading assignments that ask you to reflect on implications and possible principles of working with computing and data. This is the second such assignment.
We estimate this assignment will take less than an hour.
Topic: Big Data and Surveillance
Read this MIT Technology Review article that describes several issues researchers should consider when handling big data sets. Then, read this New York Times article that explores how law enforcement agencies use location data collected by Google for a new investigative technique. If necessary, use this guide to access the New York Times through a free academic pass.
You’ll put your answers to the following questions in the README, clearly marking which answers belong to which questions. Your answer to each question should be a few sentences long—aim for thoughtfulness over length! Your answers should be clear and concise and should demonstrate serious reflection, but there are no right or wrong answers.
Answer any three of the four questions below:
Grading
We will check that your responses: