Homework 5: Big Data, Big Responsibalities

Due: Tuesday October 8 2019, 9:00PM.

Handin

After completing the homework, you will submit:

If you are using late days, make sure to make a note of that in your readme. Remember, you may only use a maximum of 3 late days per assignment. If the assignment is late (and you do NOT have anymore late days) no credit will be given.

Helpful Things

Documentation

The Assignment

No coding for the homework this week! Good luck with design checks :)

README.txt

Fill out your README as usual that will contain the ethics assignment.

Ethics Assignment

Additional responses required for this week will pertain to the ethics assignment.

Data Perspectives: Algorithmic Bias

Beyond teaching you some technical skills around computing and data, CS111 also wants to help you think about some of the broader societal issues surrounding data and computing. To this end, we will have a few short reading assignments that ask you to reflect on implications and possible principles of working with computing and data. This is the second such assignment.

We estimate this assignment will take less than an hour.

Topic: Big Data and Surveillance

Read this MIT Technology Review article that describes several issues researchers should consider when handling big data sets. Then, read this New York Times article that explores how law enforcement agencies use location data collected by Google for a new investigative technique. If necessary, use this guide to access the New York Times through a free academic pass.

You’ll put your answers to the following questions in the README, clearly marking which answers belong to which questions. Your answer to each question should be a few sentences long—aim for thoughtfulness over length! Your answers should be clear and concise and should demonstrate serious reflection, but there are no right or wrong answers.

Answer any three of the four questions below:

  1. Choose an example of Big Data analysis that you’ve read about in a class, seen in the news, heard in a statistic, or even performed yourself. Explain how one of the characteristics of big data that Crawford and boyd identify could affect how you interpret the result of the analysis.
  2. The MIT Technology Review article highlights several concerns about the use of Big Data. Propose two solutions to address these issues. One solution should address the concern on an individual level (i.e. what should a data scientist or researcher do differently). The other solution should address the issue on a broader, structural level (think about laws, cultural beliefs, and corporate policies surrounding Big Data).
  3. The new investigative technique described in the New York Times article can help solve crimes, but it can also implicate innocent people and represent a violation of privacy. Do you believe this technique should be used? Explain why.
  4. Consider the “if you build it, they will come” principle and the Washington State prosecutor’s belief that “if companies are collecting data, law enforcement should be able to obtain a court order to use it.” Should companies change their data collection and storage practices given this information? Do you believe it is the responsibility of tech companies, the legal system, or individuals to protect personal data?

Grading
We will check that your responses:

Brown University CSCI 0111 (Fall 2019)
Do you have feedback? Fill out this form.