Homework 1-4

In this homework, you will be analyzing a dataset using the spreadsheet tools SORT and FILTER.

Task 1

Get the movie dataset from kaggle. This will require you to make a kaggle account. Add the csv file to your Google Drive and open it in Google Sheets.

In a separate sheet from the initial table of metadata, provide the directors and movie titles sorted by their budgets. Which movie made in 2016 was the highest grossing? (Find your answer using sort.)

Task 2

Find all of the movies that are 2-3 hours long and in English. Write a few sentences explaining what filter(s) you used, how many movies you found, and the average IMDB rating of those movies.

On a separate sheet, provide titles of all of the movies that had more facebook likes for the director than the lead actor.

Task 3

Using any combination of sorting and filtering, find the longest movie that had more than 10,000 facebook likes and got an IMDB score greater than 8, preferably with a minimum of scrolling through data. Write down the name and director you found and how you found it.

Task 4

On a separate sheet, please put down how many hours you worked on this assignment, any people you worked with, and whether you went to TA hours for help or clarification on this assignment.

Extra Credit

Using the tools you've learned about so far, come up with a data analysis problem you can do with this dataset. As a jumping off point, you could consider seeing how the various types of facebook likes are related to the IMDB rating. Tell us about your problem and your work as well as what you find.


Handin

Once you're done, share your files with cs0030handin@gmail.com by midnight, 2/8.

You should be handing in two files: a Google drive spreadsheet and a text file. Make sure your submission has your name in the filename: FirstLast_HW1-4. “FirstLast” should be replaced with your first and last name or we will take off points. Make sure every task has been completed.