Team Supermassive Blackhole: Movie Finder
Thursday, February 16, 2023
Congratulations to Team Supermassive Blackhole on winning The Erdős Institute’s Spring 2022 Data Science Bootcamp with their project Movie Finder!
Comprised of University of Michigan students Anna Brosowsky, Sayantan Khan, Nancy Wang, Ethan Zell, and Yili Zhang, Team Supermassive Blackhole successfully combined three existing data sets to build a movie search app that gives the title of a movie using what the user remembers about the plot. The user can also input the approximate year and genre of the movie to provide additional information for the search engine. For example, inputting “cowboy doll gets jealous of spaceman toy,” provides the search result of the 1995 film Toy Story with an accuracy of 0.84. According to the team, “our model prototype achieves 84% accuracy on the test data, which is a big improvement over the 21% achieved by our baseline model of comparing substring similarity scores.”
When discussing how the team settled upon these data sets and specific project, Ethan noted “there was a need to have a project that could have multiple goals and levels of success so we could be quite ambitious but also ensure that we have a product at the end of the bootcamp.” He emphasized there was a need to have a deliverable that would demonstrate why the project matters. To guarantee that the search engine would be working by the end of the four-week intensive bootcamp, Yili highlighted the importance of teamwork within the group: “we split our tasks so some team members focused their efforts on data cleaning while others worked more on machine learning. It is all about teamwork and a little bit about divide and conquer, so each of us had our own tasks and we would meet regularly throughout the week to discuss.” Anna further noted the tasks were split by interest, familiarity, and practicality. Team Supermassive Blackhole attributes their success to their ability to work together in a group, their ability to complete their own tasks in a timely manner, their interest in the project, and their project mentor, Frank Hidalgo.
The team noted that with more time and access to other resources, they envision a variety of future possibilities for the app. This includes allowing users to continually train the data by providing feedback on the search engine results and expanding the app to include books or tech-related information, such as finding answers to the question: “new iPad update issue with Google Drive.” Sayantan also noted that fellow students outside of the team utilized the search engine in opposition to its intended purpose: to find movies to watch based on arbitrary movie plot inputs.
Although the team agrees their success was due to intensive teamwork, they noted that all future bootcamp participants should aim to start their projects as soon as possible! Further, when addressing the future bootcamp participants, Nancy noted: “individuals should apply to the data science bootcamp as soon as they can, even if they are not sure if they want to go into industry or not, because it is an extremely useful skill and it would help them more when they apply for internships or jobs.”
Congratulations again to Team Supermassive Blackhole as well as all of the other teams who completed a Spring 2022 Data Science Bootcamp project!
I would like to find patterns with prime numbers.