top of page
header.png
Team Supermassive Blackhole: Movie Finder

written by

Olivia Haimerl

Thursday, February 16, 2023

Congratulations to Team Supermassive Blackhole on winning The Erdős Institute’s Spring 2022 Data Science Bootcamp with their project Movie Finder!

 

Comprised of University of Michigan students Anna Brosowsky, Sayantan Khan, Nancy Wang, Ethan Zell, and Yili Zhang, Team Supermassive Blackhole successfully combined three existing data sets to build a movie search app that gives the title of a movie using what the user remembers about the plot. The user can also input the approximate year and genre of the movie to provide additional information for the search engine. For example, inputting “cowboy doll gets jealous of spaceman toy,” provides the search result of the 1995 film Toy Story with an accuracy of 0.84. According to the team, “our model prototype achieves 84% accuracy on the test data, which is a big improvement over the 21% achieved by our baseline model of comparing substring similarity scores.”

 

When discussing how the team settled upon these data sets and specific project, Ethan noted “there was a need to have a project that could have multiple goals and levels of success so we could be quite ambitious but also ensure that we have a product at the end of the bootcamp.” He emphasized there was a need to have a deliverable that would demonstrate why the project matters. To guarantee that the search engine would be working by the end of the four-week intensive bootcamp, Yili highlighted the importance of teamwork within the group: “we split our tasks so some team members focused their efforts on data cleaning while others worked more on machine learning. It is all about teamwork and a little bit about divide and conquer, so each of us had our own tasks and we would meet regularly throughout the week to discuss.” Anna further noted the tasks were split by interest, familiarity, and practicality. Team Supermassive Blackhole attributes their success to their ability to work together in a group, their ability to complete their own tasks in a timely manner, their interest in the project, and their project mentor, Frank Hidalgo.

 

The team noted that with more time and access to other resources, they envision a variety of future possibilities for the app. This includes allowing users to continually train the data by providing feedback on the search engine results and expanding the app to include books or tech-related information, such as finding answers to the question: “new iPad update issue with Google Drive.” Sayantan also noted that fellow students outside of the team utilized the search engine in opposition to its intended purpose: to find movies to watch based on arbitrary movie plot inputs.

 

Although the team agrees their success was due to intensive teamwork, they noted that all future bootcamp participants should aim to start their projects as soon as possible! Further, when addressing the future bootcamp participants, Nancy noted: “individuals should apply to the data science bootcamp as soon as they can, even if they are not sure if they want to go into industry or not, because it is an extremely useful skill and it would help them more when they apply for internships or jobs.”

 

Congratulations again to Team Supermassive Blackhole as well as all of the other teams who completed a Spring 2022 Data Science Bootcamp project!

TEAM

Exploring Causality between News Sentiment and Stock Movement Prediction

Jem Guhit, Nawaz Sultani, Saeid Hajizadeh, Samson Johnson

clear.png

Motivation: Financial markets are often affected by sentiment conveyed in news headlines. Understanding the causal relationship between news sentiment and stock price movements can provide deeper insight into market dynamics.
Goal: Investigate the causal effects between news sentiments and stock price movements. This includes predicting stock movement trends based on news sentiment analysis and understanding how stock movement changes based on future news sentiment. This project aims to study these effects to improve stock movement predictions and optimize portfolio performance

Project Proposal Structure:
- Improve on the sentiment analysis tool used by exploring transformer models to get more accurate sentiment scores
- Explore Bi-directional Models and CNN
- Refine Baseline Model (used ARIMA in the last project)
- Refine simulation of trading strategy that is used to calculate average percentage of portfolio growth – did our models make profit?


Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL
bottom of page