Skip to content

Downloads subtitles, generates word/count charts based on the word counts in the subtitles.

Notifications You must be signed in to change notification settings

mustafaulker/Subtitle-Analyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Subtitle Analyzer

Downloads the subtitles of IMDB Top 100 movies.
Lists all the words in the subtitles and their counts in an Excel file. Filters stopwords. (e.g. "the", "a/an", "him", etc.)
Generates word/count charts based on the data obtained.

Usage

Requirements

  • Python >= 3.6
  • Run pip install -r requirements.txt for other dependencies.

Execution

Run python src/subtitle_analyzer.py in the project directory.

Output

Generated Excel file and Chart images will be in the data_output file under project directory.

Output Example

Horizontal Chart

About

Downloads subtitles, generates word/count charts based on the word counts in the subtitles.

Topics

Resources

Stars

Watchers

Forks

Languages