Real-Time-Twitter-Data-Analysis

  • Tech Stack: Tweepy, Kafka, Keras, Tensorflow, Dash, Plotly, NLTK, mySQL
  • Github URL: Project Link

The program reads a stream of tweets, replies, and mentions pertaining to a set of keywords or in response to tweets by specified Twitter accounts. TwitterStream.py then partially cleans the text before sending it to the master consumer using a Kafka stream. MasterConsumer.py then cleans the text further and preps it for analysis by a neural network, keyword extraction, and analysis of users being followed. Once the consumer is done it uploads the data to a mySQL server. The front end then selects the data it wants from the SQL server and displays it on the dashboard.