Hey there,

I’m Hari and I love playing with my favorite toy, Data.

I’m a Data Engineer with around four years of experience in data science, and I am absolutely fascinated by the potential benefit it offers to the world.



I've always viewed data as LEGO blocks. If I were to explain data science using LEGO as an example, I would say that data science is all about taking a pile of blocks (the data) and turning them into something useful (a finished LEGO structure).

Let me explain: in any data-related project, I would first need to collect a bunch of data (the LEGO blocks). Next, I would need to analyze the data to find patterns and relationships. Finally, I would use that information to build something useful (the LEGO structure).

Data science is all about turning data into something useful. Just like you can use LEGO blocks to build all sorts of different structures, you can use data to build all sorts of different things.


A bit more about myself: I’m someone who loves to take something and improve it, because, why not? It’s fun, and it’s something I try to apply to all aspects of my life. I’m a huge health nut who is constantly trying new diets and gym routines. Whenever I listen to a new podcast, I try to find one thing I can immediately apply to my life, and I usually notice that a single change carries over into other areas. I bring this same cross-learning philosophy to my professional life.

Skills

I love architecting and improving data systems. That is why I try to learn something new whenever I get the opportunity, be it the basics of a new programming language, a leadership seminar, or a new orchestration tool that hit the market. I believe that these little things add up and shape the way I think over time.

Here is an overview of my professional skills.

  • SQL
  • Azure cloud services
  • Python
  • Amazon Web Services
  • Scala
  • Google Cloud Platform
  • NoSQL (MongoDB, CouchDB)
  • Airflow, Kafka, Hive

My Personal Projects

Here are a few of my personal projects that I've been working on.

Azure - Twitter Sentiment Analysis with Stream Processing

- This is an end-to-end project where we pull live tweets from Twitter using Spark clusters on Azure Databricks and perform sentiment analysis on them with Azure Language Services.

- We then store the final data in an Azure SQL DB, with a Parquet backup in an ADLS Gen2 data lake; the pipeline is orchestrated with Azure Data Factory.

- Finally, we connect Power BI to the final Azure SQL DB. Note: Azure Synapse can be used instead.

- For this project I retrieved tweets about Marvel, since Ms. Marvel had just been released; the final Power BI report shows how people on Twitter feel about it.

- Click on the image to go to my GitHub
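The heavy lifting (Databricks, Language Services, Data Factory) runs in Azure, but the core aggregation step can be sketched locally. A minimal sketch, assuming tweets have already been labeled by Azure Language Services — the sample records below are made up for illustration:

```python
from collections import Counter

# Hypothetical batch of tweets already labeled by Azure Language Services
# (made-up sample data for illustration).
labeled_tweets = [
    {"text": "Loved the new episode!", "sentiment": "positive"},
    {"text": "Not my favourite so far", "sentiment": "negative"},
    {"text": "It was okay, I guess", "sentiment": "neutral"},
    {"text": "Absolutely amazing cast", "sentiment": "positive"},
]

def sentiment_breakdown(tweets):
    """Count tweets per sentiment label and return percentage shares."""
    counts = Counter(t["sentiment"] for t in tweets)
    total = sum(counts.values())
    return {label: round(100 * n / total, 1) for label, n in counts.items()}

print(sentiment_breakdown(labeled_tweets))
# {'positive': 50.0, 'negative': 25.0, 'neutral': 25.0}
```

This is the shape of the table Power BI ultimately visualizes: one row per sentiment label with its share of the stream.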

AWS - YouTube Analytics

- This is an end-to-end project where we perform basic transformation and processing of YouTube analytics data from the YouTube API.

- The data is stored in an initial landing bucket and is then transformed with the help of AWS Glue and AWS Lambda.

- Finally, a Glue job stores the data in an analytics database that we query with Amazon Athena.

- This database is connected to Tableau, where we explore the data and identify key metrics through visualizations.

- Click on the image to go to my GitHub
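The basic transformation step can be sketched as follows. This is a local sketch, not the actual Lambda/Glue code; the nested field names (`snippet`, `statistics`, `viewCount`, etc.) follow the shape of the YouTube Data API response, and the sample record is made up:

```python
import json

# Made-up response in the nested shape returned by the YouTube Data API.
raw = json.loads("""
{"items": [
  {"id": "abc123",
   "snippet": {"title": "Demo video", "categoryId": "10"},
   "statistics": {"viewCount": "1500", "likeCount": "120"}}
]}
""")

def flatten_items(payload):
    """Flatten nested video records into flat rows ready for an analytics table."""
    rows = []
    for item in payload.get("items", []):
        snippet = item.get("snippet", {})
        stats = item.get("statistics", {})
        rows.append({
            "video_id": item.get("id"),
            "title": snippet.get("title"),
            "category_id": snippet.get("categoryId"),
            "views": int(stats.get("viewCount", 0)),   # API returns counts as strings
            "likes": int(stats.get("likeCount", 0)),
        })
    return rows

print(flatten_items(raw))
```

Flattening and type-casting up front is what makes the downstream Athena queries and Tableau joins straightforward.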

GCP - COVID Prediction

- A prediction model built on data from Toronto Public Health.

- We use GCP Dataproc to create a cluster where we load the data.

- The project aims to create a model that predicts the number of COVID patients who will be admitted to the ICU.

- The predictive model is developed with Apache Spark, using a random forest classifier.

- The final results are written to Google Cloud Storage and then loaded into a Tableau workbook for further exploration.
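The model itself trains on the Dataproc cluster, but the feature-assembly step ahead of it can be sketched locally. A minimal sketch, with the caveat that the field names (`age_group`, `hospitalized`, `icu_admitted`, etc.) are illustrative stand-ins, not the real Toronto Public Health schema:

```python
# Encode categorical case fields into numeric features: the step that would
# normally feed Spark's VectorAssembler before the random forest classifier.
AGE_GROUPS = {"19 and younger": 0, "20 to 29": 1, "30 to 39": 2,
              "40 to 49": 3, "50 to 59": 4, "60 and older": 5}

def to_feature_row(record):
    """Turn one raw case record into a (features, label) pair."""
    features = [
        AGE_GROUPS.get(record["age_group"], -1),   # -1 marks an unknown bucket
        1 if record["hospitalized"] else 0,
        1 if record["outbreak_associated"] else 0,
    ]
    label = 1 if record["icu_admitted"] else 0     # ICU admission is the target
    return features, label

sample = {"age_group": "60 and older", "hospitalized": True,
          "outbreak_associated": False, "icu_admitted": True}
print(to_feature_row(sample))  # ([5, 1, 0], 1)
```

In the actual pipeline this mapping runs as a Spark transformation across the whole dataset rather than one record at a time.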

AQI Dashboard - Tableau

- A dashboard built on AQI data for the USA, sourced from Kaggle.

- We use a Jupyter notebook to preprocess the data.

- The data is then imported to Tableau.

- The Tableau Dashboard allows the user to understand several metrics and their trends over the years.

- The dashboard also lets the user view each state's trends over the years, as well as the overall AQI performance of the USA.
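The notebook preprocessing behind the dashboard can be sketched as below, assuming pandas and an illustrative subset of the Kaggle columns (`state`, `year`, and `aqi` are stand-ins for the real column names):

```python
import pandas as pd

# Illustrative subset of the Kaggle AQI data.
df = pd.DataFrame({
    "state": ["California", "California", "Texas", "Texas"],
    "year":  [2019, 2020, 2019, 2020],
    "aqi":   [85, 78, 60, 65],
})

# Average AQI per state per year: the per-state trend behind the dashboard.
yearly = df.groupby(["state", "year"], as_index=False)["aqi"].mean()

# National average per year, for the USA-wide AQI view.
national = df.groupby("year", as_index=False)["aqi"].mean()

print(yearly)
print(national)
```

These two tidy tables are exactly the shape Tableau likes: one row per state-year for the state view, one row per year for the national view.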

Store sales - Tableau

- A dashboard built on sample store sales data, sourced from Kaggle.

- We use a Jupyter notebook to preprocess the data.

- The data is then imported to Tableau.

- The Tableau dashboard lets the user understand several sales metrics of the store over three months.

- The dashboard also lets the user view branch-wise and month-wise trends over the three months, such as customer rating, best-selling category, and average foot traffic throughout the day.
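The branch-by-month view can be sketched with a pandas pivot, again assuming illustrative column names (`branch`, `month`, `total`) rather than the real Kaggle schema:

```python
import pandas as pd

# Illustrative slice of the sample store sales data.
sales = pd.DataFrame({
    "branch": ["A", "A", "B", "B", "A", "B"],
    "month":  ["Jan", "Feb", "Jan", "Feb", "Mar", "Mar"],
    "total":  [120.0, 150.0, 90.0, 110.0, 130.0, 95.0],
})

# Branch x month sales matrix: the month-wise trend per branch.
pivot = sales.pivot_table(index="branch", columns="month",
                          values="total", aggfunc="sum")
print(pivot)
```

The same pivot pattern works for the other metrics (customer rating, foot traffic) by swapping the `values` column and aggregation function.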

Los Angeles Government Payroll - Python

- We use a Jupyter notebook to preprocess the data.

- Initial data cleansing is done with supplementary research about Los Angeles wage laws.

- The Jupyter notebook lets the user explore several gender- and race-based pay insights for Los Angeles over nine years.
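The core pay-gap calculation in the notebook can be sketched as follows, assuming pandas and hypothetical column names (`year`, `gender`, `total_pay` are illustrative, not the actual payroll schema):

```python
import pandas as pd

# Illustrative slice of the payroll data (column names are assumptions).
payroll = pd.DataFrame({
    "year":      [2013, 2013, 2013, 2014, 2014, 2014],
    "gender":    ["F", "M", "F", "F", "M", "M"],
    "total_pay": [62000, 70000, 58000, 64000, 72000, 69000],
})

# Median pay by gender per year: the core figure behind the pay-gap insight.
median_pay = payroll.groupby(["year", "gender"])["total_pay"].median().unstack()

# Gap expressed as a percentage of the male median for that year.
median_pay["gap_pct"] = 100 * (1 - median_pay["F"] / median_pay["M"])
print(median_pay.round(1))
```

Medians rather than means keep the figure robust to a handful of very highly paid outliers, which payroll data tends to have.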