Name: Vivek Basavanth Hanagoji

Job Role: Data Analyst

Experience: 2 Years 9 Months

Address: Boston, USA

Skills

SQL 95%
Python 85%
Data Visualization 90%
Statistical Analysis 85%
Machine Learning 80%

About

About Me

With 3 years of comprehensive experience in data science and analytics, and a Master of Science in Information Systems specializing in Data Analytics and Engineering, I bring a robust skill set to the table. I am proficient in data analysis, statistical analysis, hypothesis testing, customer behavior analysis, and machine learning. I have a proven track record of leading impactful projects and providing effective mentorship, driving results and fostering growth within teams.

  • Profile: Data Analytics & Engineering
  • Education: Master of Science in Information Systems, Bachelor of Engineering
  • Language: English, Hindi, Kannada, Marathi
  • BI Tools: Microsoft Power BI, Looker & Tableau
  • Interest: Traveling, Travel Photography
  • Programming Languages: Python, R, SQL, PySpark, Bash Scripting
  • Orchestration Tools: Apache Airflow, Talend, dbt
  • Data Science Libraries: Pandas, scikit-learn, NumPy, TensorFlow, Matplotlib, Seaborn
  • Databases: MySQL, SQL Server, PostgreSQL, Pinecone Vector DB
  • Data Cloud Services: BigQuery, Amazon Redshift, Amazon S3, Snowflake, Amazon EC2, Google Cloud Storage
  • Visualization Tools: Power BI, Tableau, Looker
  • Other Skills: Excel, Git, JIRA, Apache Ant, SAP, Google Analytics & SEO

LinkedIn

Experience

Work Experience

Experienced Data Engineer with a solid three-year background in IT and proven expertise in Python for data ingestion, transformation, and analysis. Eager to apply analytical skills and technical knowledge in a Data Analytics and Engineering role starting May 2024.


2020-2022

Software Engineer

Vodafone Intelligent Solutions, Pune, India

Tech Stack:
Spark, SparkSQL, Scala, Python, Airflow, Redshift, SQL, AWS S3, IAM, Terraform, GitHub Actions, CI/CD, Data Modeling, OLAP, ETL, SCD Type 2 & Type 4, DataFrames, Datasets, Automated Testing

2019-2020

Graduate Engineer Trainee - Data

Vodafone Intelligent Solutions, Pune, India

Tech Stack:
SQL, Redshift, Power BI, HTML, CSS, Python, JIRA, KPI Dashboards, Data Modeling, Financial Analytics, Pricing Elasticity, Revenue Forecasting, Campaign Analysis, Scrum, Data Visualization, Performance Scorecards



Education

With a Master of Science in Information Systems from Northeastern University and a Bachelor of Engineering in Electronics and Telecommunication from Savitribai Phule Pune University, my academic journey has equipped me with a strong foundation in both theoretical knowledge and practical skills.


2022-2024

Master of Science in Information Systems

Northeastern University, Boston

Relevant Courses: Big Data Systems and Intelligent Analytics, Designing Advanced Data Architecture and Business Intelligence, Data Science

2015-2019

Bachelor of Engineering in Electronics & Telecommunication

Savitribai Phule Pune University, Pune, India

Relevant Courses: Fundamentals of Programming Language, Machine Learning, System Programming and Operating Systems

Projects

Below are sample data analytics projects built with SQL, Python, Power BI, and ML.

Generating fake face images using GAN

This project begins with the generator creating easily identifiable fake images, which evolve through training cycles to become increasingly realistic.

A discriminator is trained alongside the generator to tell real images from generated ones, and its feedback makes the generator's output more realistic with each iteration.

Explored the profound capabilities of neural networks to understand and replicate the subtleties of human facial features.

IPL First Innings Score Prediction Using Linear Regression

Cleaned and preprocessed a historical IPL dataset, removing irrelevant columns and using OneHotEncoding for categorical features, resulting in a streamlined dataset of 5000+ records for consistent team performance analysis.

Developed and trained a Linear Regression model using 80% of the preprocessed data, achieving an R² score of 0.85 on the test set.

User-Friendly Web Application Deployment: Deployed the model as a web application using Flask, allowing users to input match details and receive real-time score predictions.
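The evaluation flow described in this project (one-hot encoding, an 80/20 train/test split, and an R² score on held-out data) can be sketched without any third-party libraries. This is only an illustration under stated assumptions: the rows are synthetic, the team codes passed to `one_hot` are placeholders, the helper names are hypothetical, and a single numeric feature stands in for the project's full feature set. It is not the project's actual code or data.

```python
import random

def one_hot(value, categories):
    """Encode a categorical value as a 0/1 vector over a fixed category list."""
    return [1 if value == c else 0 for c in categories]

def split_80_20(rows, seed=42):
    """Shuffle a copy of the rows and return (train, test) with an 80/20 split."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]

def fit_line(xs, ys):
    """Closed-form least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Synthetic toy rows: (runs after 10 overs, final score). Not IPL data.
rng = random.Random(0)
rows = [(x, 1.9 * x + 25 + rng.gauss(0, 5)) for x in range(40, 140)]

train, test = split_80_20(rows)
slope, intercept = fit_line([x for x, _ in train], [y for _, y in train])
preds = [slope * x + intercept for x, _ in test]
r2 = r2_score([y for _, y in test], preds)

print(one_hot("MI", ["CSK", "MI", "RCB"]))  # placeholder team codes
print(len(train), len(test), round(r2, 3))
```

Scoring only on the held-out 20% is what makes the reported R² a measure of generalization rather than of fit to the training rows.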

Report generation for Chest X-ray images using Vision Transformers and GPT-2

Data Collection and Preprocessing: Gathered and preprocessed chest X-ray datasets, ensuring data consistency and suitability for model input.

Model Selection and Training: Fine-tuned pre-trained ViT and GPT-2 models on specific datasets to achieve optimal performance tailored to our use case.

Text Generation: Implemented text generation functions using GPT-2 and BART models, and combined their outputs for enhanced medical report generation.

Ensemble Model Creation: Developed an ensemble model combining BART and GPT-2 to generate comprehensive medical reports.


NYC motor collision vehicle data analysis

Efficiently transferred over 10 million data records from BigQuery to MySQL Server by implementing a streamlined data migration process using Talend.

Designed and implemented effective ETL workflows in Talend, including Staging and Integration processes, to streamline data merging and cleansing from the NYC Motor Collision Dataset.

Leveraged Tableau and Power BI to identify key trends, patterns, and actionable insights, tracking year-over-year (YoY) performance against key performance indicators (KPIs).
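As a rough illustration of the year-over-year tracking mentioned above, the percentage change between consecutive years can be computed as follows. The function name and the yearly counts are hypothetical placeholders, not figures from the NYC dataset, and the dashboards themselves were built in BI tools rather than in Python.

```python
def yoy_change(counts_by_year):
    """Return {year: percent change vs. the previous year}.

    Years without a non-zero previous-year baseline are skipped.
    """
    changes = {}
    for year in sorted(counts_by_year):
        prev = counts_by_year.get(year - 1)
        if prev:  # skips a missing or zero baseline
            changes[year] = (counts_by_year[year] - prev) * 100.0 / prev
    return changes

# Hypothetical yearly collision counts, for illustration only.
collisions = {2019: 200, 2020: 150, 2021: 180}
yoy = yoy_change(collisions)
print(yoy)
```

The first year has no baseline, so it is left out of the result instead of being reported as zero change.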

GenAI chatbot for SEC government documents

Crafted an OpenAI-driven chatbot for SEC government documents, enabling precise query resolution for over 75 forms with 95% accuracy in similarity searches.

Engineered 2 Airflow ETL pipelines for SEC document processing, boosting efficiency by 40% and enhancing data storage.

Containerized the application with Docker and deployed it on a GCP VM instance, backed by an AWS Postgres instance, for secure, scalable user access and robust data protection.
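The similarity search behind a chatbot like this one can be sketched as a top-k lookup over embedding vectors. The sketch below assumes cosine similarity as the metric and uses tiny hand-written vectors with hypothetical document ids; a real system would obtain embeddings from a model and would typically query a vector database rather than a Python dict.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query, index, k=2):
    """Return the ids of the k vectors in `index` most similar to `query`."""
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:k]]

# Hypothetical 3-dimensional embeddings keyed by illustrative document ids.
index = {
    "form_10k": [0.9, 0.1, 0.0],
    "form_8k": [0.1, 0.9, 0.0],
    "form_s1": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]
results = top_k(query, index, k=2)
print(results)
```

The retrieved documents would then be passed to the language model as context for answering the user's question.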


More projects on GitHub

I love to solve business problems & uncover hidden data stories


GitHub

Contact

Contact Me

Below are the details to reach out to me!

Address

Boston, MA

LinkedIn

vivekhanagoji

Email Address

hanagojivivek@gmail.com



