Name: Vivek Basavanth Hanagoji

Job Role: Data Analyst

Experience: 2 Years 9 Months

Address: Boston, USA

Skills

SQL 95%
PYTHON 85%
Data Visualization 90%
Statistical Analysis 85%
Machine Learning 80%

About

About Me

With 3 years of comprehensive experience in data science and analytics, and a Master of Science in Information Systems specializing in Data Analytics and Engineering, I bring a robust skill set to the table. I am proficient in data analysis, statistical analysis, hypothesis testing, customer behavior analysis, and machine learning. I have a proven track record of leading impactful projects and providing effective mentorship, driving results and fostering growth within teams.

  • Profile: Data Analystics & Engineering
  • Education: Master of Science in Information Systems, Bachelor of Engineering
  • Language: English, Hindi, Kannada, Marathi
  • BI Tools: Microsoft Power BI, Looker & Tableau
  • Interest: Traveling, Travel Photography
  • Programming Languages: Python, R, SQL, PySpark, Bash Scripting
  • Orchestration Tools: Apache Airflow, Talend, dbt
  • Data Science Libraries: Pandas, Sci-kit Learn, NumPy, TensorFlow, Matplotlib, Seaborn
  • Databases: MySQL, SQL Server, PostgreSQL, Pinecone Vector DB
  • Data Cloud Services: BigQuery, Amazon Redshift, Amazon S3, Snowflake, Amazon EC2, Google Cloud Storage
  • Visualization Tools: PowerBI, Tableau, Looker
  • Other Skills: Excel, Git, JIRA, Apache ANT, SAP, Google Analytics & SEO

0 +   Projects completed

LinkedIn

Experience

Work Experience

Experienced Data Engineer with a solid 3 years background in IT. Proven expertise in Python for data ingestion, transformation, and analysis. Eager to apply analytical skills and technical knowledge in a Data Analytics and Engineering role starting May 2024.

Experience


2020-2022

Associate Software Engineer - Cloud Services and Software in Salesforce (SFDC)

Vodafone Intelligent Solutions, Pune, India

Vodafone connects millions globally, enhancing digital inclusion and sustainability, leveraging Salesforce for smarter customer relationship management and innovation.

  • Optimized Spark-based ETL pipelines processing 5TB daily telemetry data using Apache Airflow, reducing monitoring time from 20 mins to sub-second latency for 30,000 CDRs (call detail records) across 7 European regions.
  • Collaborated with business stakeholders to identify analytics requirements and materialized data sources into a cloud-based AWS S3 data lake, integrating 20+ legacy systems and accelerating data retrieval by 60%.
  • Administered Salesforce Tableau CRM, which entailed extraction of data from Salesforce, development of datasets with dataflows and dataflow builder, and execution, scheduling, and tracking of dataflows.
  • Demonstrated success in leading and participating in 30+ Agile sprints, ensuring timely delivery of project milestones and fostering a collaborative team environment for productivity, attaining a 100% on-time delivery rate.
  • Utilized Workbench and Salesforce Object Query Language (SOQL) for advanced data patching, correcting over 5,000 records with discrepancies during year-end activities, enhancing data reliability by 45%.
  • Led on-call support data initiatives, collaborating with Project Managers and Data Scientists to optimize critical pipelines, achieving 99.9% SLA uptime, and boosting customer Net Promoter Score (NPS) by 15 points.
  • • Contributed to Nucleus, Vodafone’s metadata-driven analytics platform, optimizing ETL pipelines using PySpark that reduced data onboarding from 3 weeks to 4 hours for 1,900 daily loads across 50 million European subscribers.
  • Collaboratively worked with test and development teams, resolving common issues in Salesforce.
  • Supervised and guided a team of 2 indirect reportees on knowledge transfer and daily tasks.
  • Led on-call support data initiatives, collaborating with Project Managers and Data Scientists to optimize critical pipelines, achieving 99.9% SLA uptime, and boosting customer Net Promoter Score (NPS) by 15 points.
  • Handled BAU activities in PreProd and Production environments.
  • Worked on automations and extended support to work on POCs using DevOps tools such as AutoRabit, Gearset, and Copado.

2019-2020

Salesforce Administrator

Vodafone Intelligent Solutions

  • Streamlined data entry processes by 20% by implementing data validation rules and custom fields in Salesforce.
  • Utilized tools like Process Builder, Email Templates, Workflow Rules, Workflow Actions, and Approval Flows to improve system functionality, ensuring overall Salesforce efficiency.
  • Automated complex data patching processes using Apex Data Loader, processing over 10,000 records monthly, resulting in a 50% reduction in manual data entry errors and a 30% increase in process efficiency.
  • Led team to implement OwnBackup ETL tool for Salesforce, securing 100+ backups and anonymizing data in full-copy sandboxes.
  • Administered Salesforce sandboxes, performed refresh activities, and managed Git and Jenkins CI-CD pipelines.
  • Resolved support requests and conflicts during the deployment process.
  • Communicated daily status updates through regular correspondence and scheduled calls with integrated teams.



Education

Education

With a Master of Science in Information Systems from Northeastern University and a Bachelor of Engineering in Electronics and Telecommunication from Savitribai Phule Pune University, my academic journey has equipped me with a strong foundation in both theoretical knowledge and practical skills.

Education


2022-2024

Master of Science in Information Systems

Northeastern University, Boston
Northeastern University Campus

Relevant Courses: Big Data Systems and Intelligent Analytics, Designing Advanced Data Architecture and Business Intelligence, Data Science

2015-2019

Bachelor of Engineering in Electronics & TeleCommunication

Savitribai Phule Pune University, Pune, India
Savitribai Phule Pune University Campus

Relevant Course: Fundaments of Programming Language, Machine Learning, System Programming and Operating Systems

Projects

Projects

Below are the sample Data Analytics projects on SQL, Python, Power BI & ML.

Generating fake face images using GAN

This project begins with the generator creating easily identifiable fake images, which evolve through training cycles to become increasingly realistic.

Discriminators are then trained to detect differences between real and generated images, enhancing the generator's accuracy with each iteration.

Explored the profound capabilities of neural networks to understand and replicate the subtleties of human facial features.

IPL First Innings Score Prediction Using Linear Regression

Cleaned and preprocessed a historical IPL dataset, removing irrelevant columns and using OneHotEncoding for categorical features, resulting in a streamlined dataset of 5000+ records for consistent team performance analysis.

Developed and trained a Linear Regression model using 80% of the preprocessed data, achieving an R² score of 0.85 on the test set.

User-Friendly Web Application Deployment: Deployed the model as a web application using Flask, allowing users to input match details and receive real-time score predictions.

Report generation for Chest X-ray images using Vision Transformers and GPT-2

Data Collection and Preprocessing: Gathered and preprocessed chest X-ray datasets, ensuring data consistency and suitability for model input.

Model Selection and Training: Fine-tuned pre-trained ViT and GPT-2 models on specific datasets to achieve optimal performance tailored to our use case.

Text Generation: Implemented text generation functions using GPT-2 and BART models, and combined their outputs for enhanced medical report generation.

Ensemble Model Creation: Developed an ensemble model combining BART and GPT-2 to generate comprehensive medical reports.


NYC motor collision vehicle data analysis

Efficiently transferred over 10 million data records from BigQuery to MySQL Server by implementing a streamlined data migration process using Talend.

Designed and implemented effective ETL workflows in Talend, including Staging and Integration processes, to streamline data merging and cleansing from the NYC Motor Collision Dataset. pr

Leveraged Tableau and PowerBI to analyze and identify key trends, patterns, and actionable insights, tracking year-over-year (YOY) performance to facilitate important key performance indicators (KPIs).

GenAI chatbot for SEC government documents

Crafted an OpenAI-driven chatbot for SEC government documents, enabling precise query resolution for over 75 forms with 95% accuracy in similarity searches.

Engineered 2 Airflow ETL pipelines for SEC document processing, boosting efficiency by 40% and enhancing data storage.

Containerized application deployment using Docker on a GCP VM Instance ensures secure, scalable user access and robust data protection with an AWS Postgres instance.

0 Achievements
0 Projects
0 Years of Experience

More projects on Github

I love to solve business problems & uncover hidden data stories


GitHub

Contact

Contact Me

Below are the details to reach out to me!

Address

Boston, MA

LinkedIn

vivekhanagoji

Email Address

hanagojivivek@gmail.com

Download Resume

Resume



Have a Question? Click Here



Copyright © All rights reserved | This template is made with by Colorlib