Open to Work · Remote Preferred · Open to Relocate Anywhere in Ireland
Vishal Chaudhary

Turning Data into
Actionable Insights

Data Analyst / Engineer

Python • SQL • ETL • Tableau • AWS • Snowflake • dbt • EDA

3+ years building ETL pipelines, executive dashboards, and ML models across SaaS and data-driven environments — translating complex data into strategic business intelligence.

25%
Reporting Efficiency Improved
30%
Analysis Time Reduced
20%
Data Prep Time Reduced
8%
Churn Reduced via Insights
70–90%
ML Model Accuracy Range
500K+
E-Commerce Transactions Analysed
100K+
Healthcare Records Processed
1,470+
Employees in HR Dataset
18,894
Football Matches Analysed
10+
Executive Dashboards Built

About Me

I'm a Data Analyst / Engineer based in Dublin, Ireland, with an MSc in Data Analytics (Distinction) from National College of Ireland — completed in 2026. I specialise in building end-to-end data solutions — from ETL pipelines and cloud data warehouses to executive dashboards and predictive models that drive real business decisions.

With 3+ years of hands-on experience across the full analytics stack, I've designed scalable ETL workflows using dbt, AWS Glue, and Snowflake; built SaaS KPI dashboards in Tableau; and applied ML models that contributed to a 12% reduction in customer churn. I thrive at the intersection of data engineering, cloud infrastructure, and business intelligence.

Dublin · Open to Relocate Anywhere in Ireland MSc Data Analytics — Completed ✓ Remote Preferred · On-site Flexible
Download Resume

Education & Certifications

MSc in Data Analytics ✓ Completed

National College Of Ireland • Distinction · 2025–2026

Bachelor of Computer Applications

Maharaja Agrasen Himalayan Garhwal University • Grade 2:1

IBM IBM Certifications

Introduction to Data AnalyticsVerify
Excel Basics for Data AnalysisVerify
Data Visualization & DashboardsVerify
Python for Data Science & AIVerify
SQL and Relational Databases 101Verify

M Microsoft Certifications

Microsoft 365 FundamentalsVerify
Work Smarter with Microsoft ExcelVerify
Work Smarter with Microsoft WordVerify
Work Smarter with Microsoft PowerPointVerify

Work Experience

Professional journey in data analytics

Data Analyst

Technohunk Info Solutions

April 2022 – Sept 2024

UP, India

  • Built and maintained scalable ETL pipelines using SQL, dbt, and AWS Glue to ingest raw S3 data into Snowflake, reducing manual data preparation time by 20% and improving data quality across 3 business functions.
  • Designed 6+ executive-level Tableau dashboards sourced from Snowflake, tracking SaaS KPIs (ARR, churn, retention, feature adoption) for finance, operations, and leadership teams.
  • Led large-scale EDA on product and financial datasets using Python, SQL, and AWS Athena, improving reporting efficiency by 25% and driving 3 product roadmap decisions across 5+ product lines.
  • Applied ML techniques (Random Forest, Logistic Regression, K-Means) on 12,000+ customer records to build churn prediction and segmentation models — 80% accuracy, contributing to a 12% churn reduction within 3 quarters.
  • Optimised SQL queries on Snowflake and Athena, reducing query runtime by 35% and cutting dashboard load times across key reporting workflows.
  • Standardised reporting frameworks and data definitions across finance and operations, reducing errors by 40% and saving ~3 hours/week in manual reconciliation.
  • Defined 15+ KPIs in partnership with product, finance, and operations stakeholders; managed analytics workflows in Agile/Jira and documented data models in Confluence.
Python SQL dbt Snowflake AWS Glue AWS Athena AWS S3 Tableau EDA Scikit-learn Jira

Data Analyst Intern

Technohunk Info Solutions

Nov 2021 – April 2022
  • Assisted in data cleaning, transformation, and validation of raw datasets using SQL and Excel to support ongoing analytics projects.
  • Conducted exploratory data analysis to identify trends and anomalies, supporting senior analysts in insight generation.
  • Developed reports and dashboards in Tableau to track key business metrics and performance indicators.
  • Wrote and optimised SQL queries to extract and analyse data from relational databases in Agile sprints.
Python SQL Tableau Excel

What I Do

Core services I deliver as a Data Analyst / Engineer

Data Analysis & EDA

Deep-dive exploratory analysis on large datasets to uncover patterns, trends, and outliers that drive business understanding. From raw CSV to boardroom insight.

SQL & Database Design

Complex query writing, data modelling, performance optimisation, and scalable analytics-ready pipelines using PostgreSQL, MySQL, and dbt.

Dashboard & Visualisation

Executive-ready interactive dashboards in Tableau, Power BI, and Cognos that turn complex data into clear narratives for non-technical stakeholders.

Machine Learning

End-to-end ML pipelines using XGBoost, Random Forest, and deep learning. Feature engineering, model evaluation, SHAP interpretability, and deployment.

Business Intelligence

KPI design, SaaS metric tracking (ARR, churn, retention), and reporting frameworks that align data outputs with strategic business goals.

Causal & Statistical Analysis

Going beyond correlation — applying SDID, SHAP, LIME, and causal inference methods to validate true relationships and support evidence-based decisions.

ETL Pipeline Engineering

End-to-end ETL design using dbt, AWS Glue, and Snowflake — from raw S3 ingestion to analytics-ready datasets, with data quality checks and transformation logic built for scale.

Cloud Data & AWS

Building cloud-native analytics infrastructure on AWS — S3, Athena, Glue, Redshift, RDS, CloudWatch — optimised for cost, performance, and reliability at scale.

Featured Projects

Real-world data analytics case studies — click any card to explore in depth

View all repositories on GitHub
View Case Study
Healthcare ML 82% Accuracy

Diabetes Readmission Prediction

Logistic Regression & ML pipeline on 101,763 hospital records to predict high-risk patient readmissions. Academic research at NCI.

Python Scikit-learn Logistic Regression
View Case Study
Retail Analytics 30% Time Saved

E-Commerce Sales Analytics

RFM clustering + Random Forest on 541,909 UCI transactions. R²=0.98, CRISP-DM methodology, customer segmentation.

SQL Python Power BI
View Case Study + Live Demo
HR Analytics Live Dashboard ✦

HR Dashboard Analytics

Tableau dashboard revealing 16.12% attrition rate across 1,470 employees. Sales dept = 56% of total turnover. Embedded live.

Tableau SQL Python
View Case Study
Sports Analytics 64% Accuracy

Football Match Prediction

XGBoost on 18,894 matches across 10 European leagues. SHAP + LIME interpretability. 150+ engineered features.

XGBoost SHAP API-Football
View Case Study
Causal ML 94.36% AUC

Causality in Video Recommendations

SDID causal validation on KuaiRand-1K (3.13M interactions). 54.2% precision gain over standard DiD. NCI MSc research.

SDID XGBoost Causal Inference
View Case Study
Geospatial 67% XGBoost

Road Collisions Analysis

XGBoost & Random Forest on 104,258 UK collision records. Speed limit top predictor. CRISP-DM. SMOTE balancing.

Python XGBoost Geopandas
View Case Study
Environmental Washington State

EV Adoption Impact Analysis

EPA + DOE data pipeline using MongoDB & PostgreSQL. King County: 200k+ EV registrations. AQI vs. adoption correlation.

Python MongoDB PostgreSQL
View Case Study
Simulation 94% Accuracy

Mining Logistics Simulation

Discrete event simulation using SimPy. 94% throughput prediction accuracy. Fleet sizing optimisation via what-if analysis.

Python SimPy Optimization
View Case Study
Deep Learning 87% Accuracy

Multimodal Emotion Recognition

ResNet-50 + BERT fusion for 7-class emotion classification from images and text. SMOTE for class imbalance handling.

TensorFlow OpenCV NLP
View Case Study + Live Demo
Financial Analytics Live Dashboard ✦

Bank Loan Report

Interactive Tableau dashboard analysing loan portfolio health — good vs. bad loan KPIs, funded amounts, repayment trends, and borrower risk profiles.

Tableau SQL Excel
Live Interactive Dashboard

HR Analytics Dashboard

Explore the live Tableau dashboard analysing attrition patterns across 1,470 employees. Interact with filters, drill down by department, and see real workforce insights.

Interactive — use filters to explore by department, gender, and role Open in Tableau
Live Interactive Dashboard

Bank Loan Report

Explore the live Tableau dashboard analysing loan portfolio health — track good vs. bad loan KPIs, funded amounts, repayment trends, and borrower risk profiles across the full dataset.

Interactive — use filters to explore by loan grade, purpose, state, and term Open in Tableau

Technical Skills

Tools and technologies across the full analytics & engineering stack

Hover any node to reveal proficiency · 3 orbital rings · 18 technologies

Also Familiar With

Full stack of tools across the analytics & engineering ecosystem

Data Engineering

ETL Pipelines dbt Snowflake AWS Glue Databricks Data Modelling Data Quality

Cloud & AWS

AWS S3 AWS Athena AWS Lambda Redshift RDS CloudWatch IAM Cost Management QuickSight

Analytics & BI

Tableau Power BI Cognos Analytics EDA KPI Design Data Storytelling DAX / Power Query

Programming & Libraries

Python SQL R Pandas NumPy Scikit-learn XGBoost Matplotlib / Seaborn SHAP

Databases

Snowflake PostgreSQL MySQL MongoDB Oracle Amazon Aurora Query Optimisation

Automation & Monitoring

Automation AWS Lambda CloudWatch Git / GitHub Jira Confluence Agile / Scrum Data Governance

Data Insights

Thoughts on data analytics, machine learning, and business intelligence

Machine Learning

April 2025 · 6 min read

Why XGBoost Outperforms Deep Learning on Tabular Data

A breakdown of why gradient boosting consistently beats neural networks on structured tabular datasets — and what the evidence from football match prediction and causal ML reveals about choosing the right model for the job.

Vishal Chaudhary Coming Soon
Data Engineering

March 2025 · 8 min read

From Raw SQL to Executive Dashboard: A Data Analyst's Pipeline

The full journey from messy data to a polished executive dashboard — covering SQL modelling, dbt transformations, Python cleaning, and Tableau visualisation. With real examples from SaaS KPI tracking.

Vishal Chaudhary Coming Soon
Causal ML

February 2025 · 7 min read

When Correlation Lies: Lessons from Causal ML in Recommendation Systems

How standard ML models in video platforms confuse correlation with causation — and why Synthetic Difference-in-Differences (SDID) delivered 54.2% better causal precision than traditional methods.

Vishal Chaudhary Coming Soon

Words on Data

Wisdom that shapes how I think about analytics

"

Without data, you're just another person with an opinion.

WE
W. Edwards Deming
Pioneer of Quality Management
"

Data is the new oil. It's valuable, but if unrefined it cannot really be used.

CH
Clive Humby
Mathematician & Data Strategist
"

Torture the data, and it will confess to anything. The goal is insight, not confirmation.

RC
Ronald Coase
Nobel Prize Economist
Currently Exploring

Always Learning, Always Growing

Azure Data Fundamentals (DP-900)
Large Language Models (LLMs)
Generative AI & Prompt Engineering
Snowflake & dbt Cloud
Open to Work — Available Immediately

Let's Connect

I'm actively seeking data analytics roles in Dublin and open to remote opportunities. Whether it's a full-time role, contract, or collaboration — let's talk.

Typically responds within 24 hours.

Vishal Chaudhary

© 2026 Vishal Chaudhary. Built with passion for data.