I'm a data analyst focused on transforming complex data into meaningful stories. I work with SQL, Python, and visualization tools to help teams make smarter decisions.
I’m a data analyst with hands-on experience working with large-scale datasets and building data-driven solutions across multiple industries. My work includes data cleaning, exploratory analysis, and developing predictive models to support business decisions. I have experience with Python, SQL, and machine learning techniques such as regression, classification, and ensemble methods. I’m currently focused on deepening my expertise in machine learning and transitioning into more ML-focused roles.
B.Sc. in Data Science
The University of Texas at Dallas, 2022
Data Analyst
Raas Infotek, 2024-Present
End-to-end regression modeling | EDA → Feature Engineering → Model Evaluation
Built a regression model to predict California housing prices using demographic and geographic data from ~20K+ records. Conducted exploratory data analysis and statistical validation (OLS) to identify significant predictors. Improved model performance by incorporating spatial features, increasing R² from ~0.50 to ~0.60 and reducing RMSE by ~8%. Identified median income and location as key drivers of housing prices.
Exploratory Data Analysis | Data Cleaning → Feature Exploration → Insight Generation
Conducted EDA on ~20K Airbnb listings to analyze pricing dynamics and neighborhood trends.
Identified location and room type as primary drivers of price, with central districts commanding premium rates.
Explored host-level attributes and review behavior to uncover patterns in customer engagement.
Python, SQL, R, MATLAB
Regression, Classification, GLM, Neural Networks, OLS, Model Evaluation (RMSE, R²)
EDA, Feature Engineering, Data Cleaning, Statistical Analysis
Tableau, Power BI, Plotly, Matplotlib, Seaborn
ETL Pipelines, Data Validation, SQL Optimization
Git, Jupyter, Google Colab, VS Code
A/B Testing, Causal Analysis, Hypothesis Testing
SQL Server, Relational Databases, Data Modeling