Climate Analysis: Investigating Regional Climate Dynamics

This project explores how climate patterns change over time using advanced data science techniques, including dimensionality reduction, clustering, and topological data analysis. By analyzing extensive climate data, I identified meaningful trends that offer insights into climate variability and urban heat effects.

πŸš€ Project Overview

This study focuses on understanding regional climate shifts through a data-driven approach. Using historical climate data from 1950 to 2023, I applied dimensionality reduction and clustering techniques to reveal trends in temperature, precipitation, and atmospheric heat transfer. The findings highlight evolving climate zones and increasing variability in regional climates.

πŸ“‰ Dimensionality Reduction

SVD Analysis Climate Regions

To extract the most relevant climate patterns, I applied Principal Component Analysis (PCA) and Singular Value Decomposition (SVD), reducing the dataset while retaining 98.7% of its variance. This allowed me to isolate the most influential climate trends over decades, making clustering techniques more effective and reducing the dataset from 36 variables to 15.

πŸ”Ή K-Means Clustering

K-Means clustering was used to define distinct climate zones by grouping areas with similar climate characteristics. Through optimization methods like the elbow method and silhouette scores, I determined the ideal number of clusters.

K-Means Clustering K-Means Clustering

These findings show the various regions with different climates, giving confidence to the use of K-Means Clustering for further analysis.

πŸ”Ή Hierarchical Clustering

Hierarchical clustering was used to uncover nested climate zones, capturing both major and minor variations without requiring a predefined number of clusters. Post-2010, a large shift in the different climate regions is evident.

Hierarchical Clustering

These results suggest that climate zones are not staticβ€”they evolve in response to urbanization, deforestation, and atmospheric changes. Hierarchical clustering provided a nuanced view of these transformations, allowing for better predictions of future climate trends.

πŸ”— Topological Data Analysis (TDA)

Traditional clustering methods assume predefined structures, limiting their ability to capture complex, non-linear relationships in climate data. To overcome this, I applied Mapper-based Topological Data Analysis (TDA), which preserves high-dimensional structures and identifies hidden patterns that might be missed by standard statistical methods.

πŸ›  How It Was Done

Topological Mapper Results by Time Topological Mapper Results by Region Region Map

πŸ“Š Key Findings