Path: blob/master/ML Clustering Analysis/3 Targeted Marketing Strategies Based on Financial Clustering.ipynb
3074 views
Analyze salary vs. spending data using K-Means clustering in Python to segment customers into distinct financial groups.
By identifying patterns in income and expenditure, businesses can create personalized marketing strategies tailored to each cluster’s financial behavior. This approach enhances customer targeting, optimizes promotional efforts, and improves overall marketing efficiency.
Exploratory Data Analysis (EDA)
Visualizing the Data Distribution
5. Determining the Optimal Number of Clusters
Elbow Method The Elbow Method involves plotting the within-cluster sum of squares (WCSS) against the number of clusters and finding the "elbow" point.
5.2. Silhouette Score
The Silhouette Score helps in evaluating the quality of clusters.
Silhouette Coefficient
One of the metrics to evaluate the quality of clustering is referred to as silhouette analysis. Silhouette analysis can be applied to other clustering algorithms as well. Silhouette coefficient ranges between −1 and 1, where a higher silhouette coefficient refers to a model with more coherent clusters.
Significance of Silhouette Score in k-Means Clustering The Silhouette Score is a metric used to evaluate the quality of clustering in unsupervised learning, such as k-Means. It measures how well data points are assigned to their clusters by comparing:
Cohesion – How close a point is to others in its own cluster.
Separation – How far a point is from points in the nearest neighboring cluster. The Silhouette Score (S) for each point is calculated as:
Best Range for Silhouette Score
The range of the Silhouette Score is between -1 and 1:
+1 → Perfect clustering (point is far from other clusters and well within its own cluster).
0 → Overlapping clusters (point is on the decision boundary).
-1 → Poor clustering (point is closer to another cluster than its own).
What is a Good Silhouette Score?
0.5 to 1.0 → Strong clustering, well-separated clusters.
0.2 to 0.5 → Reasonable clustering, but some overlap exists.
Below 0.2 → Weak clustering, clusters may not be meaningful.
Negative values → Misclassification, points may be in the wrong clusters.
6. Applying K-Means Clustering
Applying K-Means Clustering for Different Values of K (3 to 5), Assign group names to the final clusters
Based on the Elbow and Silhouette methods 4 clusters provide the best fit
Conclusion
Low Salary & Low Spending:
Individuals in this cluster have relatively low salaries and spend less.
This group might represent entry-level employees or individuals in lower-income brackets.
Insight: Focus on budget-friendly products or services and potential financial advice for this group.
High Salary & High Spending:
Individuals in this cluster have high salaries and spend more.
This group could represent high-income professionals or individuals with significant disposable income.
Insight: Target premium products, luxury services, or investment opportunities for this group.
High Salary & Low Spending:
Insight: Target for Ssaving Schemes
Moderate Salary & Spending:
Individuals in this cluster have moderate salaries and spending patterns.
This group may include mid-level professionals or individuals with moderate financial flexibility.