Skip to content

codered2104/customer-personality-cleaning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 

Repository files navigation

Task 1 – Data Cleaning: Customer Personality Analysis

Dataset Used

Customer Personality Analysis from Kaggle
Original File: marketing_campaign.csv


πŸ›  Tools Used

  • Python 3.12
  • Pandas
  • Jupyter Notebook / VS Code

Cleaning Steps Performed

  1. Loaded dataset using read_csv() with tab separator (\t)
  2. Cleaned column names:
    • Lowercased
    • Removed spaces and replaced with underscores
  3. Handled missing values:
    • Filled missing income values with the column mean
  4. Removed duplicate rows
  5. Standardized text fields (education, marital_status)
    • Converted to lowercase, removed extra spaces
  6. Converted dt_customer column to datetime format
  7. Fixed data types:
    • income as float
    • year_birth as int
  8. Saved cleaned data as cleaned_customer_data.csv

Output Files

  • cleaned_customer_data.csv – Cleaned dataset
  • Task1.ipynb – Python code
  • README.md – This summary document

##Submission Project for Data Analyst Internship – Task 1

About

Data cleaning for internship

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors