Dataset
The dataset comes from a competition hosted by Analytics Vidhya.
Train dataset contains 550,068 observations about the black Friday in a retail store. 12 variables either numerical and categorical are included.
​
Variable Definition
User_ID User ID
Product_ID Product ID
Gender Sex of User
Age Age in bins
Occupation Occupation (Masked)
City_Category Category of the City (A,B,C)
Stay_In_Current_City_Years Number of years stay in current city
Marital_Status Marital Status
Product_Category_1 Product Category (Masked)
Product_Category_2 Product may belongs to other category also (Masked)
Product_Category_3 Product may belongs to other category also (Masked)
Purchase Purchase Amount (Target Variable)
The test dataset contains 233599 examples about black Friday in this retial store without 'Purchase' attributes.