Welcome to my Data Analysis page
I initially obtained this data set from Kaggle as a csv file. It comprises of 5 columns and 4378 rows. I run Python using the Jupyter Notebook interface.
The data set has sales figures for a wide variety of automobile makes - European, American, Japanese etc.
The Month values range from 1 to 12. The year values range from 2007 to 2017.
The data for 2017 is not complete however as only January figures for that year are published.
I wish to have full year figures so I discard the 2017 figures.
My analysis is therefore sales figures for the period 2007 to 2016.
I upload and modify the data set as follows
The dataframe 'df' forms my new raw data set from which I work.
I was curious about the 10 year total of cars sold
To make analysis of the 'df' dataframe easier I thought I should break the sales down by year
I got the total sales for 2007
I did the same with all the other years
I then created a new dataframe called 'Annual_sales' and obtained a bar graph from it
Car sales took a dive during the global economic depression around 2008 but has been on the steady rise since
At this point I was curious about Toyota sales for 2007
How about Toyota sales for all the Januaries between 2007 to 2016?
At this point I really wanted to get an idea of how each automobile make performed over the 10 year period
I took the original dataframe 'df' and extracted the sum of the Quantity value based on Make. Being only interested in the Quantity figure at this point, I created a new dataframe 'AllSales_10yrs'
I was curious as to whether the Quantity sum value in 'AllSales_10yrs' would match that in 'df'
Unfortunately no! 1334195 initially vs 1334179 at this point
I will explore why later on
With the 'AllSales_10yrs' dataframe revealing 65 different makes, I was only interested in the top 10
I create a new dataframe 'df2' with the top 10 Makes. I then plot a pie chart of this information
Conclusion
Volkswagen commanded a 21.4% share in car sales in Norway for the period 2007 to 2016, followed by Toyota with 17.4% and Ford and Volvo with 10.3% each