IS 445 - Group 5 - Final Project

Group Members: Nataly Panczyk, Betty Guerrero, Thomas McShane, Shubham Jain and Brian Rao

Objective

The goal of this project was to create a variety of complementary visualizations using data collected from the 2020 World Happiness Report. This notebook will provide a walk through of the final visualizations created by our group as well as descriptions for each visualization, major challenges in creating that visualization, and why we think these visualizations are valuable.

About the Data

https://worldhappiness.report/ed/2021/#appendices-and-data

The first eight World Happiness Reports were produced by the founding trio of co-editors assembled in Thimphu in July 2011 pursuant to the Bhutanese Resolution passed by the General Assembly in June 2011 that invited national governments to "give more importance to happiness and well-being in determining how to achieve and measure social and economic development." The overall goal of this organization is to create new metrics for measuring economic progression around the world beyond fiscal quantities. This dataset was particularly interesting to us because it's funded and supported by prominent universities, non-profits, and private companies, and shows one effective impact of these types of institutions when they work collaboratively for a genuinely good cause. This dataset seeks to benefit the development of our world for the humans that inhabit it, which is what inspired us to choose it for our final visualization project.

Comparing Life Expectancies in Different Countries From 2005 to 2006

The first component of this dataset that our group sought to analyze was the life expectancy in the various countries over time. We went about creating this visualization by first sorting the data by country, and then plotting the various points in 2 different ways using ipywidgets. First, the plot will display a checkbox that defaults to the static visualization. This will show the life expectancies in every country with a different colored marker (as indicated by the legend) for each year in the dataset, 2006-2020. By unchecking this box, a new widget will appear (a slider), that allows the user to define the end year of the dataset. This allows for the user to interact with the visualization on a more iterative level and identify the countries with the greatest changes in life expectancy more easily. The greatest difficulty we faced in creating this visualization was getting ipywidgets to update our plots when the sliders and checkboxes were interacted with. Eventually, we resolved this issue by using the %matplotlib notebook command instead of %matplotlib inline.

Generosity Rankings by Country and Year

Below is a graph that shows all the countries that are included in our dataset. This graph allows you to be able to see the average generosity of each of the counties in the dataset over the entire time that there has been data for that country. In order to see the generosity of a country simply hover your mouse over the desired country. You can use the square icon on the left to draw a box around an area that you would like to zoom in on. You can also use the icon with four arrows to drag the map around. Lastly, use the home button to revert the graph to its original state

Average Freedom to Make Life Choices Countries

The following two visualizations show the top 3 and lowest 3 countries and their ranking of freedom to make life choices. The data used was the average of the highest 50 countries and the average of the lowest 50 countries across all of the years the data set included.

The above visualization uses data from the top 50 highest averages of all the years between 2005-2020 of the score on a scale of 0-1.0 on their freedom to make life choices. The visualization itself shows the top 17 countries with the highest average scores. From the visualization we can see that Uzbekistan, Denmark, and Finland are the top 3 countries with the highest average scores. This finding was especially interesting given the nature of these three countries. We know that countries like Finland are notorious for having universal health care and paid family time off. It makes me wonder how much of these qualities go into affecting one's perception of being able to make life choices. Another surprising find was that the United States was not one of the countries to make the list which was also very, very surprising to me.

The above visualization uses data from the lowest 50 highest averages of all the years between 2005-2020 of the score on a scale of 0-1.0 on their freedom to make life choices. The visualization itself shows the 11 countries with the lowest average scores. From the visualization we can see that Bosnua and Herzegovina, Palestinian Territories, and Pakistan are the 3 countries with the lowest average scores. This finding was also very interesting, but it makes more sense to me than the highest 50. I think given the world situations in many of these places such as 'Palestinian Territories' it is not surprising that they make the bottom of the list. I think when taking into consideration their world events and the current state of their land, it is no surprise they rank themselves as having less freedom to make life choices.

Positive affect in various Countries as a function of year

This plot below represents the happiness of poeple in countries in a particular year as selected using the slider. Happiness affect data is collected by surveying a sample from the country asking poeple if they smiled or had a happy moment during the past 24 hours of their lives.

Top and Bottom Avg. Log GDP per Capita Countries, with Avg. Social Support Ratings

Exploratory data analysis revealed Social Support to have a high correlation with Log GDP per Capita values in the data set. The following visualizations display the highest and lowest ranked countries by Log GDP per Capita, accompanied by Social Support ratings on a scale of 10. The visualization of the top 25 Log GDP per Capita countries displays generally high levels of accompanying average Social Support ratings across the board in those countries. On the other end, the lowest 25 countries by Log GDP per Capita displayed expectedly low average Social Support ratings, with some countries having surprisingly high values relative to the rest (noted below).

Above is a graph of the top 25 countries by average log GDP that shows generally high average Social Support ratings, the lowest of these being Qatar, United Arab Emirates, Kuwait, Hong Kong, Taiwan, Saudi Arabia, and Bahrain.

Above is a graph of the bottom 25 countries by average log GDP that shows generally low average Social Support ratings, the highest of these being Lesotho, Zimbabwe, Mali, Uganda, Mozambique, and Congo. It is clear that the bottom 25 average GDP per Capita countries have a higher variation in Social Support Rating than the top 25 average GDP per Capita countries.