The main goal of this analysis is to provide a visual and updated overview of the Covid-19 outbreak in Italy including:
The data is updated on daily basis.
We define a function to get the raw data from the corresponding url and retruning it as a dataframe.
This function allows us to get the latest updates.
Import the latest reports from the Presidenza del Consiglio dei Ministri - Dipartimento della Protezione Civile.
There are four datasets:
Date of the latest data to be gathered is yesterday to ensure the data is available as the files are updated daily at midnight.
We use the date variable "ieri" to build the urls dynamically. The date format used in the Italian repository differs from the one used by the CSSE at Johns Hopkins University, this is why we need to build it.
We rename the columns to translate them to English.
Active cases keep growing on national level, however the growth of new cases has been slowing down since the beginning of April. If the currently positive curve continues to follow the trends from China and Korea, the growth could be expected until at least one month since the countrywide lockdown started.
Lombardia is the region with the bigest share of cases of those currently active.
As the outbreak hit Lombardia first, the number of fatalities and recovered cases are higher, explaining the lower number of currently cases in comparison to the total of cases. As of 30.03 the number of active cases in the region is slowing down.
In the other regions, the outbreak started more recently, therefore the actual positive and total positive in those regions are very close to each other and the curve of active cases keeps growing.
As of 31.03.20, the peak of active cases is registered on 29.03.20. However, the number of active cases started to grow again in the first week of April, so the peak of cases hasn't been reached yet.
Looking at the trends in China and Korea, for the former the trend flattened for about five days after the peak before starting to go down; for the latter, there were 5 days of oscillations after the peak value until the curve started to go down steadily.