
Title: Geostatistical Analysis of SARS-CoV-2 Positive Cases in the United States

Type: Dataset

Citation: Peter K. Rogan (2020): Geostatistical Analysis of SARS-CoV-2 Positive Cases in the United States. Zenodo. Dataset.

Authors: Peter K. Rogan (University of Western Ontario); Eliseos Mucaki (Western University)



Geostatistics analyzes and predicts the values associated with spatial or spatial-temporal phenomena. It incorporates the spatial (and in some cases temporal) coordinates of the data within the analyses. It is a practical means of describing spatial patterns and interpolating values for locations where samples were not taken, and it measures the uncertainty of those interpolated values, which is critical to informed decision making. This archive contains results of geostatistical analysis of COVID-19 case counts for all available US counties. Test results were obtained with ArcGIS Pro (ESRI). Sources are state health departments, which are scraped and aggregated by the Johns Hopkins Coronavirus Resource Center and then pre-processed by

This update of the Zenodo dataset (version 5) consists of three compressed archives containing geostatistical analyses of SARS-CoV-2 testing data. These datasets have been previously published in earlier versions of this archive (versions 2, 3 and 4):

Archive #1: “1.Geostat. Space-Time analysis of SARS-CoV-2 in the US (May25-Aug2-v4).zip” – results of a geostatistical analysis of COVID-19 cases incorporating spatially-weighted hotspots that are conserved over one-week timespans (from version 4 of this Zenodo archive). Results are reported from the initial relaxation of distance constraints on Memorial Day weekend 2020 for ten consecutive one-week intervals (May 25th through August 2nd 2020). Hotspots, where found, are reported for each individual state rather than for the entire continental United States.

Archive #2: "2.Geostat. Spatial analysis of SARS-CoV-2 in the US (Mar24-Jul13-v3).zip" – the results from geostatistical spatial analyses only of corrected COVID-19 case data for the continental United States, spanning the period from March 24th through July 13th 2020 (from version 3 of this Zenodo archive).

Archive #3: "3.Initial Geostat. of SARS-CoV-2 in the US(v2).zip" – the results from the geostatistical analyses performed for version 2 of this Zenodo archive which analyzed COVID-19 case data prior to any case correction step.

These archives consist of map files (as both static images and animations) and data files (including text files that contain the underlying data of the map files, where applicable). They were generated when performing the following geostatistical analyses:

  • Hot Spot analysis (Getis-Ord Gi*): consecutive week-long Space-Time Hot Spot analysis in Archive #1; daily Hot Spot analysis in Archives #2 and #3
  • Cluster and Outlier analysis (Anselin Local Moran's I): Archives #2 and #3
  • Spatial Autocorrelation (Global Moran's I): Archives #2 and #3
  • Point-to-point comparisons with Kriging and Densification analysis: Archive #3
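For readers unfamiliar with the statistics named above, the following is a minimal sketch of two of them, Global Moran's I and Getis-Ord Gi*, using their standard textbook formulas. It is not the ArcGIS Pro implementation used to produce this archive. The function names, the five-location chain, and the toy counts are all hypothetical and do not come from the dataset.

```python
import math


def global_morans_i(values, weights):
    """Global Moran's I for a list of values and a square spatial weights matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)  # sum of all weights
    num = sum(weights[i][j] * dev[i] * dev[j]
              for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / s0) * (num / den)


def getis_ord_gi_star(values, weights):
    """Getis-Ord Gi* z-score for each location.

    Gi* counts the location itself as part of its own neighborhood,
    so the diagonal weight is set to 1 here (an assumption of this sketch).
    """
    n = len(values)
    mean = sum(values) / n
    s = math.sqrt(sum(v * v for v in values) / n - mean ** 2)
    scores = []
    for i in range(n):
        w = list(weights[i])
        w[i] = 1.0  # include the focal location
        sw = sum(w)
        sw2 = sum(x * x for x in w)
        num = sum(wj * vj for wj, vj in zip(w, values)) - mean * sw
        den = s * math.sqrt((n * sw2 - sw ** 2) / (n - 1))
        scores.append(num / den)
    return scores


# Hypothetical toy data: five locations in a row, each adjacent to its
# immediate neighbors, with counts that rise steadily from left to right.
counts = [1.0, 2.0, 3.0, 4.0, 5.0]
chain = [[1.0 if abs(i - j) == 1 else 0.0 for j in range(5)] for i in range(5)]

print("Global Moran's I:", global_morans_i(counts, chain))
print("Gi* z-scores:", getis_ord_gi_star(counts, chain))
```

On this toy chain, Moran's I is positive because similar values sit next to each other, and the Gi* scores are negative at the low end, near zero in the middle, and positive at the high end. Real analyses of county data would also need a significance test and a weights matrix built from actual geography, neither of which is shown here.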

The Word document provided ("Description-of-Archive.Updated-Geostatistical-Analysis-of-SARS-CoV-2 (version 5).docx") details the contents of each file and folder within these three archives, and gives general interpretations of these results.

More information

  • DOI: 10.5281/zenodo.3986171
  • Language: en


  • Geostatistics, COVID-19, SARS-CoV-2, hotspots, space-time analysis


  • Publication date: 2020
  • Issued: August 16, 2020





