Visualizations of COVID-19 outbreaks and transmission networks around the world reveal how interconnected we truly are, across all borders and oceans.
Using viral genomes collected and shared by researchers from around the world, they found how shortly after the virus emerged in Wuhan, China in November 2019 it jumped to Asia, western Europe, Australia, Canada, and the United States, and eventually South America and Africa.
Coauthor Pavel Skums, an assistant professor of computer science at Georgia State University, has applied similar bioinformatics techniques to track transmission and outbreaks of other infections such as Hepatitis C.
He and Gerardo Chowell, professor of mathematical epidemiology in the School of Public Health, now have new tools and extraordinary amounts of data at their disposal. Creating a global COVID-19 transmission network like this is possible because of advances in genomic sequencing technologies that have made sequencing rapid and affordable.
Multiple points of entry show that “it is not enough to try and find the single patient zero.”
“The data on the virus is growing as fast as the virus,” Skums says. “This is actually the first outbreak in history where we have so much data. It’s the first global public health emergency for which next-generation sequencing technologies have been employed at such a vast scale. For Ebola, we had nothing of this magnitude.”
Global COVID-19 transmission networks
Skums says he was working at the Centers for Disease Control and Prevention at the time of Ebola, and “scientists were traveling to Africa to help produce and analyze the data.”
Scientists can now access global data from their own “shelter-at-home” computers, working together to solve the challenges the coronavirus presents. Skums, Chowell, and computer science doctoral students Pelin Icer Baykal and Fatemeh Mohebbi are mining freely available data from the GISAID database, a global database where researchers upload their virus sequences, as well as related clinical and epidemiological data.
The team’s analysis allows them to determine where the virus has peaked, is peaking, or is yet to peak.
“Right now we see that the hotspots like New York City, Italy, and Spain, have reached their maximum incidence rate,” Chowell says. “They are leveling off or just started to follow a downward trend, though at very high levels.”
Atlanta is about a week from the peak, he says, because interventions were not implemented there until recently.
Scientists sequenced the first genome of the novel coronavirus in January, but scientists around the globe have since sequenced more than 5,000 other genomes of the virus.
“Global modeling like this helps us understand that there was not one single introduction of the virus in each country,” says Skums.
Almost every country had multiple introductions of the virus, depicted as multiple arcs across land and ocean. For example, strains of the virus landed in Hong Kong via Shanghai, and jumped to the United Kingdom, Italy, Norway, Portugal, France, and even Iceland. France may have received the virus from multiple countries, ranging from Iceland to Switzerland, Finland, Portugal, Spain, and Australia. Washington can link to Canada, Shanghai, and Australia, among other places.
Multiple points of entry show that “it is not enough to try and find the single patient zero,” Skums says. Epidemiologists try to determine that first patient, because they can use the information to help determine the ultimate curve of exponential growth that fans out from that first infection. They do this through mathematical models that include how contagious the virus is, how long the incubation period is, whether it can be transmitted while someone is asymptomatic and other factors.
This information is not enough in a global pandemic.
“Our model really shows that closing travel from one country, such as China, won’t make enough of a difference,” Skums says.
By the time a country or the world realizes the danger of a pandemic, the seeds have already widely dispersed.
“An epidemic is formed by clusters, or local outbreaks, that are not entirely synchronous,” says Chowell, who has studied the arc of outbreaks ranging from 1918 flu pandemic to the Ebola epidemic of 2014. Chowell says one can create an overall picture of a pandemic across the world from those clusters, but one can also drill down to discover just how and where the virus is moving, find out where more severe outbreaks are occurring and predict what lies ahead for the next 2-3 weeks locally and globally.
A lot like Twitter
The researchers’ model shows multiple interconnected vertices across the world, and each vertex, they say, represents one distinct genome of the virus that has been sequenced.
The model, which researchers update weekly, shows where the most intense outbreaks are and the infectious arcs that branch out from them, like the fan of an immense spider web. Skums compares this to a person on Twitter who has a few million followers.
“Their social network is huge,” Skums says. “But another person on Twitter may have only a few followers.”
This kind of information is especially important as a new virus like this spreads rapidly through what is known as a “naïve population.” Humans have never encountered this virus before, so the human immune system has no innate defenses against it.
Skums hopes that as researchers study the transmission networks in the coming weeks and months, they may be able to see where outbreaks are more intense, which strains are most prevalent and whether certain strains may have mutations responsible for higher infectivity or virulence.
He and Chowell also have begun to model COVID-19 transmission networks at a state level, which will let individual states see when their peak has occurred and how many cases are forecast in the next few weeks.
The findings appear on medRxiv.
Source: Georgia State University