covid19india.org and covidIndia.org tell TNM on how they manage data amid the exponential rise in cases.

A health worker during a camp to collect swab samples for COVID19 testingPTI
Coronavirus Data Wednesday, July 08, 2020 - 12:46

In January 2020, when only a handful of COVID-19 cases had been reported in India, Girish Khera, the co-founder and COO of Scientific Animations, realised that the coronavirus could infect India at an alarming rate.

However, even as the country geared up to fight a virus it knew little about, there was a paucity of data. That's when several entrepreneurs and organisations came together to crunch the missing data. Now, covid19india.org and covidIndia.org have become the go-to pages for COVID-19 updates. 

For both these websites, data collection started with a simple spreadsheet. 

Girish, the founder of CovidIndia.org, told TNM that their early efforts began in late February when they released a 3D animation about the virus and the biology behind it. “That got 8 million views,” he said. 

He then reached out to a relative, an IAS officer, to ask whether India was prepared for a possible pandemic. He ended up meeting with officials from the Health Ministry, and started a website to track the COVID-19 cases in India. “It came from this desire to raise awareness about what was coming,” he said. 

The push behind COVID19India was something similar. “[On] March 12, 2020, a few of us were discussing how India doesn’t have one single source of information and maybe we can do something about it,” the team told TNM.

“The original dashboard was very primitive, a single page with a map and a table with just the state data. We were following an open-source COVID tracker made to track cases in Japan covid19japan.com and were inspired by them.”

The team’s idea was to have one spreadsheet which a few were allowed to edit and everyone could view. However, within a week, the website had too many visitors and editing became difficult. It was at this time that the team switched to a website that would display updated numbers.

As cases began to rise, the two websites branched out to involve more people to help with the data. In the beginning, with not much data on hand, and with multiple levels of administrations giving different data, collation became a challenge. 

Sourcing data

While both websites earlier relied on official Twitter handles and media reports for data, it has now streamlined as most states have dashboards or press releases.

Girish from CovidIndia.org said that they post data from any official source at the state or national level. 

“This includes state ministries, the Ministry of Health website, the Press Information Bureau or Twitter accounts of officials, whether they are state health ministers or the union health minister. As long as they are official accounts of officials, those also count as official data sources,” he said. 

To overcome roadblocks in fact-checking, Girish says they go only by official sources “unless there's something that looks like there's a typo or there's something that just doesn't fit, in which case we conduct multiple verifications.”

Girish compares today’s situation to the beginning of the pandemic when official sources were restricted to Wikipedia or the Johns Hopkins COVID-19 tracker. “In the beginning, it was very ‘Wild West-y. We were getting a lot of information from the news, and we had to corroborate that with official versions.”

Meanwhile, COVID19India said that collating the data to fit into a required format was a challenge. The team gets data from 35 different government sources, most of which come in different formats. Some bulletins report patient totals, some report individual patients details and some have paragraphs in vernacular language — and their challenge is to collate this data into a single accessible format.

Manpower

Both websites work with the help of a team of volunteers. 

“We have a team of volunteers. We have a 21-year-old, Himanshu, who has been capturing data since January 27. And he was doing it for the entire world in the month of February. Then I started focusing on India starting March. We built groups of volunteers around him and we've turned that into proper teams. We also went into partnerships, and we're lucky right now to have a partnership with Jaipuria Institute of Management. They gave us 50 volunteers and interns who do all the data collection at a base level for cases,” Girish said.

At COVID19India, there are around 97 volunteers feeding data into the website. 

“First around five to ten volunteers joined in and within a week, thousands of volunteers pitched in. Several groups were bifurcated to handle several functions. Currently, around 25 volunteers regularly monitor press releases and official Twitter handles to gather, structure and input data to the Google sheet. A large number of developers continue to contribute through Github. The number of contributors stands at 97 at the moment,” the team said.

Challenges

With the cases touching 7 lakh, both teams say data crunching has become a challenge. 

“We have a sheet where we monitor the time when a health department publishes data and we break up our volunteers into shifts. So if there's some slow data at any point during the day, we're kind of constantly checking to decide if we need to parachute in and show those,” Girish added.

He further said that the biggest failure on India’s part is not releasing enough data.

“All the decisions we're making today about cases, about the biology of the virus, and the treatment of disease, are based on the limited data that we have, and on the data that many other countries have provided. But India is different. it has a different geography and a different demographic. We have also seen that this particular disease presents itself very differently in different places,” Girish said.


Girish Khera

He further said that India is not publishing enough on the virus, but there just aren’t enough submissions. 

“There's a lot more data we could produce that could also help us understand what is different about the virus in India and what is the same. Therefore, that could show us how we should attack it, or how our treatment protocols are working. Different states are doing different treatment protocols and if one state is doing a really good job of treatment protocols, we should be shouting from rooftops that this works,” he said.

Besides supplying the necessary data, the websites have also now branched out — providing lists of labs and hospitals, quarantine centres, current rules and regulations, ambulance services, and even busting fake news. The two teams have managed to contribute to filling a data-sized hole in India’s COVID-19 fight. 

Show us some love! Support our journalism by becoming a TNM Member - Click here.