News register
Search section
Search area
News Type
News type
Search date
Search word OR
List of related articles
Contents of related articles
No info was found
List of related articles
Contents of related articles
No info was found
View details
Information

03/22/2020 Special > Special

Title

Hanyang's Dataset Can Predict the Coronavirus

Student of Hanyang distributed the coronavirus dataset to the world

황희원

Copy URL / Share SNS

http://www.hanyang.ac.kr/surl/d24IB

Contents
As the coronavirus became a global epidemic, a Hanyang student developed a coronavirus dataset to help predict the spread of coronavirus. The dataset, named Data Science 4 COVID19 (DS4C), contains conveniently reprocessed data about the spread of the disease and is currently being shared with the world’s data scientists, conveying a message of willingness to work together in the face of this pandemic.

DS4C is a structured dataset which offers reprocessed data about the coronavirus spread for easy analysis. It analyzes and visualizes the information provided by Korea Centers for Disease Control and Prevention (KCDC) and local governments. Kim Ji-hoo (Department of Computer Science, Master’s program), the project manager, explained that the aim of the dataset is to provide data in a more comprehensible format, ultimately to find meaningful patterns that can help predict the disease.
 
The coronavirus dataset, Data Science 4 COVID19 (DS4C), developed by Kim Ji-hoo (Department of Computer Science, Master’s program). The dataset is shared on Kaggle for the world's researchers.
(Photo courtesy of Kim)

Kim started the project because despite the seriousness of the virus and his willingness to help the situation by using the data, information available was very complicated to digest. So, he developed his own coronavirus dataset as a tool for any further data analysis.

The dataset organizes detailed information including patient information; patient routes traveled; Seoul’s floating population; time series data based on age, gender, and province weather; and more. This categorized information can perform various tasks, such as help predict the changes in the number of confirmed cases, visualize the data into a more legible format, and analyze the connections between the confirmed cases.

The English version of the dataset is currently available on Kaggle, an online community of data scientists from around the world, in order to help other data experts who are working on similar tasks. It has become hugely popular, quoted and referred to by many scientists around the world. "A developer in France even called me to personally thank me for sharing my dataset,” said Kim. Thanks to his contribution, he received a gold medal on Kaggle as the most referenced project, making him the first Korean recipient of this honor.
 
Kim was the first Korean to win a gold medal on Kaggle for developing the most referenced project.
(Photo courtesy of Kim)

Kim expressed his gratitude towards his colleagues who worked together on the project. “It helped me in many ways because their feedback allowed me to discover methods that I could never have thought of alone,” he said. Kim also said he is proud that his work contributed in making the data flow, reprocess, and reach other researchers and encouraged his fellow Hanyangians to help overcome the pandemic in their own individual fields. 



Hwang Hee-won        whitewon99@hanyang.ac.kr
Copy URL / Share SNS

0 Comments