A study on ways to extend public data for game ratings from Korea

As of 2020, the public data for game ratings provided by the Game Ratings And Administration Committee (GRAC) are more limited than the public data for movie and video ratings provided by the Korea Media Rating Board, and they do not allow rating information to be examined clearly and in detail. To obtain information on a game rating, users have to search for a specific title on the GRAC homepage, which is inconvenient. To reduce this inconvenience and extend the scope of the public data provided, this paper studies a public data API that is extended based on the information provided for video ratings. To derive the items to be extended, this study analyzes the rating data on the GRAC homepage and designs a collection system to build a database. It then implements a system that provides the collected data, based on the extended public data items, in the form that users request. By making GRAC's rating information available in this way, this study is expected to strengthen fairness, satisfy the right to know of game users and the public, and contribute to the promotion and development of the game industry.


Introduction
As leisure and cultural activities have increased, games have become one of people's central cultural activities [1]. In addition, as technology has developed and smartphones have become widespread, games that people once enjoyed on PCs and video game consoles have been ported to high-specification smartphones, which makes it possible to play anytime and anywhere. As the demand for games has increased, the game industry has grown. According to the white paper on games, the Korean game market has continued to grow and has a higher export volume than other markets. At a time when public interest in games is increasing and the Korean game industry is achieving good results at home and abroad, regulatory review procedures impede the growth of the game industry. According to Article 21 of the Game Industry Promotion Act, a person who intends to produce or distribute a game product for the purpose of circulating the game product or providing for the use thereof shall receive a rating for the contents of the game product from the Committee or business entities designated before producing or distributing such game product [3]. Rating review aims to protect young people, uphold game ethics, and prevent speculation, but studies and public opinion have reported that rating review is excessively regulatory [4][5][6]. The Game Ratings And Administration Committee (GRAC) was launched in response to "The Sea Story", a speculative game scandal that occurred in Korea in 2006 [7]. Rigid and unreasonable reviews, such as the regulation of indie games on zuzunza.com in 2019 [8] and the regulation of Steam in 2020, have impeded the growth of the Korean game industry [9].
Voluntary rating was introduced to relax review regulation, and a game monitoring unit for ratings was launched in 2019. However, as shown in [Figure 1], the monitoring unit was composed of people reported to have a poor understanding of games, such as career-interrupted women and people with disabilities, which led to controversy over fairness and the members' expertise [10-12]. It is therefore necessary to study whether GRAC, which is responsible for rating and reviewing games, discloses information properly and whether rating and review are performed fairly.
GRAC is a public institution, and information on game ratings and reviews is public data. Public data aim at "improving quality of life and the national economy by promoting people's use of data owned and managed by public institutions," which creates an obligation to disclose data on game ratings and reviews [13]. This study analyzes and classifies the content of the public data provided by GRAC and examines how the public data API information is provided.
In addition, this study intends to make information on ratings and reviews available to game users and industry workers in a form they can use.

Analysis of public data by Game Ratings And Administration Committee
GRAC provides information on games in the form of an Open API (Application Programming Interface) in accordance with public data policy.
However, the information on game rating review that can be obtained through this API is limited. As shown in [Figure 2] and [Table 3], the data obtained through the API simply enumerate rating results and basic game information, and the reasons why a specific game received its rating are insufficient and limited. On the other hand, as shown in [Figure 4], the movie rating Open API provided by the Korea Media Rating Board includes rating results together with information on the movie, such as the synopsis and lead roles, and also provides the reasons for the rating so that people can check them [15].
Public data that allow people to check information in more detail can help satisfy the public's right to know and can be used by application services that provide information on movies. Like the movie rating Open API, GRAC needs to provide data and information that conform to the intent of ratings and the purpose of public data. People can check information on games and reviews by searching by game name and rating number on the rating decision check page of the GRAC homepage.
However, this information is not included in the public data API, and its utilization is low because the process of obtaining detailed review information on a game is complex and the data are not provided in a form that can be processed. Accordingly, this study collects and analyzes the rating review data provided by GRAC to enhance the utilization and quality of public data.

Method of study
As the method of study, the game rating data provided on the GRAC homepage are analyzed against the public data for video ratings provided by the Korea Media Rating Board, as shown in [Figure 3], and the items to be improved in the public data for game ratings are derived.
A system that collects game rating data in a systematic way is designed and implemented, the data are collected and classified, and a database is built to provide users with extended public data.
The homepage has links to pages related to games and rating review, such as the "page of detailed information on games", the "page of a written decision", the "page of rating history", and the "status of overseas ratings obtained". [Table 5] shows the items that people can check. Each item includes basic information on the game, such as "genre" and "nationality", and people can check basic rating information, for example "date of rating" and "rating number". [Table 5] summarizes the description and sample data for each item on the relevant page.
The page of a written decision on game rating allows people to check the rating decision for games that requested rating review, as shown in [Figure 6]. The content of the decision includes the reason for the rating, a one-sentence description of the game, and information on the game contents. Under the item "indication of content information", sensationality, violence, fear, inappropriate language, drugs, crime, and speculation are indicated based on the rating criteria described in related studies.
From the page of a written decision on games, people can collect the content information indications that influence the rating and the reason for the rating decision. [Table 6] summarizes the description and sample data for each item on the relevant page.
[Table 7] proposes the items to be improved. The required items are drawn from the "page of detailed information on games" and the "page of a written decision on game rating" and are organized into two groups, game information and rating review information, based on the public data provided by the Korea Media Rating Board. The relevant items can be extracted from the GRAC database that holds the game information, or they can be collected from the GRAC pages. Using the sample data allows information on game rating review to be checked more clearly and lays a foundation for big data analysis and visualization.
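The two groups of items described above can be pictured as a single record type. The following is a minimal sketch in Java, not the schema of [Table 7]: the class, field, and enum names are hypothetical, the fields are limited to items named in the text (genre, nationality, rating number, rating date, reason, and the seven content information indications), and the sample values are placeholders rather than collected data.

```java
import java.util.Collections;
import java.util.EnumSet;
import java.util.Set;

/** The seven content information indications named on the written decision page. */
enum ContentDescriptor {
    SENSATIONALITY, VIOLENCE, FEAR, LANGUAGE, DRUGS, CRIME, SPECULATION
}

/** Hypothetical extended record: game information plus rating review information. */
final class ExtendedRating {
    // Game information (page of detailed information on games)
    final String gameName;
    final String genre;
    final String nationality;
    // Rating review information (page of a written decision)
    final String ratingNumber;
    final String ratingDate;
    final String reason;
    final Set<ContentDescriptor> descriptors;

    ExtendedRating(String gameName, String genre, String nationality,
                   String ratingNumber, String ratingDate, String reason,
                   Set<ContentDescriptor> descriptors) {
        this.gameName = gameName;
        this.genre = genre;
        this.nationality = nationality;
        this.ratingNumber = ratingNumber;
        this.ratingDate = ratingDate;
        this.reason = reason;
        // Defensive copy; starting from an empty EnumSet also handles an empty input set.
        EnumSet<ContentDescriptor> copy = EnumSet.noneOf(ContentDescriptor.class);
        copy.addAll(descriptors);
        this.descriptors = Collections.unmodifiableSet(copy);
    }

    /** True when the written decision indicates the given descriptor. */
    boolean flags(ContentDescriptor d) {
        return descriptors.contains(d);
    }
}

class ExtendedRatingDemo {
    /** Placeholder values only; none of these come from GRAC data. */
    static ExtendedRating sample() {
        return new ExtendedRating("Sample Game", "RPG", "Korea",
                "SAMPLE-0001", "2020-05-12", "placeholder reason text",
                EnumSet.of(ContentDescriptor.VIOLENCE, ContentDescriptor.FEAR));
    }

    public static void main(String[] args) {
        ExtendedRating r = sample();
        System.out.println(r.gameName + " flags violence: "
                + r.flags(ContentDescriptor.VIOLENCE));
    }
}
```

Keeping the content indications as an enum set rather than free text is one possible design choice; it would make it straightforward to count or filter games by indication in later analysis.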

Designing and implementing a public data collection system for extended game rating data
To provide users with the extended game rating public data shown in [Table 7], this study designs a system that collects game rating data. The collection system was designed and built on the system environment shown in [Table 8]. A parsing technique, which is well suited to extracting specific data or items, was used, together with the Jsoup library for parsing in a Java environment. Jsoup makes it easy to find or extract the data that users want by means of DOM traversal or CSS selectors, and it can implement the collection process by accessing a page with simple code. Jsoup can parse and access both invalid and valid tags and provides a tree-structured approach to the document [17].
The system was designed as shown in [Figure 7] to collect rating data from the GRAC homepage. It consists of a URL collection module, a connection module, a detailed collection module, and a storage module, and a scheduler controls the requests and the modules. The system first collects the detailed page URL of each game and verifies whether data exist at that URL. It then accesses the detailed page URL of each game, collects the rating data, and stores them in the database to build the game rating data set. The scheduler reports whether the data were stored normally, the time at which collection finished, and the time at which storage was completed.
Game rating data were collected with the implemented system, and a total of 80,180 records were collected from May 12, 2020 to May 15, 2020. When a user requests information on a game from the game rating database built through the collection system, the requested data are delivered from a pre-indexed data set. The requested data are sorted by ID and then provided to the user in XML and JSON form, as shown in [Figure 8].
[Table 10] shows the API result data provided by GRAC and the extended result data. The existing data show only fragmentary information such as the game name, the decided rating, and the rating institution.
On the other hand, the extended data show detailed information on games, such as genre, platform, and game description, as well as the reasons for the review decision and content indications such as sensationality and violence. The extended data can be used as basic data for analyzing ratings.
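The collection flow described in this section (URL collection, connection check, detailed collection, storage, and delivery sorted by ID) can be illustrated with a small sketch. This is not the authors' implementation: the class and method names are hypothetical, an in-memory map stands in for the GRAC pages so that the example runs without network access or Jsoup, and the JSON output is a simplified illustration rather than the format of [Figure 8].

```java
import java.time.Instant;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Sketch of the four collection modules coordinated by a scheduler. */
class CollectionPipeline {
    /** Stand-in for the rating site: detail URL to page content (null means no data). */
    private final Map<String, String> site;
    /** Storage module: records keyed by ID; a TreeMap keeps them in ID order. */
    private final TreeMap<Integer, String> database = new TreeMap<>();
    /** Scheduler log: one status entry per URL. */
    final List<String> log = new ArrayList<>();
    /** Time at which the collection run finished. */
    Instant finishedAt;

    CollectionPipeline(Map<String, String> site) {
        this.site = site;
    }

    /** URL collection module: list the detail page URLs. */
    List<String> collectUrls() {
        return new ArrayList<>(site.keySet());
    }

    /** Connection module: verify that data exist at the URL. */
    boolean hasData(String url) {
        return site.get(url) != null;
    }

    /** Detailed collection module: read the rating data for one URL. */
    String collectDetail(String url) {
        return site.get(url);
    }

    /** Hypothetical ID scheme: the number after the last slash of the URL. */
    static int idOf(String url) {
        return Integer.parseInt(url.substring(url.lastIndexOf('/') + 1));
    }

    /** Scheduler: run the modules in order and record the outcome for each URL. */
    void run() {
        for (String url : collectUrls()) {
            if (!hasData(url)) {
                log.add(url + " skipped");
                continue;
            }
            database.put(idOf(url), collectDetail(url));
            log.add(url + " stored");
        }
        finishedAt = Instant.now();
    }

    /** Deliver the stored records in ID order as a small JSON array. */
    String toJson() {
        StringBuilder sb = new StringBuilder("[");
        for (Map.Entry<Integer, String> e : database.entrySet()) {
            if (sb.length() > 1) {
                sb.append(",");
            }
            sb.append("{\"id\":").append(e.getKey())
              .append(",\"title\":\"").append(escape(e.getValue())).append("\"}");
        }
        return sb.append("]").toString();
    }

    /** Escape backslashes and double quotes for the JSON string values. */
    static String escape(String s) {
        return s.replace("\\", "\\\\").replace("\"", "\\\"");
    }

    /** Demo run over three placeholder pages, one of which has no data. */
    static CollectionPipeline demo() {
        Map<String, String> site = new LinkedHashMap<>();
        site.put("/detail/3", "Game C");
        site.put("/detail/1", "Game A");
        site.put("/detail/2", null); // page without data
        CollectionPipeline p = new CollectionPipeline(site);
        p.run();
        return p;
    }

    public static void main(String[] args) {
        CollectionPipeline p = demo();
        System.out.println(p.log);
        System.out.println(p.toJson());
    }
}
```

In the demo, the pages are visited in the order 3, 1, 2, the page without data is skipped, and the stored records are still delivered in ID order because the storage map is sorted by key. In a real collector, the detailed collection module would fetch and parse the page (for example with Jsoup, as the paper describes) instead of reading from a map.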

Conclusion
From the standpoint of game users and the game industry, it is necessary to examine whether GRAC provides information on game rating review properly and fairly. Previous studies show that GRAC has played its role as a public institution but that its rating review has impeded the game industry. Most papers address the validity of rating review information from the perspective of sociology and law, and few address it from the perspective of engineering. This study analyzed the types and contents of the game rating review data provided by GRAC, designed a data collection system, and collected the relevant data to obtain extended public data. The analysis showed that the game rating review data were dispersed across several pages. This study therefore planned how to collect the dispersed data and designed and implemented a structured system to do so. A total of 80,180 records were collected, and a foundation for providing extended public data was built.
Based on the experiment and the collected data, the author intends to extend big data analysis methods such as grouping and factor analysis and to conduct further studies on advanced big data analysis, such as predicting the time required to review a game and comparative analysis with overseas ratings, using ICT of the fourth industrial revolution, for example machine learning and artificial intelligence.