KRIBB Repository

Use of graph database for the integration of heterogeneous biological data

Full metadata record

DC Field	Value	Language
dc.contributor.author	Byoungha Yoon	-
dc.contributor.author	Seon-Kyu Kim	-
dc.contributor.author	Seon-Young Kim	-
dc.date.accessioned	2017-08-29	-
dc.date.available	2017-08-29	-
dc.date.issued	2017	-
dc.identifier.issn	I000-0158	-
dc.identifier.uri	https://oak.kribb.re.kr/handle/201005/17108	-
dc.description.abstract	Understanding complex relationships among heterogeneous biological data is one of the fundamental goals in biology. In most cases, diverse biological data are stored in relational databases, such as MySQL and Oracle, which store data in multiple tables and then infer relationships by multiple-join statements. Recently, a new type of database, called the graph-based database, was developed to natively represent various kinds of complex relationships, and it is widely used among computer science communities and IT industries. Here, we demonstrate the feasibility of using a graph-based database for complex biological relationships by comparing the performance between MySQL and Neo4j, one of the most widely used graph databases. We collected various biological data (protein-protein interaction, drug-target, gene-disease, etc.) from several existing sources, removed duplicate and redundant data, and finally constructed a graph database containing 114,550 nodes and 82,674,321 relationships. When we tested the query execution performance of MySQL versus Neo4j, we found that Neo4j outperformed MySQL in all cases. While Neo4j exhibited a very fast response for various queries, MySQL exhibited latent or unfinished responses for complex queries with multiple-join statements. These results show that using graph-based databases, such as Neo4j, is an efficient way to store complex biological relationships. Moreover, querying a graph database in diverse ways has the potential to reveal novel relationships among heterogeneous biological data.	-
dc.publisher	Korea Soc-Assoc-Inst	-
dc.title	Use of graph database for the integration of heterogeneous biological data	-
dc.title.alternative	Use of graph database for the integration of heterogeneous biological data	-
dc.type	Article	-
dc.citation.title	Genomics & Informatics	-
dc.citation.number	1	-
dc.citation.endPage	27	-
dc.citation.startPage	19	-
dc.citation.volume	15	-
dc.contributor.affiliatedAuthor	Byoungha Yoon	-
dc.contributor.affiliatedAuthor	Seon-Kyu Kim	-
dc.contributor.affiliatedAuthor	Seon-Young Kim	-
dc.contributor.alternativeName	윤병하	-
dc.contributor.alternativeName	김선규	-
dc.contributor.alternativeName	김선영	-
dc.identifier.bibliographicCitation	Genomics & Informatics, vol. 15, no. 1, pp. 19-27	-
dc.identifier.doi	10.5808/GI.2017.15.1.19	-
dc.subject.keyword	Neo4j	-
dc.subject.keyword	biological network	-
dc.subject.keyword	data mining	-
dc.subject.keyword	graph database	-
dc.subject.keyword	heterogeneous biological data	-
dc.subject.keyword	query performance	-
dc.subject.local	Neo4j	-
dc.subject.local	biological network	-
dc.subject.local	data mining	-
dc.subject.local	graph database	-
dc.subject.local	heterogeneous biological data	-
dc.subject.local	query performance	-
dc.description.journalClass	N	-
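
The abstract contrasts inferring relationships through multiple-join statements with representing them natively as a graph. The following is a minimal sketch of that contrast, not the authors' benchmark: SQLite (from the Python standard library) stands in for MySQL, a plain dictionary of adjacency lists stands in for a graph database such as Neo4j, and the node names, relationship labels, and function names are hypothetical toy values that do not come from the paper. It only shows that the same three-hop question (drug → target gene → interacting gene → disease) needs one self-join per hop in the relational form, and it makes no claim about performance.

```python
import sqlite3

# Hypothetical toy edges (source, relation, target); not data from the paper.
EDGES = [
    ("drugA", "TARGETS", "gene1"),
    ("drugB", "TARGETS", "gene2"),
    ("gene1", "INTERACTS_WITH", "gene2"),
    ("gene2", "ASSOCIATED_WITH", "diseaseX"),
    ("gene1", "ASSOCIATED_WITH", "diseaseY"),
]


def diseases_via_joins(drug):
    """Relational style: one self-join on the edge table per hop."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE edge (src TEXT, rel TEXT, dst TEXT)")
    conn.executemany("INSERT INTO edge VALUES (?, ?, ?)", EDGES)
    rows = conn.execute(
        """
        SELECT DISTINCT e3.dst
        FROM edge AS e1
        JOIN edge AS e2 ON e2.src = e1.dst
        JOIN edge AS e3 ON e3.src = e2.dst
        WHERE e1.src = ?
          AND e1.rel = 'TARGETS'
          AND e2.rel = 'INTERACTS_WITH'
          AND e3.rel = 'ASSOCIATED_WITH'
        """,
        (drug,),
    ).fetchall()
    conn.close()
    return sorted(row[0] for row in rows)


def diseases_via_traversal(drug):
    """Graph style: follow adjacency lists hop by hop from the start node."""
    adjacency = {}
    for src, rel, dst in EDGES:
        adjacency.setdefault((src, rel), []).append(dst)

    frontier = {drug}
    for rel in ("TARGETS", "INTERACTS_WITH", "ASSOCIATED_WITH"):
        frontier = {
            dst for node in frontier for dst in adjacency.get((node, rel), [])
        }
    return sorted(frontier)


if __name__ == "__main__":
    print(diseases_via_joins("drugA"))
    print(diseases_via_traversal("drugA"))
```

Both functions answer the same question; in the join version each additional hop adds another self-join to the query, whereas the traversal version only extends the list of relationship labels to follow.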

Appears in Collections: Aging Convergence Research Center > 1. Journal Articles

