KRIBB Repository

GenoCore : a simple and fast algorithm for core subset selection from large genotype datasets

Cited 49 times in Scopus

Full metadata record

DC Field	Value	Language
dc.contributor.author	Seongmun Jeong	-
dc.contributor.author	Jae-Yoon Kim	-
dc.contributor.author	Soon-Chun Jeong	-
dc.contributor.author	S T Kang	-
dc.contributor.author	J K Moon	-
dc.contributor.author	Namshin Kim	-
dc.date.accessioned	2017-08-29	-
dc.date.available	2017-08-29	-
dc.date.issued	2017	-
dc.identifier.issn	1932-6203	-
dc.identifier.uri	10.1371/journal.pone.0181420	ko
dc.identifier.uri	https://oak.kribb.re.kr/handle/201005/17266	-
dc.description.abstract	Selecting core subsets from plant genotype datasets is important for improving cost-effectiveness and shortening the time required for genome-wide association studies (GWAS), genomics-assisted breeding of crop species, and similar analyses. Recently, large numbers of genetic markers (>100,000 single nucleotide polymorphisms) have been identified from high-density single nucleotide polymorphism (SNP) arrays and next-generation sequencing (NGS) data. However, no software is available for selecting an efficient and consistent core subset from such large datasets. Software is needed that can consistently extract the genetically important samples in a population. Here we present a new program, GenoCore, which quickly and efficiently finds a core subset representing the entire population. We introduce simple coverage and diversity scores, which reflect genotype errors and genetic variation and help to select samples rapidly and accurately from crop genotype datasets. We compared our method with other core collection software on example datasets to validate its performance in terms of genetic distance, diversity, coverage, required system resources, and the number of selected samples. GenoCore selects the smallest, most consistent, and most representative core collection from all samples, using less memory with more efficient scores, and shows greater genetic coverage than the other software tested. GenoCore is written in R and can be accessed online, with an example dataset and test results, at https://github.com/lovemun/Genocore.	-
dc.publisher	Public Library of Science	-
dc.title	GenoCore : a simple and fast algorithm for core subset selection from large genotype datasets	-
dc.title.alternative	GenoCore : a simple and fast algorithm for core subset selection from large genotype datasets	-
dc.type	Article	-
dc.citation.title	PLoS One	-
dc.citation.number	7	-
dc.citation.endPage	e0181420	-
dc.citation.startPage	e0181420	-
dc.citation.volume	12	-
dc.contributor.affiliatedAuthor	Seongmun Jeong	-
dc.contributor.affiliatedAuthor	Jae-Yoon Kim	-
dc.contributor.affiliatedAuthor	Soon-Chun Jeong	-
dc.contributor.affiliatedAuthor	Namshin Kim	-
dc.contributor.alternativeName	정성문	-
dc.contributor.alternativeName	김재윤	-
dc.contributor.alternativeName	정순천	-
dc.contributor.alternativeName	강성택	-
dc.contributor.alternativeName	문중경	-
dc.contributor.alternativeName	김남신	-
dc.identifier.bibliographicCitation	PLoS One, vol. 12, no. 7, pp. e0181420-e0181420	-
dc.identifier.doi	10.1371/journal.pone.0181420	-
dc.description.journalClass	Y	-

Appears in Collections:
Division of A.I. & Biomedical Research > Genomic Medicine Research Center > 1. Journal Articles
Ochang Branch Institute > 1. Journal Articles

Files in This Item:

14904.pdf 331.51 kB / Adobe PDF

Open Access@KRIBB was built as an OAK to the National Central Library.