リケラボ論文検索は、全国の大学リポジトリにある学位論文・教授論文を一括検索できる論文検索サービスです。

リケラボ 全国の大学リポジトリにある学位論文・教授論文を一括検索するならリケラボ論文検索大学・研究所にある論文を検索できる

リケラボ 全国の大学リポジトリにある学位論文・教授論文を一括検索するならリケラボ論文検索大学・研究所にある論文を検索できる

大学・研究所にある論文を検索できる 「RIL-StEp: epistasis analysis of rice recombinant inbred lines (RILs) reveals candidate interacting genes that control seed hull color and leaf chlorophyll content」の論文概要。リケラボ論文検索は、全国の大学リポジトリにある学位論文・教授論文を一括検索できる論文検索サービスです。

コピーが完了しました

URLをコピーしました

論文の公開元へ論文の公開元へ
書き出し

RIL-StEp: epistasis analysis of rice recombinant inbred lines (RILs) reveals candidate interacting genes that control seed hull color and leaf chlorophyll content

Sakai, Toshiyuki Abe, Akira Shimizu, Motoki Terauchi, Ryohei 京都大学 DOI:10.1093/g3journal/jkab130

2021.07

概要

Characterizing epistatic gene interactions is fundamental for understanding the genetic architecture of complex traits. However, due to the large number of potential gene combinations, detecting epistatic gene interactions is computationally demanding. A simple, easy-to-perform method for sensitive detection of epistasis is required. Due to their homozygous nature, use of recombinant inbred lines excludes the dominance effect of alleles and interactions involving heterozygous genotypes, thereby allowing detection of epistasis in a simple and interpretable model. Here, we present an approach called RIL-StEp (recombinant inbred lines stepwise epistasis detection) to detect epistasis using single-nucleotide polymorphisms in the genome. We applied the method to reveal epistasis affecting rice (Oryza sativa) seed hull color and leaf chlorophyll content and successfully identified pairs of genomic regions that presumably control these phenotypes. This method has the potential to improve our understanding of the genetic architecture of various traits of crops and other organisms.

この論文で使われている画像

参考文献

Aoki K, Ogata Y, Shibata D. 2007. Approaches for extracting practical

information from gene co-expression networks in plant biology.

Plant Cell Physiol. 48:381–390.

Azodi CB, Bolger E, McCarren A, Roantree M, de los Campos G, et al.

2019. Benchmarking parametric and machine learning models

for genomic prediction of complex traits. G3 (Bethesda). 9:

3691–3702.

Bailey DW. 1971. Recombinant-inbred strains: an aid to finding identity, linkage, and function op histocompatibility and other genes.

Transplantation. 11:325–327.

Bush WS, Moore JH. 2012. Chapter 11: genome-wide association

studies. PLoS Comput Biol. 8: e1002822.

¨ , Haley CS. 2004. Epistasis: too often neglected in complex

Carlborg O

trait studies? Nat Rev Genet. 5:618–625.

Chen SH, Sun J, Dimitrov L, Turner AR, Adams TS, et al. 2008. A support vector machine approach for detecting gene-gene interaction. Genet Epidemiol. 32:152–167.

Colak R, Kim T, Kazan H, Oh Y, Cruz M, et al. 2016. JBASE: joint

Bayesian

analysis

of

subphenotypes

and

epistasis.

Bioinformatics. 32:203–210.

Cordell HJ. 2002. Epistasis: what it means, what it doesn’t mean, and

statistical methods to detect it in humans. Hum Mol Genet. 11:

2463–2468.

Feltus FA. 2014. Systems genetics: a paradigm to improve discovery

of candidate genes and mechanisms underlying complex traits.

Plant Sci. 223:45–48.

Fisher RA. 1919. XV.—the correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edinb. 52:

399–433.

Fragoso CA, Heffelfinger C, Zhao H, Dellaporta SL. 2016. Imputing

genotypes in biallelic populations from low-coverage sequence

data. Genetics. 202:487–495.

Fukuda A, Shimizu H, Shiratsuchi H, Yamaguchi H, Ohdaira Y, et al.

2012. Complementary genes that cause black ripening hulls in f 1

plants of crosses between Indica and Japonica rice cultivars.

Plant Prod. Sci. 15:270–273.

Gallagher MD, Chen-Plotkin AS. 2018. The post-GWAS era: from association to function. Am J Hum Genet. 102:717–730.

Gilad Y, Rifkin SA, Pritchard JK. 2008. Revealing the architecture of

gene regulation: the promise of eQTL studies. Trends Genet. 24:

408–415.

Heffner EL, Sorrells ME, Jannink JL. 2009. Genomic selection for crop

improvement. Crop Sci. 49:1–12.

Hemani G, Theocharidis A, Wei W, Haley C. 2011. EpiGPU: exhaustive pairwise epistasis scans parallelized on consumer level

graphics cards. Bioinformatics. 27:1462–1465.

Higuchi-Takeuchi M, Ichikawa T, Kondou Y, Matsui K, Hasegawa Y,

et al. 2011. Functional analysis of two isoforms of leaf-type

ferredoxin-NADP þ-oxidoreductase in rice using the heterologous expression system of Arabidopsis. Plant Physiol. 157:96–108.

Huang X, Zhao Y, Wei X, Li C, Wang A, et al. 2012. Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm. Nat Genet. 44:32–39.

Ishwaran H, Rao JS. 2005. Spike and slab variable selection:

Frequentist and Bayesian strategies. Ann Stat. 33:730–773.

Jarosz AF, Wiley J. 2014. What are the odds? A practical guide to computing and reporting Bayes factors. J Probl Solving. 7:2–9.

Downloaded from https://academic.oup.com/g3journal/article/11/7/jkab130/6237892 by KYOTO UNIVERSITY Medical Library user on 03 October 2022

one combination (Combination 2 of seed hull color trait: K ¼ 281)

due to the biased distribution of genotypes of this combination

(Supplementary Figure S4). Therefore, we believe the threshold of

K > 100 may be appropriate to detect candidate epistatic genomic

regions in most cases. However, spurious epistasis may be possibly identified due to the bias of genotypes in a population, and

careful validation is needed to conclude the epistatic effect of

candidate loci. The identified regions may contain multiple

genes, and we could not specify the responsible genes by genetic

analysis alone. A challenge of GWAS is to bridge the gap between

the identification of the genomic regions and of the causative

genes responsible for the phenotype (Gallagher and Chen-Plotkin

2018). Using more RILs and applying a stricter threshold in the

epistasis analysis should make it possible to pin down a much

smaller genomic region. However, applying a stricter threshold

has a risk of missing true positives. Indeed, the genomic regions

of the interacting gene pair BH4 and Phr1 were not in the regions

that showed the highest Bayes factor values (Figure 3). It is challenging to strike the appropriate balance between controlling for

type I and type II errors (Todorov and Rao 1997). Identifying genes

using only statistical significance thresholds is usually not possible and not appropriate.

Here, we successfully specified strong candidate genes presumably controlling the seed hull phenotype using knowledge about

the candidate genes and the sequence analysis. However, this approach may not be applicable in every case. There may be several

approaches to validate the epistatic relationship between the

genes, such as co-expression analysis to explore genes in the same

biological processes (Aoki et al. 2007; Mao and Chen 2012; van Dam

et al. 2018) or eQTL analysis to identify genetic variants regulated

by specific genes (Gilad et al. 2008; Feltus 2014). Combining information from other sources of evidence with RIL-StEp results may

enhance our capability to identify interacting genes.

To summarize, we propose a novel approach based on simple

linear models to detect epistatic interactions underlying quantitative traits in the RIL population. By applying RIL-StEp, we succeeded in identifying genomic regions related to rice seed hull

color and chlorophyll content. Incorporating additional information allowed us to identify candidate genes involved in seed hull

color. Thus, our approach has the potential to identify epistasis

in various biological traits.

R.T. and T.S. conceptualized the study. T.S., A.A., and M.S. performed the research. The original draft was written by T.S. and

reviewed by R.T., A.A., and M.S. All authors read and approved

the final manuscript.

A Self-archived copy in

Kyoto University Research Information Repository

https://repository.kulib.kyoto-u.ac.jp

T. Sakai et al.

11

Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, McCombie

Rosyara UR, de Jong WS, Douches DS, Endelman JB. 2016. Software

WR, et al. 2013. Improvement of the Oryza sativa Nipponbare ref-

for genome-wide association studies in autopolyploids and its

erence genome using next generation sequence and optical map

application to potato. Plant Genome. 9:1–10.

Runcie DE, Crawford L. 2019. Fast and flexible linear mixed models

data. Rice (N Y). 6:4.

Kojima Y, Ebana K, Fukuoka S, Nagamine T, Kawase M. 2005.

for genome-wide genetics. PLoS Genet. 15:e1007978.

Schmieder R, Edwards R. 2011. Quality control and preprocessing of

germplasm. Breed Sci. 55:431–440.

Laurie C, Wang S, Carlini-Garcia LA, Zeng ZB. 2014. Mapping epi-

metagenomic datasets. Bioinformatics. 27:863–864.

Shao T, Qian Q, Tang D, Chen J, Li M, et al. 2012. A novel gene IBF1 is

static quantitative trait loci. BMC Genet. 15:

Li H, Durbin R. 2009. Fast and accurate short read alignment with

required for the inhibition of brown pigment deposition in rice

Burrows–Wheeler transform. Bioinformatics. 25:1754–1760.

Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, et al.; 1000 Genome

Project Data Processing Subgroup. 2009. The sequence align-

hull furrows. Theor Appl Genet. 125:381–390.

Stanislas V, Dalmasso C, Ambroise C. 2017. Eigen-Epistasis for

detecting gene-gene interactions. BMC Bioinformatics. 18:1–14.

Sukumaran S, Dreisigacker S, Lopes M, Chavez P, Reynolds MP. 2015.

ment/map format and SAMtools. Bioinformatics. 25:2078–2079.

Li J, Malley JD, Andrew AS, Karagas MR, Moore JH. 2016. Detecting

Genome-wide association study for grain yield and related traits

gene-gene interactions using a permutation-based random forest

environments. Theor Appl Genet. 128:353–363.

Sun X, Lu Q, Mukheerjee S, Crane PK, Elston R, et al. 2014. Analysis

method. BioData Min. 9:1–17.

Li X. 2017. A fast and exhaustive method for heterogeneity and epistasis

analysis

based

on

multi-objective

optimization.

Bioinformatics. 33:2829–2836.

Liang F, Paulo R, Molina G, Clyde MA, Berger JO. 2008. Mixtures of g

priors for Bayesian variable selection. J Am Stat Assoc. 103:

410–423.

Lo CC, Chain PSG. 2014. Rapid evaluation and quality control of next

generation sequencing data with FaQCs. BMC Bioinformatics. 15:

366–368.

Luan T, Woolliams JA, Lien S, Kent M, Svendsen M, et al. 2009. The accuracy of genomic selection in Norwegian red cattle assessed by

cross-validation. Genetics. 183:1119–1126.

Mackay TFC. 2014. Epistasis and quantitative traits: using model

in an elite spring wheat population grown in temperate irrigated

pipeline for the epistasis search - statistical versus biological filtering. Front Genet. 5:106–107.

Taylor MB, Ehrenreich IM. 2015. Higher-order genetic interactions

and their contribution to complex traits. Trends Genet. 31:

34–40.

Todorov AA, Rao DC. 1997. Trade-off between false positives and

false negatives in the linkage analysis of complex traits. Genet

Epidemiol. 14:453–464.

Tuo S. 2018. FDHE-IW: a fast approach for detecting high-order

epistasis in genome-wide case-control studies. Genes (Basel). 9:

435.

Uddling J, Gelang-Alfredsson J, Piikki K, Pleijel H. 2007. Evaluating

the relationship between leaf chlorophyll concentration and

organisms to study gene-gene interactions. Nat Rev Genet. 15:

SPAD-502 chlorophyll meter readings. Photosynth Res. 91:

22–33.

Mackay TFC, Moore JH. 2014. Why epistasis is important for tackling

37–46.

~ sa U, van der Graaf A, Franke L, de Magalha

~ es JP. 2018.

van Dam S, Vo

complex human disease genetics. Genome Med. 6:125–128.

Mao D, Chen C. 2012. Colinearity and similar expression pattern of

Gene co-expression analysis for functional classification and

rice DREB1s reveal their functional conservation in the cold-res-

gene-disease predictions. Brief Bioinform. 19:575–592.

Wakefield J. 2009. Bayes factors for Genome-wide association stud-

ponsive pathway. PLoS One. 7:e47275.

Morey RD, Rouder JN, Jamil T, Urbanek S, Forner K, et al. 2018.

ies: comparison with P-values. Genet Epidemiol. 33:79–86.

Wan X, Yang C, Yang Q, Xue H, Fan X, et al. 2010. BOOST: a fast ap-

Package

‘BayesFactor’.https://cran.r-project.org/web/packages/

proach to detecting gene-gene interactions in genome-wide

BayesFactor/BayesFactor.pdf (26 April 2021, date last accessed).

Narasimhan V, Danecek P, Scally A, Xue Y, Tyler-Smith C, et al. 2016.

case-control studies. Am J Hum Genet. 87:325–340.

Wang D, Salah El-Basyoni I, Baenziger PS, Crossa J, Eskridge KM, et al.

BCFtools/RoH: a hidden Markov model approach for detecting

2012. Prediction of genetic values of quantitative traits with epi-

autozygosity

data.

static effects in plant breeding populations. Heredity (Edinb). 109:

Bioinformatics. 32:1749–1751.

Niel C, Sinoquet C, Dina C, Rocheleau G. 2015. A survey about meth-

313–319.

Wei WH, Hemani G, Haley CS. 2014. Detecting epistasis in human

ods dedicated to epistasis detection. Front Genet. 6:

Niel C, Sinoquet C, Dina C, Rocheleau G. 2018. SMMB: a stochastic

complex traits. Nat Rev Genet. 15:722–733.

Xu Y, Crouch JH. 2008. Marker-assisted selection in plant breeding:

Markov blanket framework strategy for epistasis detection in

from publications to practice. Crop Sci. 48:391–407.

Yang CH, Chuang LY, Da Lin Y. 2017. CMDR based differential evolu-

from

next-generation

sequencing

GWAS. Bioinformatics. 34:2773–2780.

Park MY, Hastie T. 2008. Penalized logistic regression for detecting

tion identifies the epistatic interaction in genome-wide associa-

gene interactions. Biostatistics. 9:30–50.

Phillips PC. 2008. Epistasis—the essential role of gene interactions in

tion studies. Bioinformatics. 33:2354–2362.

Yu J, Pressoir G, Briggs WH, Bi IV, Yamasaki M, et al. 2006. A unified

the structure and evolution of genetic systems. Nat Rev Genet. 9:

mixed-model method for association mapping that accounts for

855–867.

Riley PA. 1997. Melanin. Int J Biochem Cell Biol. 29:1235–1239.

Ritchie MD. 2011. Using biological knowledge to uncover the mystery

multiple levels of relatedness. Nat Genet. 38:203–208.

Yu W, Kwon MS, Park T. 2015. Multivariate quantitative multifactor

in the search for epistasis in genome-wide association studies.

Hum Hered. 79:168–181.

Yu W, Lee S, Park T. 2016. A unified model based multifactor dimen-

Ann Hum Genet. 75:172–182.

Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, et al. 2001.

Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet. 69:138–147.

dimensionality reduction for detecting gene-gene interactions.

sionality reduction framework for detecting gene-gene interactions. Bioinformatics. 32:i605–i610.

Yu Y, Tang T, Qian Q, Wang Y, Yan M, et al. 2008. Independent

losses of function in a polyphenol oxidase in rice: differentiation

Downloaded from https://academic.oup.com/g3journal/article/11/7/jkab130/6237892 by KYOTO UNIVERSITY Medical Library user on 03 October 2022

Development of an RFLP-based rice diversity research set of

A Self-archived copy in

Kyoto University Research Information Repository

https://repository.kulib.kyoto-u.ac.jp

12 |

G3, 2021, Vol. 11, No. 7

Zhang Y, Liu JS. 2007. Bayesian inference of epistatic interactions in

case-control studies. Nat Genet. 39:1167–1173.

Zhou Z, Jiang Y, Wang Z, Gou Z, Lyu J, et al. 2015. Resequencing 302

wild and cultivated accessions identifies genes related to domestication and improvement in soybean. Nat Biotechnol. 33:

408–414.

Zhu BF, Si L, Wang Z, Zhou Y, Zhu J, et al. 2011. Genetic control of a

transition from black to straw-white seed hull in rice domestication. Plant Physiol. 155:1301–1311.

Communicating editor: E. Akhunov

Downloaded from https://academic.oup.com/g3journal/article/11/7/jkab130/6237892 by KYOTO UNIVERSITY Medical Library user on 03 October 2022

in grain discoloration between subspecies and the role of positive selection under domestication. Plant Cell. 20:2946–2959.

Yuan L, Yuan CA, Huang DS. 2017. FAACOSE: a fast adaptive ant colony optimization algorithm for detecting SNP epistasis.

Complexity. 2017:1.

Zhang C, Dong SS, Xu JY, He WM, Yang TL. 2019. PopLDdecay:

a fast and effective tool for linkage disequilibrium decay

analysis based on variant call format files. Bioinformatics. 35:

1786–1788.

Zhang Y. 2012. A novel Bayesian graphical model for genome-wide

multi-SNP association mapping. Genet Epidemiol. 36:36–47.

...

参考文献をもっと見る