- 1Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- 2Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- 3Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
- 4Yantai Institute of Coastal Zone Research, Chinese Academy of Sciences, Yantai, China
- 5University of Chinese Academy of Sciences, Beijing, China
- 6The Innovation of Seed Design, Chinese Academy of Sciences, Wuhan, China
The Pacific white shrimp (Litopenaeus vannamei) is the most widely cultured shrimp in the world. A great attention has been paid to improve its body weight (BW) at harvest through genetic selection for decades. Genome-wide association study (GWAS) is a tool to dissect the genetic basis of the traits. In this study, a GWAS approach was conducted to find genes related to BW through genotyping 94,113 single nucleotide polymorphisms (SNPs) in 200 individuals from a breeding population. Four BW-related SNPs located in LG19 and LG39 were identified. Through further candidate gene association analysis, the SNPs in two candidate genes, deoxycytidylate deaminase and non-receptor protein tyrosine kinase, were found to be related with the body weight of the shrimp. Marker-assisted best linear unbiased prediction (MA-BLUP) based on the SNPs in these two genes was used to estimate the breeding values, and the result showed that the highest prediction accuracy of MA-BLUP was increased by 9.4% than traditional BLUP. These results will provide useful information for the marker-assisted breeding in L. vannamei.
Introduction
The Pacific white shrimp, Litopenaeus vannamei, is the most widely cultured shrimp all over the world (FAO, 2019). Due to its extremely high industry value, genetic improvement by selection breeding has been performed for decades (Argue et al., 2002; Sui et al., 2016). Growth trait is considered as the most important economic trait in L. vannamei and has been taken as the main target trait in most breeding programs (Argue et al., 2002; Sui et al., 2016).
As a typical quantitative trait with medium to high heritability (Castillo-Juárez et al., 2007; Sui et al., 2016; Nolasco-Alzaga et al., 2018), improvement of the growth trait was performed mainly by best linear unbiased prediction (BLUP) (Castillo-Juárez et al., 2007; Sui et al., 2016; Nolasco-Alzaga et al., 2018), which is essentially based on an “infinitesimal model” (Mather, 1941). This model assumes that the quantitative trait is controlled by an infinite number of gene loci that are evenly distributed across the genome and each locus has an infinitely small effect (Fisher, 1918; Bulmer, 1971). However, substantial advances in molecular genetics and genomic research provided us an opportunity to capture the quantitative trait loci (QTLs) or genes that have definite effects on growth.
An essential method for identifying QTLs is genetic linkage mapping. Several studies based on this method have identified QTLs for growth trait in L. vannamei, and no related genes were found due to the limitation of marker density (Andriantahina et al., 2013; Yu et al., 2015). Besides, QTLs detected in a mapping population with a biparental family are usually difficult to be transferred to large, more complex breeding groups (Takeda and Matsuoka, 2008). With the development of high-throughput genotyping platforms for single nucleotide polymorphisms (SNPs), genome-wide association study (GWAS) has become a more powerful tool to dissect the genetic basis underlying the traits. It utilizes high-density SNPs distributed along the genome to determine the QTLs by statistical associations between SNP genotypes and particular phenotypes. GWAS has been successfully carried out in fish, referring to many traits, such as filet texture, fat content (Sodeland et al., 2013), growth (Gonzalez-Pena et al., 2016; Li et al., 2017), sexual maturation (Gutierrez et al., 2015), disease resistance (Liu et al., 2015; Zhou et al., 2017; Rochus et al., 2018; Tan et al., 2018), low oxygen tolerance (Zhong et al., 2017), and fatty acid composition (Horn et al., 2020). As for crustaceans, reports to use GWAS to discover SNPs or genes related to interesting traits are relatively less. Based on the restriction-site-associated DNA (RAD) sequencing data in a biparental family, GWAS analysis for growth trait was performed and two growth-related candidate genes protein kinase C delta type and ras-related protein Rap-2a were identified in L. vannamei (Yu et al., 2019). In another GWAS analysis, 23,049 SNPs were obtained in a population of 200 L. vannamei using 2b-RAD sequencing, and class C scavenger receptor was identified as a growth-related candidate gene (Wang et al., 2019). Then, as the genome was assembled, another growth-related candidate gene, MMD2, was found based on the correlation analysis results in Wang et al. (2019) and further validated (Wang et al., 2020). Although these researches illustrated the power of identifying QTLs or genes using GWAS in L. vannamei, low marker density and simple population structure restrict their applied values.
With the accomplishment of the whole genome sequencing in L. vannamei (Zhang et al., 2019), performing GWAS analysis and screening trait-related genes with reference genome and high-density SNPs become possible. In the present study, the 2b-RAD sequencing data were mapped to the published genome sequence, and the SNPs were called based on the reference sequence, then GWAS for body weight (BW) was performed. Candidate growth-associated genes were further validated in other populations. Moreover, a trial to apply marker-assisted selection (MAS) was conducted. These results will provide useful information for marker-assisted selection in L. vannamei.
Materials and Methods
Identification of Growth-Associated Markers by GWAS
The experimental population used in GWAS was the same as described previously (Wang et al., 2017, 2019). Briefly, the GWAS population which was named as population 1, consisting of 13 full-sib families, was bred in Guangtai Marine Breeding Company in Hainan Province of China in 2015. Each family was raised separately for seedlings. After grown to 3 cm, 50 individuals per family were randomly selected, mixed, and cultured in a 10-m2 pond. When they reached the market size, 200 shrimp were selected randomly to be taken as GWAS population and the BW of each shrimp was measured. The 2b-RAD library sequencing data of these 200 individuals have been published previously (Wang et al., 2017, 2019), and 23,049 SNPs were obtained in these researches. As the reference genome has been finished and published recently (Zhang et al., 2019), the sequencing data were mapped to the genome sequence, and the SNPs were called based on the reference sequence. After quality control filtering by minor allele frequency at 0.05 and SNP call rate at 0.9, 94,113 SNPs were used in subsequent GWAS.
Considering a possible stratification due to multiple origins of the shrimp, GWAS was performed using the egscore function in GenABEL package implemented in R environment (Aulchenko et al., 2007). In this method, classic principal component analysis (PCA) of the genomic kinship matrix was used to evaluate the population stratification (Price et al., 2006). Then, genome-wide significance was assessed by the Bonferroni method, in which the conventional p value was divided by the number of tests performed (SNPs tested). The model used for GWAS was as follows:
where y is the vector of phenotypic records (BW), b is the vector of fixed effects including sex and the first four principal components, X is the incidence matrices for fixed effects in b, s is the effect of the SNP genotype, W is the design vector for s, and e is the vector of random residuals.
Validation of Growth-Associated SNPs by the General Linear Model
After GWAS, another independent population bred in 2017 which was named as population 2 in this study was used to validate the identified growth-associated SNPs. Population 2 contains 303 individuals from multiple families with similar day age. The top 50 candidate SNPs with the smallest p values obtained in GWAS were genotyped in population 2 by the target sequencing method (Shanghai Biowing Applied Biotechnology Co., Ltd., Shanghai, China). The general linear model (GLM) was used to verify associations between them and BW in population 2. The GLM was as follows:
where y is the vector of phenotypic records (BW), b1 is the vector of sex fixed effects, X1 is incidence matrix for sex fixed effect in b, s is the effect of the SNP genotype, W is the design vector for s, and e is the vector of random residuals. The GLM analyses were executed using the package stats in R.
Annotation of Significant SNPs
After validation, the upstream 10 kb and downstream 10 kb around the significant SNPs were blasted to L. vannamei genome1 and the nearby genes were identified. The 5′ untranslated regions (UTRs) and open reading frames (ORFs) of growth-associated candidate genes were predicted by the Expert Protein Analysis System (EXPASY2).
Identification of Significant SNPs in Candidate Genes by the Mixed Linear Model
To identify SNPs associated with BW located in two candidate genes, the population bred in 2018 which was named as population 3 was used to genotyped SNPs located in 5′ UTR of the candidate genes. Population 3 contained 321 individuals from more than 50 full-sib families with the same day age. Mixed linear model (MLM) analysis was applied to eliminate the effects of population structure. Twelve microsatellite markers (Table 1) were used to genotype this population and their mothers to reconstruct pedigree in the COLONY (Jones and Wang, 2010). The restricted maximum-likelihood method was carried out in ASReml (Gilmour et al., 2012) to solve the following MLM:
where y is the vector of phenotypic records (BW); b1 is the vector of sex fixed effects; X1 is the design matrices for b; u is the random polygenic additive genetic effect except for target SNP, assumed to be , where is the polygenic additive variance; Z is the additive genetic relationship matrix for u; s is the fixed effect of the SNP genotype; W is the design vector for s; and e is the vector of random residuals.
After MLM analysis, linkage disequilibrium (LD) analyses were performed for BW-associated SNPs using the package LDheatmap in R, and additive (a), dominance (d), and substitution (α) effects of each BW-associated SNP were calculated as follows (Falconer and Mackay, 1996):
where AA, BB, and AB were the least squares means of BW of three genotypes, and p and q are the allele frequencies of A and B.
Comparison Between BLUP and Marker-Assisted BLUP
After obtaining SNPs located in candidate growth genes, the methods of MAS were further explored. We calculated the estimated breeding values (EBVs) based on marker-assisted BLUP (MA-BLUP). The EBV of an individual based on MA-BLUP is the sum of polygenic effect () and SNP genotype effects (.) which were obtained by solving MLM equations:
For comparison, EBVs from traditional BLUP model were also obtained using a similar MLM but without SNP effects. Tenfold cross-validation was used to compare the accuracies of BLUP and MA-BLUP. The GLMs were also included in the comparison, which represented selection accuracies based on significant markers. The phenotypes under GLMs were predicted by fixed effects of sex and SNP genotype, which were the same as those used in the validation of growth-associated SNPs. Cross-validation was performed in population 3. For each tenfold cross-validation, the dataset was randomly divided into 10 equal zed subsets. Nine subsets were used as training data and the other one is used as prediction data. Cross-validation was repeated 500 times. Accuracy was represented by average Pearson correlation between observed phenotypes and predicted phenotypes in the prediction dataset.
Results
Genome-Wide Association Analysis
According to the reference genome information (Zhang et al., 2019), a total of 83,015 SNPs were distributed along the 44 linkage groups (LGs) with one SNP per 0.05 cM on average, while the other 11,098 SNPs were not assigned to any LGs. The Manhattan plot of the GWAS result is shown in Figure 1, in which the unassigned SNPs are shown in the rightmost column. We selected the 50 SNPs with the smallest p values for further verification. Their p value ranged from 1.42E–05 to 1.71E–04. A total of eight significant SNPs were located in LG19, and four and three were located in LG11 and LG15, respectively. The other 24 SNPs were located in 17 different LGs. Therefore, potential trait-associated SNPs showed a wide distribution in genome. More details about these 50 SNPs are shown in Table 2.
Figure 1. GWAS analysis for BW of L. vannamei. Two horizontal lines represent the genome-wide significant threshold (6.27) and the top 50 most significant threshold (3.77).
Validation of Growth-Associated SNPs by GLM
Eight candidate SNPs with small p values in GWAS were amplified and genotyped in population 2 by using the target sequencing approach. Their primer sequences are shown in Table 3. As there is a high ratio of repeat sequences and insertion–deletion mutations, the other 42 candidate SNPs were not genotyped. The GLM analysis results are listed in Table 4. Minor allele frequencies in all except for SNP555_246199 were larger than 0.10. The results of GLM analyses indicated that four of them were significantly correlated with BW after Bonferroni correction.
Annotation of Significant SNPs
The SNPs showing significant correlations with BW after Bonferroni correction were further analyzed and annotated. The flanking genes around the significant SNPs are listed in Table 5. Four growth-associated SNPs were found to be located near the genes. There were two unknown genes around the most significant SNP3085_190487, and the second most significant SNP1678_860109 was located in intron 2 of an unknown gene. The other two candidate growth-associated genes near the SNP3406_465799 and SNP2570_399791 were predicted as deoxycytidylate deaminase (dCMPD) and non-receptor protein tyrosine kinase (NPTK). Distances from growth-associated SNPs to candidate genes ranged from 2.6 to 10.0k. Particularly, there were seven transcript variants for L. vannamei NPTK and two for dCMPD in the NCBI GenBank. By comparing sequences of transcripts, we found that the sequence variation among different transcripts existed in 5′ UTR for both genes.
Table 5. Gene annotations and the distances from significant SNPs to candidate genes (bp). Positive indicates gene locates in downstream of the SNP, and negative indicates in upstream.
Validation of Growth-Associated SNPs in Candidate Genes by MLM
In this study, the two annotated candidate genes were further analyzed in population 3 to identify the SNPs located in genes. Polymorphisms around 5′ UTR of these two genes were detected by PCR amplification and Sanger sequencing. Two pairs of primers were designed to amplify 5′ UTR for each gene, and the primer sequences are shown in Table 6.
High polymorphisms were detected in amplified fragments of dCMPD and NPTK. Roughly, there were more than 45 SNPs and 5 indels in the amplified fragments of dCMPD and more than 32 SNPs and 2 indels in those of NPTK. However, most SNPs could be genotyped only in a few samples by Sanger sequencing due to the existence of the indels. Finally, 12 SNPs in NPTK and 8 SNPs in dCMPD with MAF > 0.01 were used to detect their associations with BW. The p values for each SNP are shown in Table 7. One SNP within intron 1 of NPTK (NPTK-278) and two SNPs located about 100 bp upstream of exon 1 of dCMPD (dCMPD-118 and dCMPD-129) were found significantly correlated with BW (p < 0.01).
Linkage disequilibrium analyses among these significant SNPs showed that dCMPD-118 and dCMPD-129 were in strong LD (Table 8), which indicates an overlap exists between their effects on BW. Therefore, the SNP with a larger effect, dCMPD-118, was included for further analysis. The least squares mean, additive, dominance, and substitution effects of NPTK-278 and dCMPD-118 are shown in Table 9. An overdominance effect was found in dCMPD.
Table 9. The least squares mean, additive (a), dominance (d), and substitution (α) effects of two independent associated SNPs NPTK-278 and dCMPD-118.
Comparison Between BLUP and MA-BLUP
Estimated breeding valuess predicted from MA-BLUP and traditional BLUP were compared. For MA-BLUP analysis, we calculated EBVs using three models, two containing the fixed effect of NPTK-278 or dCMPD-118, respectively, and the other one containing both. In particular, the interaction effect between NPTK-278 and dCMPD-118 was not significant, so it was not considered. The correlations among different EBVs are shown in Table 10. The Pearson and Spearman coefficients between different models were high correlated.
Table 10. Relationship among EBVs from BLUP (model 1) and MA-BLUP (models 2–4). The upper triangle represents Pearson correlation coefficients and the lower triangle represents Spearman correlation coefficients.
The prediction accuracies of all the models including GLMs are listed in Table 11. MA-BLUP containing fixed effects of NPTK-278 and dCMPD-118 (model 4) has the highest accuracy, and those containing one SNP fixed effect (model 2 and model 3) are also larger than the traditional BLUP animal model (model 1). The predicted phenotypes by GLMs (models 5–7) showed weak or no correlation with the observed phenotypes.
Discussion
Identification of trait-associated genetic markers or candidate genes is very important both for understanding the molecular mechanisms of phenotypic variation and for breeding applications. In aquaculture species, GWAS analysis was regarded as a useful tool for correlating genetic and phenotypic variations (Abdelrahman et al., 2017). In this study, we present a GWAS analysis to correlate genetic and growth phenotypic variations in L. vannamei. Although the results showed that no SNP reached genome-wide significance, a set of SNPs with small p values was still worthy of further validation and analysis. This is because the use of more samples and markers would probably increase the confidence levels, theoretically allowing the suggested QTL to be detected at a statistically significant level. Considering that Bonferroni correction was overly restrictive, we selected 50 SNPs with the smallest p values as the candidate SNPs for further validation. The association analyses between these SNPs and BW were performed in another population to avoid type I errors. These 50 SNPs were distributed widely in a genome which confirmed that growth trait has a polygenic architecture affected by multiple loci with small effects (Falconer and Mackay, 1996). Further association analysis results proved that some of the most significant SNPs were worthy to be reanalyzed although they did not reach genome-wide significance in GWAS.
After validation of the GWAS result, two candidate growth-associated genes were found, which were dCMPD and NPTK. dCMPD is an enzyme which catalyzes the deamination of dCMP to dUMP, thus providing nucleotide substrate for thymidylate synthase (Maley and Maley, 1959). Previous studies have indicated its potential importance in DNA replication. Its activity is elevated in such rapidly dividing tissues as embryo (Maley and Maley, 1959), regenerating liver and rat hepatomas (Maley and Maley, 1961). A research on HeLa cell also found that dCMPD showed the highest expression in late S phase and its expression declines in the following G2 phase (Gelbard et al., 1969). There was also a report about the role of dCMPD in growth, and inhibition of dCMPD could result in inhibition of growth of chick embryo (Roth and Buccino, 1963). Protein tyrosine kinases (PTKs) are enzymes catalyzing the transfer of the gamma-phosphate group of ATP to the hydroxyl groups of specific tyrosine residues in peptides. PTKs are primarily involved in the regulation of cellular functions that are directly related to the multicellular status of the organism, such as growth, differentiation, and cell–cell and cell–extracellular matrix interactions (Robinson et al., 2000). PTKs are divided into two groups according to the presence of transmembrane and extracellular domains, and some lacking these domains are referred to as NPTK (Robinson et al., 2000). So far, there was no report about the function of dCMPD and NPTK in the growth of L. vannamei or other crustaceans.
In L. vannamei, there were two transcript variants for dCMPD and seven for NPTK in NCBI, and sequence variation among different transcripts mainly existed in 5′ UTR for both genes. Besides, considering complex traits are mainly driven by noncoding variants (Pickrell, 2014; Li et al., 2016), polymorphisms around 5′ UTR of these two genes were identified, and their associations with phenotypic variations were detected using MLM analysis. Compared with GLM analysis, the advantage of MLM was accounting for genetic relationships which could avoid confounding effects due to population structure (Kennedy et al., 1992). One SNP in each gene was found significantly correlated with BW. These results showed that dCMPD and NPTK were growth-associated candidate genes in L. vannamei and this deserves further research. However, due to the possible presence of pleiotropy, how allelic frequency changes of these SNPs affect the other important economic traits also needs to be further studied.
Prior to this study, some BW-related genes have been identified in L. vannamei, such as myostatin (Lee et al., 2015), protein kinase C delta type, ras-related protein Rap-2a (Yu et al., 2019), class C scavenger receptor (Wang et al., 2019), and MMD2 (Wang et al., 2020). These findings suggested that growth trait has a polygenic architecture and provides the possibility of application of MAS for growth in L. vannamei. In this paper, we did not try to uncover the biological functions and mechanisms of the effects on phenotypic variations of these two genes, but to explore the methods on how to apply them in breeding.
For aquaculture species, the primary purpose of GWAS was to identify trait-related genes or markers to apply MAS. However, there has been very few MAS application cases in commercial aquaculture breeding. The biggest obstacle is that for economy traits, almost all of which are complex traits, even the most important loci in the genome have small effects, especially for growth traits. Therefore, the effects of associated genes or markers should be taken as complements to rather than a replacement of polygenic effects (additive genetic effects) in breeding. For the reasons above, MA-BLUP was developed (Fernando and Grossman, 1989) and previous studies showed that MA-BLUP could increase the accuracies of genetic evaluations compared with traditional BLUP (Van Arendonk et al., 1994; Zhang et al., 2003), especially on the condition that marker effect was relatively large (Lande and Thompson, 1990; Villanueva et al., 2005). In this study, there were certain differences between EBVs based on MA-BLUP and BLUP (Table 10). Cross-validation revealed that the performance of MA-BLUP was better than BLUP, and MA-BLUP with two SNP effects was better than with one SNP effect. By using the proposed MA-BLUP method, the prediction accuracy was increased by 9.4%. Besides, the prediction accuracies of GLMs consisting of only fixed effects of SNP were much lower than those of the MA-BLUP or BLUP. This suggested that selection only based on trait-related markers was inefficient.
Since the population for GWAS in this study was relatively small and its genetic background was narrow, the power of GWAS might be weakened. Therefore, we further analyzed the significant SNPs and the candidate genes in the other populations step by step and identified three body weight-related SNPs in two candidate genes. The roles of the two candidate growth genes should be further investigated to enrich our understanding of the genetic basis of growth trait in shrimp. Meanwhile, we give an example of applying the trait-associated SNPs in genetic selection by the MA-BLUP method. The prediction accuracy of MA-BLUP based on significant SNPs was higher than that of traditional BLUP. Foreseeably, as more molecular markers associated with different economic traits were identified, MA-BLUP will have a wide application prospect in the selective breeding of shrimp.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material.
Author Contributions
DL and YY conducted the experiment and data processing. JX and FL conceived and supervised the project. QW and XZ contributed to statistical analysis. ZL and QZ participated in the extraction of genomic DNA. DL, YY, and FL prepared the manuscript. All authors have read and approved the manuscript.
Funding
This work was financially supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDA24030105), National Key R&D Program of China (2018YFD0901301), Agricultural Variety Improvement Project of Shandong Province (2019LZGC014), and China Agriculture Research System-48 (CARS-48).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
References
Abdelrahman, H., Elhady, M., Alcivar-Warren, A., Allen, S., Al-Tobasei, R., Bao, L., et al. (2017). Aquaculture genomics, genetics and breeding in the united states: current status, challenges, and priorities for future research. BMC Genom. 18:235. doi: 10.1186/s12864-017-3557-1
Andriantahina, F., Liu, X., and Huang, H. (2013). Genetic map construction and quantitative trait locus (QTL) detection of growth-related traits in Litopenaeus vannamei for selective breeding applications. PLoS One 8:e75206. doi: 10.1371/journal.pone.0075206
Argue, B. J., Arce, S. M., Lotz, J. M., and Moss, S. M. (2002). Selective breeding of pacific white shrimp (Litopenaeus vannamei) for growth and resistance to Taura syndrome virus. Aquaculture 204, 447–460. doi: 10.1016/s0044-8486(01)00830-4
Aulchenko, Y. S., Ripke, S., Isaacs, A., and Van Duijn, C. M. (2007). GenABEL: an R library for genome-wide association analysis. Bioinformatics 23, 1294–1296. doi: 10.1093/bioinformatics/btm108
Bulmer, M. G. (1971). The effect of selection on genetic variability. Am. Nat. 105, 201–211. doi: 10.1086/282718
Castillo-Juárez, H., Casares, J. C. Q., Campos-Montes, G., Villela, C. C., Ortega, A. M., and Montaldo, H. H. (2007). Heritability for body weight at harvest size in the pacific white shrimp, Penaeus (Litopenaeus) vannamei, from a multi-environment experiment using univariate and multivariate animal models. Aquaculture 273:49.
Falconer, D. S., and Mackay, T. F. C. (1996). Introduction to Quantitative Genetics, 4th Edn. Harlow: Longman.
Fernando, R. L., and Grossman, M. (1989). Marker assisted selection using best linear unbiased prediction. Genet. Sel. Evol. 21, 467–477. doi: 10.1186/1297-9686-21-4-467
Fisher, R. A. (1918). The correlations between relatives on the supposition of Mendelian inheritance. Philos. Trans. R. Soc. Edinb. 52, 399–433. doi: 10.1017/s0080456800012163
Gelbard, A. S., Kim, J. H., and Perez, A. G. (1969). Fluctuations in deoxycytidine monophosphate deaminase activity during the cell cycle in synchronous populations of HeLa cells. Biochim. Biophys. Acta 182, 564–566. doi: 10.1016/0005-2787(69)90209-3
Gilmour, A. R., Gogel, B. J., Cullis, B. R., and Thompson, R. (2012). ASReml User Guide Release 4.0. Hemel Hempstead: VSN International Ltd.
Gonzalez-Pena, D., Gao, G., Baranski, M., Moen, T., Cleveland, B. M., Kenney, P. B., et al. (2016). Genome-wide association study for identifying loci that affect fillet yield, carcass, and body weight traits in rainbow trout (Oncorhynchus mykiss). Front. Genet. 7:203. doi: 10.3389/fgene.2016.00203
Gutierrez, A. P., Yanez, J. M., Fukui, S., Swift, B., and Davidson, W. S. (2015). Genome-wide association study (GWAS) for growth rate and age at sexual maturation in Atlantic salmon (Salmo salar). PLoS one 10:e0119730. doi: 10.1371/journal.pone.0119730
Horn, S. S., Ruyter, B., Meuwissen, T. H. E., Moghadam, H. K., Hillestad, B., and Sonesson, A. K. (2020). GWAS identifies genetic variants associated with omega-3 fatty acid composition of Atlantic salmon fillets. Aquaculture 514:734494. doi: 10.1016/j.aquaculture.2019.734494
Jones, O. R., and Wang, J. (2010). COLONY: a program for parentage and sibship inference from multilocus genotype data. Mol. Ecol. Resour. 10, 551–555. doi: 10.1111/j.1755-0998.2009.02787.x
Kennedy, B. W., Quinton, M., and Van Arendonk, J. A. (1992). Estimation of effects of single genes on quantitative traits. J. Anim. Sci. 70, 2000–2012. doi: 10.2527/1992.7072000x
Lande, R., and Thompson, R. (1990). Efficiency of marker-assisted selection in the improvement of quantitative traits. Genetics 124, 743–756. doi: 10.1093/genetics/124.3.743
Lee, J. H., Momani, J., Kim, Y. M., Kang, C. K., Choi, J. H., Baek, H. J., et al. (2015). Effective RNA-silencing strategy of Lv-MSTN/GDF11 gene and its effects on the growth in shrimp, Litopenaeus vannamei. Comp. Biochem. Physiol. B Biochem. Mol. Biol. 179, 9–16. doi: 10.1016/j.cbpb.2014.09.005
Li, N., Zhou, T., Geng, X., Jin, Y., Wang, X., Liu, S., et al. (2017). Identification of novel genes significantly affecting growth in catfish through GWAS analysis. Mol. Genet. Genom. 293, 1–13.
Li, Y. I., Van de Geijn, B., Raj, A., Knowles, D. A., Petti, A. A., Golan, D., et al. (2016). RNA splicing is a primary link between genetic variation and disease. Science 352, 600–604. doi: 10.1126/science.aad9417
Liu, S., Vallejo, R. L., Palti, Y., Gao, G., Marancik, D. P., Hernandez, A. G., et al. (2015). Identification of single nucleotide polymorphism markers associated with bacterial cold water disease resistance and spleen size in rainbow trout. Front. Genet. 6:298. doi: 10.3389/fgene.2015.00298
Maley, F., and Maley, G. F. (1959). The enzymic conversion of glucosamine to galactosamine. Biochim. Biophys. Acta 31, 577–578. doi: 10.1016/0006-3002(59)90047-2
Maley, F., and Maley, G. F. (1961). Nucleotide interconversions IV. Activities of deoxycytidylate deaminase and thymidylate synthetase in normal rat liver and hepatomas. Cancer Res. 21, 1421–1426.
Mather, K. (1941). Variation and selection of polygenic characters. J. Genet. 41, 159–193. doi: 10.1007/bf02983019
Nolasco-Alzaga, H. R., Perez-Enriquez, R., Enez, F., Bestin, A., Palacios, E., and Haffray, P. (2018). Quantitative genetic parameters of growth and fatty acid content in the hemolymph of the White leg shrimp Litopenaeus vannamei. Aquaculture 482, 17–23. doi: 10.1016/j.aquaculture.2017.09.015
Pickrell, J. K. (2014). Joint analysis of functional genomic data and genome wide association studies of 18 human traits. Am. J. Hum. Genet. 94, 559–573. doi: 10.1016/j.ajhg.2014.03.004
Price, A. L., Patterson, N. J., Plenge, R. M., Weinblatt, M. E., Shadick, N. A., and Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. doi: 10.1038/ng1847
Robinson, D. R., Wu, Y. M., and Lin, S. F. (2000). The protein tyrosine family of the human genome. Oncogene 19, 5548–5557. doi: 10.1038/sj.onc.1203957
Rochus, C. M., Holborn, M. K., Ang, K. P., Elliott, J. A. K., Glebe, B. D., Leadbeater, S., et al. (2018). Genome-wide association analysis of salmon lice (Lepeophtheirus salmonis) resistance in a North American Atlantic salmon population. Aquac. Res. 49, 1329–1338. doi: 10.1111/are.13592
Roth, J. S., and Buccino, G. (1963). Some aspects of deoxynucleotide metabolism in the developing chick embryo. Arch. Biochem. Biophys. 101, 516–522. doi: 10.1016/0003-9861(63)90512-5
Sodeland, M., Gaarder, M., Moen, T., Thomassen, M., Kjøglum, S., Kent, M., et al. (2013). Genome-wide association testing reveals quantitative trait loci for fillet texture and fat content in Atlantic salmon. Aquaculture 408, 169–174. doi: 10.1016/j.aquaculture.2013.05.029
Sui, J., Luan, S., Luo, K., Meng, X., Lu, X., Cao, B., et al. (2016). Genetic parameters and response to selection for harvest body weight of Pacific white shrimp, Litopenaeus vannamei. Aquac. Res. 47, 2795–2803. doi: 10.1111/are.12729
Takeda, S., and Matsuoka, M. (2008). Genetic approaches to crop improvement: responding to environmental and population changes. Nature 9, 444–457. doi: 10.1038/nrg2342
Tan, S., Zhou, T., Wang, W., Jin, Y., Wang, X., Geng, X., et al. (2018). GWAS analysis using interspecific backcross progenies reveals superior blue catfish alleles responsible for strong resistance against enteric septicemia of catfish. Mol. Genet. Genom. 293, 1107–1120. doi: 10.1007/s00438-018-1443-4
Van Arendonk, J. A., Tier, B., and Kinghorn, B. P. (1994). Use of multiple genetic markers in prediction of breeding values. Genetics 137, 319–329. doi: 10.1093/genetics/137.1.319
Villanueva, B., Pongwong, R., Fernández, J., and Toro, M. A. (2005). Benefits from marker assisted selection under an additive polygenic genetic model. J. Anim. Sci. 83:1747. doi: 10.2527/2005.8381747x
Wang, Q., Yu, Y., Yuan, J., Zhang, X., Huang, H., Li, F., et al. (2017). Effects of marker density and population structure on the genomic prediction accuracy for growth trait in pacific white shrimp Litopenaeus vannamei. BMC Genet. 18:45. doi: 10.1186/s12863-017-0507-5
Wang, Q., Yu, Y., Zhang, Q., Luo, Z., Zhang, X., Xiang, J., et al. (2020). The polymorphism of LvMMD2 and its association with growth traits in Litopenaeus vannamei. Mar. Biotechnol. 22, 564–571. doi: 10.1007/s10126-020-09977-0
Wang, Q., Yu, Y., Zhang, Q., Zhang, X., Yuan, J., Huang, H., et al. (2019). A novel candidate gene associated with body weight in the Pacific White Shrimp Litopenaeus vannamei. Front. Genet. 10:520. doi: 10.3389/fgene.2019.00520
Yu, Y., Wang, Q., Zhang, Q., Luo, Z., Wang, Y., Zhang, X., et al. (2019). Genome scan for genomic regions and genes associated with growth trait in pacific white shrimp Litopeneaus vannamei. Mar. Biotechnol. 21, 374–383. doi: 10.1007/s10126-019-09887-w
Yu, Y., Zhang, X., Yuan, J., Li, F., Chen, X., and Zhao, Y. (2015). Genome survey and high-density genetic map construction provide genomic and genetic resources for the pacific white shrimp Litopenaeus vannamei. Sci. Rep. 5:15612.
Zhang, H., Zhang, Y., and Zhang, Q. (2003). Pre-selecting young bulls before progeny-testing with marker-assisted blup. Sci. Bull. 48, 259–265. doi: 10.1360/03tb9053
Zhang, X., Yuan, J., Sun, Y., Li, S., Gao, Y., Yu, Y., et al. (2019). Penaeid shrimp genome provides insights into benthic adaptation and frequent molting. Nat. commun. 10:356.
Zhong, X., Wang, X., Zhou, T., Jin, Y., Tan, S., Jiang, C., et al. (2017). Genome-wide association study reveals multiple novel QTL associated with low oxygen tolerance in hybrid catfish. Mar. Biotechnol. 19, 379–390. doi: 10.1007/s10126-017-9757-5
Keywords: GWAS, growth trait, marker assisted BLUP, genetic breeding, Litopenaeus vannamei
Citation: Lyu D, Yu Y, Wang Q, Luo Z, Zhang Q, Zhang X, Xiang J and Li F (2021) Identification of Growth-Associated Genes by Genome-Wide Association Study and Their Potential Application in the Breeding of Pacific White Shrimp (Litopenaeus vannamei). Front. Genet. 12:611570. doi: 10.3389/fgene.2021.611570
Received: 29 September 2020; Accepted: 08 March 2021;
Published: 07 April 2021.
Edited by:
José Manuel Yáñez, University of Chile, ChileReviewed by:
Hugo H. Montaldo, National Autonomous University of Mexico, MexicoLuke Kramer, Iowa State University, United States
Copyright © 2021 Lyu, Yu, Wang, Luo, Zhang, Zhang, Xiang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Fuhua Li, fhli@qdio.ac.cn
†These authors have contributed equally to this work