scispace - formally typeset
Search or ask a question

Showing papers in "BMC Genetics in 2017"


Journal ArticleDOI
TL;DR: The history of genetic studies and the progress driven by next-generation sequencing (NGS) are discussed, using complex inflammatory bowel diseases as an example.
Abstract: Recent advances in the development of sequencing technologies provide researchers with unprecedented possibilities for genetic analyses. In this review, we will discuss the history of genetic studies and the progress driven by next-generation sequencing (NGS), using complex inflammatory bowel diseases as an example. We focus on the opportunities, but also challenges that researchers are facing when working with NGS data to unravel the genetic causes underlying diseases.

160 citations


Journal ArticleDOI
Xiuge Gu1, Mengying Li1, Ye Jin1, Dongxu Liu1, Fulan Wei1 
TL;DR: This study comprehensively identified lncRNAs/circRNAs and first integrated their potential ceRNA function during PDLSC osteogenic differentiation and periodontal regeneration, suggesting that specific lnc RNAs and circRNAs might function as ceRNAs to promote PDL SC osteogenic differentiate and periodental regeneration.
Abstract: Researchers have been exploring the molecular mechanisms underlying the control of periodontal ligament stem cell (PDLSC) osteogenic differentiation. Recently, long noncoding RNAs (lncRNAs) and circular RNAs (circRNAs) were shown to function as competitive endogenous RNAs (ceRNAs) to regulate the effect of microRNAs (miRNAs) on their target genes during cell differentiation. However, comprehensive identification and integrated analysis of lncRNAs and circRNAs acting as ceRNAs during PDLSC osteogenic differentiation have not been performed. PDLSCs were derived from healthy human periodontal ligament and cultured separately with osteogenic induction and normal media for 7 days. Cultured PDLSCs were positive for STRO-1 and CD146 and negative for CD31 and CD45. Osteo-induced PDLSCs showed increased ALP (alkaline phosphatase) activity and up-regulated expression levels of the osteogenesis-related markers ALP, Runt-related transcription factor 2 and osteocalcin. Then, a total of 960 lncRNAs and 1456 circRNAs were found to be differentially expressed by RNA sequencing. The expression profiles of eight lncRNAs and eight circRNAs were measured with quantitative real-time polymerase chain reaction and were shown to agree with the RNA-seq results. Furthermore, the potential functions of lncRNAs and circRNAs as ceRNAs were predicted based on miRanda and were investigated using Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analysis. In total, 147 lncRNAs and 1382 circRNAs were predicted to combine with 148 common miRNAs and compete for miRNA binding sites with 744 messenger RNAs. These mRNAs were predicted to significantly participate in osteoblast differentiation, the MAPK pathway, the Wnt pathway and the signaling pathways regulating pluripotency of stem cells. Among them, lncRNAs coded as TCONS_00212979 and TCONS_00212984, as well as circRNA BANP and circRNA ITCH, might interact with miRNA34a and miRNA146a to regulate PDLSC osteogenic differentiation via the MAPK pathway. This study comprehensively identified lncRNAs/circRNAs and first integrated their potential ceRNA function during PDLSC osteogenic differentiation. These findings suggest that specific lncRNAs and circRNAs might function as ceRNAs to promote PDLSC osteogenic differentiation and periodontal regeneration.

127 citations


Journal ArticleDOI
TL;DR: The results show that Trichoderma is able to respond in different ways to the presence of a plant or a fungal host, and it can even distinguish between different strains of fungi of a given species.
Abstract: Trichoderma spp. can establish beneficial interactions with plants by promoting plant growth and defense systems, as well as, antagonizing fungal phytopathogens in mycoparasitic interactions. Such interactions depend on signal exchange between both participants and can be mediated by effector proteins that alter the host cell structure and function, allowing the establishment of the relationship. The main purpose of this work was to identify, using computational methods, candidates of effector proteins from T. virens, T. atroviride and T. reesei, validate the expression of some of the genes during a beneficial interaction and mycoparasitism and to define the biological function for one of them. We defined a catalogue of putative effector proteins from T. virens, T. atroviride and T. reesei. We further validated the expression of 16 genes encoding putative effector proteins from T. virens and T. atroviride during the interaction with the plant Arabidopsis thaliana, and with two anastomosis groups of the phytopathogenic fungus Rhizoctonia solani. We found genes which transcript levels are modified in response to the presence of both plant fungi, as well as genes that respond only to either a plant or a fungal host. Further, we show that overexpression of the gene tvhydii1, a Class II hydrophobin family member, enhances the antagonistic activity of T. virens against R. solani AG2. Further, deletion of tvhydii1 results in reduced colonization of plant roots, while its overexpression increases it. Our results show that Trichoderma is able to respond in different ways to the presence of a plant or a fungal host, and it can even distinguish between different strains of fungi of a given species. The putative effector proteins identified here may play roles in preventing perception of the fungus by its hosts, favoring host colonization or protecting it from the host’s defense response. Finally, the novel effector protein TVHYDII1 plays a role in plant root colonization by T, virens, and participates in its antagonistic activity against R. solani.

102 citations


Journal ArticleDOI
TL;DR: For the growth traits with moderate or high heritability, such as body weight and body length, the number of about 3.2 K SNPs distributed evenly along the genome was able to satisfy the need for accurate GS prediction in the investigated L.vannamei population.
Abstract: Due to the great advantages in selection accuracy and efficiency, genomic selection (GS) has been widely studied in livestock, crop and aquatic animals Our previous study based on one full-sib family of Litopenaeus vannamei (L vannamei) showed that GS was feasible in penaeid shrimp However, the applicability of GS might be influenced by many factors including heritability, marker density and population structure etc Therefore it is necessary to evaluate the major factors affecting the prediction ability of GS in shrimp The aim of this study was to evaluate the factors influencing the GS accuracy for growth traits in L vannamei Genotype and phenotype data of 200 individuals from 13 full-sib families were used for this analysis In the present study, the heritability of growth traits in L vannamei was estimated firstly based on the full set of markers (23 K) It was 0321 for body weight and 0452 for body length The estimated heritability increased rapidly with the increase of the marker density from 005 K to 32 K, and then it tended to be stable for both traits For genomic prediction on the growth traits in L vannamei, three statistic models (RR-BLUP, BayesA and Bayesian LASSO) showed similar performance for the prediction accuracy of genomic estimated breeding value (GEBV) The prediction accuracy was improved with the increasing of marker density However, the marker density would bring a weak effect on the prediction accuracy after the marker number reached 32 K In addition, the genetic relationship between reference and validation population could influence the GS accuracy significantly A distant genetic relationship between reference and validation population resulted in a poor performance of genomic prediction for growth traits in L vannamei For the growth traits with moderate or high heritability, such as body weight and body length, the number of about 32 K SNPs distributed evenly along the genome was able to satisfy the need for accurate GS prediction in the investigated Lvannamei population The genetic relationship between the reference population and the validation population showed significant effects on the accuracy for genomic prediction Therefore it is very important to optimize the design of the reference population when applying GS to shrimp breeding

68 citations


Journal ArticleDOI
TL;DR: It is demonstrated that SCoT markers were more efficient than ISSR for genetic diversity studies in V. unguiculata and can be integrated in the exploration of their genetic diversity for improvement and germplasm utilization.
Abstract: Assessment of genetic diversity of Vigna unguiculata (L.) Walp (cowpea) accessions using informative molecular markers is imperative for their genetic improvement and conservation. Use of efficacious molecular markers to obtain the required knowledge of the genetic diversity within the local and regional germplasm collections can enhance the overall effectiveness of cowpea improvement programs, hence, the comparative assessment of Inter-simple sequence repeat (ISSR) and Start codon targeted (SCoT) markers in genetic diversity of V. unguiculata accessions from different regions in Nigeria. Comparative analysis of the genetic diversity of eighteen accessions from different locations in Nigeria was investigated using ISSR and SCoT markers. DNA extraction was done using Zymogen Kit according to its manufacturer’s instructions followed by amplifications with ISSR and SCoT and agarose gel electrophoresis. The reproducible bands were scored for analyses of dendrograms, principal component analysis, genetic diversity, allele frequency, polymorphic information content, and population structure. Both ISSR and SCoT markers resolved the accessions into five major clusters based on dendrogram and principal component analyses. Alleles of 32 and 52 were obtained with ISSR and SCoT, respectively. Numbers of alleles, gene diversity and polymorphic information content detected with ISSR were 9.4000, 0.7358 and 0.7192, while SCoT yielded 11.1667, 0.8158 and 0.8009, respectively. Polymorphic loci were 70 and 80 in ISSR and SCoT, respectively. Both markers produced high polymorphism (94.44–100%). The ranges of effective number of alleles (Ne) were 1.2887 ± 0.1797–1.7831 ± 0.2944 and 1.7416 ± 0.0776–1.9181 ± 0.2426 in ISSR and SCoT, respectively. The Nei’s genetic diversity (H) ranged from 0.2112 ± 0.0600–0.4335 ± 0.1371 and 0.4111 ± 0.0226–0.4778 ± 0.1168 in ISSR and SCoT, respectively. Shannon’s information index (I) from ISSR and SCoT were 0.3583 ± 0.0639–0.6237 ± 0.1759 and 0.5911 ± 0.0233–0.6706 ± 0.1604. Total gene diversity (Ht), gene diversity within population (Hs), coefficient of gene differentiation (Gst) and level of gene flow (Nm) revealed by ISSR were 0.4498, 0.3203, 0.2878 and 1.2371 respectively, while SCoT had 0.4808, 0.4522, 0.0594 and 7.9245. Both markers showed highest genetic diversity in accessions from Ebonyi. Our study demonstrated that SCoT markers were more efficient than ISSR for genetic diversity studies in V. unguiculata and can be integrated in the exploration of their genetic diversity for improvement and germplasm utilization.

58 citations


Journal ArticleDOI
TL;DR: Meat quality related gene regions that are positively selected in Ankole (Sanga) cattle breeds as compared to indicus (Boran, Ogaden, and Kenana) breeds are identified using cross-population (XP-EHH and XP-CLR) statistical methods.
Abstract: Africa is home to numerous cattle breeds whose diversity has been shaped by subtle combinations of human and natural selection. African Sanga cattle are an intermediate type of cattle resulting from interbreeding between Bos taurus and Bos indicus subspecies. Recently, research has asserted the potential of Sanga breeds for commercial beef production with better meat quality as compared to Bos indicus breeds. Here, we identified meat quality related gene regions that are positively selected in Ankole (Sanga) cattle breeds as compared to indicus (Boran, Ogaden, and Kenana) breeds using cross-population (XP-EHH and XP-CLR) statistical methods. We identified 238 (XP-EHH) and 213 (XP-CLR) positively selected genes, of which 97 were detected from both statistics. Among the genes obtained, we primarily reported those involved in different biological process and pathways associated with meat quality traits. Genes (CAPZB, COL9A2, PDGFRA, MAP3K5, ZNF410, and PKM2) involved in muscle structure and metabolism affect meat tenderness. Genes (PLA2G2A, PARK2, ZNF410, MAP2K3, PLCD3, PLCD1, and ROCK1) related to intramuscular fat (IMF) are involved in adipose metabolism and adipogenesis. MB and SLC48A1 affect meat color. In addition, we identified genes (TIMP2, PKM2, PRKG1, MAP3K5, and ATP8A1) related to feeding efficiency. Among the enriched Gene Ontology Biological Process (GO BP) terms, actin cytoskeleton organization, actin filament-based process, and protein ubiquitination are associated with meat tenderness whereas cellular component organization, negative regulation of actin filament depolymerization and negative regulation of protein complex disassembly are involved in adipocyte regulation. The MAPK pathway is responsible for cell proliferation and plays an important role in hyperplastic growth, which has a positive effect on meat tenderness. Results revealed several candidate genes positively selected in Ankole cattle in relation to meat quality characteristics. The genes identified are involved in muscle structure and metabolism, and adipose metabolism and adipogenesis. These genes help in the understanding of the biological mechanisms controlling beef quality characteristics in African Ankole cattle. These results provide a basis for further research on the genomic characteristics of Ankole and other Sanga cattle breeds for quality beef.

56 citations


Journal ArticleDOI
TL;DR: This study is the first comprehensive analysis of SnRK2 genes in upland cotton and provides the fundamental information for the functional dissection of GhSnK2s and vital availability for the improvement of plant stress tolerance using GhSnRK1s.
Abstract: Sucrose non-fermenting-1-related protein kinase 2 (SnRK2) is a plant-specific serine/threonine kinase family involved in the abscisic acid (ABA) signaling pathway and responds to osmotic stress. A genome-wide analysis of this protein family has been conducted previously in some plant species, but little is known about SnRK2 genes in upland cotton (Gossypium hirsutum L.). The recent release of the G. hirsutum genome sequence provides an opportunity to identify and characterize the SnRK2 kinase family in upland cotton. We identified 20 putative SnRK2 sequences in the G. hirsutum genome, designated as GhSnRK2.1 to GhSnRK2.20. All of the sequences encoded hydrophilic proteins. Phylogenetic analysis showed that the GhSnRK2 genes were classifiable into three groups. The chromosomal location and phylogenetic analysis of the cotton SnRK2 genes indicated that segmental duplication likely contributed to the diversification and evolution of the genes. The gene structure and motif composition of the cotton SnRK2 genes were analyzed. Nine exons were conserved in length among all members of the GhSnRK2 family. Although the C-terminus was divergent, seven conserved motifs were present. All GhSnRK2s genes showed expression patterns under abiotic stress based on transcriptome data. The expression profiles of five selected genes were verified in various tissues by quantitative real-time RT-PCR (qRT-PCR). Transcript levels of some family members were up-regulated in response to drought, salinity or ABA treatments, consistent with potential roles in response to abiotic stress. This study is the first comprehensive analysis of SnRK2 genes in upland cotton. Our results provide the fundamental information for the functional dissection of GhSnRK2s and vital availability for the improvement of plant stress tolerance using GhSnRK2s.

55 citations


Journal ArticleDOI
TL;DR: Genetic association between SNPs on chromosomes 2, 6, 7, 9, 16 and 18 and CAC were detected in AAs with T2D from AA-DHS and replicated in the JHS, and data support a role for genetic variation on these chromosomes as contributors to CAC.
Abstract: Coronary artery calcified atherosclerotic plaque (CAC) predicts cardiovascular disease (CVD). Despite exposure to more severe conventional CVD risk factors, African Americans (AAs) are less likely to develop CAC, and when they do, have markedly lower levels than European Americans. Genetic factors likely contribute to the observed ethnic differences. To identify genes associated with CAC in AAs with type 2 diabetes (T2D), a genome-wide association study (GWAS) was performed using the Illumina 5 M chip in 691 African American-Diabetes Heart Study participants (AA-DHS), with replication in 205 Jackson Heart Study (JHS) participants with T2D. Genetic association tests were performed on the genotyped and 1000 Genomes-imputed markers separately for each study, and combined in a meta-analysis. Single nucleotide polymorphisms (SNPs), rs11353135 (2q22.1), rs16879003 (6p22.3), rs5014012, rs58071836 and rs10244825 (all on chromosome 7), rs10918777 (9q31.2), rs13331874 (16p13.3) and rs4459623 (18q12.1) were associated with presence and/or quantity of CAC in the AA-DHS and JHS, with meta-analysis p-values ≤8.0 × 10−7. The strongest result in AA-DHS alone was rs6491315 in the 13q32.1 region (parameter estimate (SE) = −1.14 (0.20); p-value = 9.1 × 10−9). This GWAS peak replicated a previously reported AA-DHS CAC admixture signal (rs7492028, LOD score 2.8). Genetic association between SNPs on chromosomes 2, 6, 7, 9, 16 and 18 and CAC were detected in AAs with T2D from AA-DHS and replicated in the JHS. These data support a role for genetic variation on these chromosomes as contributors to CAC in AAs with T2D, as well as to variation in CAC between populations of African and European ancestry.

52 citations


Journal ArticleDOI
TL;DR: The accuracies observed in this study support the feasibility of genomic selection for growth, carcass and meat quality traits in New Zealand Terminal Sire breeds using the Ovine HD SNP chip.
Abstract: New Zealand has some unique Terminal Sire composite sheep breeds, which were developed in the last three decades to meet commercial needs. These composite breeds were developed based on crossing various Terminal Sire and Maternal breeds and, therefore, present high genetic diversity compared to other sheep breeds. Their breeding programs are focused on improving carcass and meat quality traits. There is an interest from the industry to implement genomic selection in this population to increase the rates of genetic gain. Therefore, the main objectives of this study were to determine the accuracy of predicted genomic breeding values for various growth, carcass and meat quality traits using a HD SNP chip and to evaluate alternative genomic relationship matrices, validation designs and genomic prediction scenarios. A large multi-breed population (n = 14,845) was genotyped with the HD SNP chip (600 K) and phenotypes were collected for a variety of traits. The average observed accuracies (± SD) for traits measured in the live animal, carcass, and, meat quality traits ranged from 0.18 ± 0.07 to 0.33 ± 0.10, 0.28 ± 0.09 to 0.55 ± 0.05 and 0.21 ± 0.07 to 0.36 ± 0.08, respectively, depending on the scenario/method used in the genomic predictions. When accounting for population stratification by adjusting for 2, 4 or 6 principal components (PCs) the observed accuracies of molecular breeding values (mBVs) decreased or kept constant for all traits. The mBVs observed accuracies when fitting both G and A matrices were similar to fitting only G matrix. The lowest accuracies were observed for k-means cross-validation and forward validation performed within each k-means cluster. The accuracies observed in this study support the feasibility of genomic selection for growth, carcass and meat quality traits in New Zealand Terminal Sire breeds using the Ovine HD SNP chip. There was a clear advantage on using a mixed training population instead of performing analyzes per genomic clusters. In order to perform genomic predictions per breed group, genotyping more animals is recommended to increase the size of the training population within each group and the genetic relationship between training and validation populations. The different scenarios evaluated in this study will help geneticists and breeders to make wiser decisions in their breeding programs.

46 citations


Journal ArticleDOI
TL;DR: This study provides the results of comparative transcriptome and DNA methylation analyses between RCC and WCC skin tissues and reveals that the molecular mechanism of body color variation in crucian carp is strongly related to disruptions in gene expression andDNA methylation during pigmentation.
Abstract: Crucian carp is a popular ornamental strain in Asia with variants in body color. To further explore the genetic mechanisms underlying gray and red body color formation in crucian carp, the skin transcriptomes and partial DNA methylation sites were obtained from red crucian carp (RCC) and white crucian carp (WCC). Here, we show significant differences in mRNA expression and DNA methylation sites between skin tissues of RCC and WCC. Totals of 3434 and 3683 unigenes had significantly lower and higher expression in WCC, respectively, compared with unigenes expressed in RCC. Some potential genes for body color development were further identified by quantitative polymerase chain reaction, such as mitfa, tyr, tyrp1, and dct, which were down-regulated, and foxd3, hpda, ptps, and gch1, which were up-regulated. A KEGG pathway analysis indicated that the differentially expressed genes were mainly related to mitogen activated protein kinase (MAPK), Wnt, cell cycle, and endocytosis signaling pathways, as well as variations in melanogenesis in crucian carp. In addition, some differentially expressed DNA methylation site genes were related to pigmentation, including mitfa, tyr, dct, foxd3, and hpda. The differentially expressed DNA methylation sites were mainly involved in signaling pathways, including MAPK, cAMP, endocytosis, melanogenesis, and Hippo. Our study provides the results of comparative transcriptome and DNA methylation analyses between RCC and WCC skin tissues and reveals that the molecular mechanism of body color variation in crucian carp is strongly related to disruptions in gene expression and DNA methylation during pigmentation.

41 citations


Journal ArticleDOI
TL;DR: This study discovered that the conventional two-enzyme GBS protocol could produce a large number of high-quality genotypes provided that appropriate filtration criteria were used and qualified the PstI/MspI GBS assay as a low-cost high-density genotyping platform.
Abstract: Genotyping-by-sequencing (GBS) has emerged as a powerful and cost-effective approach for discovering and genotyping single-nucleotide polymorphisms. The GBS technique was largely used in crop species where its low sequence coverage is not a drawback for calling genotypes because inbred lines are almost homozygous. In contrast, only a few studies used the GBS technique in animal populations (with sizeable heterozygosity rates) and many of those that have been published did not consider the quality of the genotypes produced by the bioinformatic pipelines. To improve the sequence coverage of the fragments, an alternative GBS preparation protocol that includes selective primers during the PCR amplification step has been recently proposed. In this study, we compared this modified protocol with the conventional two-enzyme GBS protocol. We also described various procedures to maximize the selection of high quality genotypes and to increase the accuracy of imputation. The in silico digestions of the bovine genome showed that the combination of PstI and MspI is more suitable for sequencing bovine GBS libraries than the use of single digestions with PstI or ApeKI. The sequencing output of the GBS libraries generated a total of 123,666 variants with the selective-primer approach and 272,103 variants with the conventional approach. Validating our data with genotypes obtained from mass spectrometry and Illumina’s bovine SNP50 array, we found that the genotypes produced by the conventional GBS method were concordant with those produced by these alternative genotyping methods, whereas the selective-primer method failed to call heterozygotes with confidence. Our results indicate that high accuracy in genotype calling (>97%) can be obtained using low read-depth thresholds (3 to 5 reads) provided that markers are simultaneously filtered for genotype quality scores. We also show that factors such as the minimum call rate and the minor allele frequency positively influence the accuracy of imputation of missing GBS data. The highest accuracies (around 85%) of imputed GBS markers were obtained with the FIMPUTE program when GBS and SNP50 array genotypes were combined (80,190 to 100,297 markers) before imputation. We discovered that the conventional two-enzyme GBS protocol could produce a large number of high-quality genotypes provided that appropriate filtration criteria were used. In contrast, the selective-primer approach resulted in a substantial proportion of miscalled genotypes and should be avoided for livestock genotyping studies. Overall, our study demonstrates that carefully adjusting the different filtering parameters applied to the GBS data is critical to maximize the selection of high quality genotypes and to increase the accuracy of imputation of missing data. The strategies and results presented here provide a framework to maximize the output of the GBS technique in animal populations and qualified the PstI/MspI GBS assay as a low-cost high-density genotyping platform. The conclusions reported here regarding read-depth and genotype quality filtering could benefit many GBS applications, notably genome-wide association studies, where there is a need to increase the density of markers genotyped across the target population while preserving the quality of genotypes.

Journal ArticleDOI
TL;DR: Analysis of patterns of genetic diversity of the VGSC gene revealed a high polymorphism level of this gene across Africa with no evidence of directional selection suggesting a limited role for knockdown resistance in pyrethroid resistance in An.
Abstract: Understanding the molecular basis of insecticide resistance is key to improve the surveillance and monitoring of malaria vector populations under control. In the major malaria vector Anopheles funestus, little is currently known about the role of the knockdown resistance (kdr) mechanism. Here, we investigated the presence and contribution of knockdown resistance (kdr) to pyrethroids/DDT resistance observed in Anopheles funestus across Africa. Pyrosequencing genotyping and sequencing of the voltage gated sodium channel (VGSC) gene did not detect the common L1014F mutation in field collected An. funestus across Africa. Amplification and cloning of the full-length of the sodium channel gene in pyrethroid resistant mosquitoes revealed evidences of alternative splicing events with three transcripts of 2092, 2061 and 2117 amino acids (93% average similarity to An. gambiae). Several amino acid changes were detected close to the domain II of the protein such as L928R, F938 W, I939S, L802S and T1008 M. However, all these mutations are found at low frequency and their role in pyrethroid resistance could not be established. The presence of the exclusive alternative splicing at exon 19 was not associated with resistance phenotype. Analysis of patterns of genetic diversity of the VGSC gene revealed a high polymorphism level of this gene across Africa with no evidence of directional selection suggesting a limited role for knockdown resistance in pyrethroid resistance in An. funestus. Patterns of genetic differentiation correlate with previous observations of the existence of barriers to gene flow Africa-wide with southern population significantly differentiated from other regions. Despite an apparent limited role of knockdown resistance in An. funestus, it is necessary to continue to monitor the contribution of the mutations detected here as increasing selection from insecticide-based interventions may change the dynamic in field populations as previously observed in other vectors.

Journal ArticleDOI
TL;DR: It is recommended to use internal weights from the study population itself to construct weighted GRS for GxE interaction studies, if the SNPs were chosen because a strong marginal genetic effect was hypothesized and GRS-marginal-internal should be used.
Abstract: Weighted genetic risk scores (GRS), defined as weighted sums of risk alleles of single nucleotide polymorphisms (SNPs), are statistically powerful for detection gene-environment (GxE) interactions. To assign weights, the gold standard is to use external weights from an independent study. However, appropriate external weights are not always available. In such situations and in the presence of predominant marginal genetic effects, we have shown in a previous study that GRS with internal weights from marginal genetic effects (“GRS-marginal-internal”) are a powerful and reliable alternative to single SNP approaches or the use of unweighted GRS. However, this approach might not be appropriate for detecting predominant interactions, i.e. interactions showing an effect stronger than the marginal genetic effect. In this paper, we present a weighting approach for such predominant interactions (“GRS-interaction-training”) in which parts of the data are used to estimate the weights from the interaction terms and the remaining data are used to determine the GRS. We conducted a simulation study for the detection of GxE interactions in which we evaluated power, type I error and sign-misspecification. We compared this new weighting approach to the GRS-marginal-internal approach and to GRS with external weights. Our simulation study showed that in the absence of external weights and with predominant interaction effects, the highest power was reached with the GRS-interaction-training approach. If marginal genetic effects were predominant, the GRS-marginal-internal approach was more appropriate. Furthermore, the power to detect interactions reached by the GRS-interaction-training approach was only slightly lower than the power achieved by GRS with external weights. The power of the GRS-interaction-training approach was confirmed in a real data application to the Traffic, Asthma and Genetics (TAG) Study (N = 4465 observations). When appropriate external weights are unavailable, we recommend to use internal weights from the study population itself to construct weighted GRS for GxE interaction studies. If the SNPs were chosen because a strong marginal genetic effect was hypothesized, GRS-marginal-internal should be used. If the SNPs were chosen because of their collective impact on the biological mechanisms mediating the environmental effect (hypothesis of predominant interactions) GRS-interaction-training should be applied.

Journal ArticleDOI
TL;DR: This investigation demonstrates a new mutation in the BMP15 gene providing a valuable genetic tool to control fecundity in Tunisian Barbarine, usable for diffusion program into conventional flocks looking for prolificacy improvement.
Abstract: Naturally occurring mutations in growth and differentiation factor 9 (GDF9) or bone morphogenetic protein 15 (BMP15) genes are associated with increased ovulation rate (OR) and litter size (LS) but also sterility. Observing the Tunisian Barbarine ewes of the “W” flock selected for improved prolificacy, we found prolific and infertile ewes with streaky ovaries. Blood genomic DNA was extracted from a subset of low-ovulating, prolific and infertile ewes of the “W” flock, and the entire coding sequences of GDF9 and BMP15 were sequenced. We evidenced a novel polymorphism in the exon 1 of the BMP15 gene associated with increased prolificacy and sterility. This novel mutation called FecX Bar is a composite polymorphism associating a single nucleotide substitution (c.301G > T), a 3 bp deletion (c.302_304delCTA) and a C insertion (c.310insC) in the ovine BMP15 cDNA leading to a frame shift at protein position 101. Calculated in the “W” flock, the FecX Bar allele increased OR by 0.7 ova and LS by 0.3 lambs (p = 0.08). As for already identified mutations, homozygous females carrying FecX Bar exhibited streaky ovaries with a blockade at the primary stage of folliculogenesis as shown by histochemistry. Our investigation demonstrates a new mutation in the BMP15 gene providing a valuable genetic tool to control fecundity in Tunisian Barbarine, usable for diffusion program into conventional flocks looking for prolificacy improvement.

Journal ArticleDOI
TL;DR: The finding suggested that the profiles of miRNAs were related to the physiological functions of mammary gland in the colostrum and peak lactation periods.
Abstract: MicroRNAs (miRNAs) have a great influence on various physiological functions. A lot of high-throughput sequencing (HTS) research on miRNAs has been executed in the caprine mammary gland at different lactation periods (common milk lactation and dry period), but little is known about differentially expressed miRNAs in the caprine mammary gland of colostrum and peak lactation periods. This study identified 131 differentially expressed miRNAs (P 1 or log2 colostrum NE/peak lactation NE 1 or log2 colostrum NE/peak lactation NE < −1). The expressions of 10 randomly selected miRNAs was analyzed through stem-loop real-time quantitative PCR (RT-qPCR). Their expression patterns were the same with Solexa sequencing results. Pathway analysis suggested that oestrogen, endocrine, adipocytokine, oxytocin and MAPK signalling pathways act on the development of mammary gland and milk secretion importantly. In addition, the miRNA-target-network showed that the bta-miR-574 could influence the development of mammary gland and lactation by leptin receptor (LEPR), which was in the adipocytokine signalling pathway. Chr5_3880_mature regulated mammary gland development and lactation through Serine/threonine-protein phosphatase (PPP1CA), which was in the oxytocin signalling pathway. Our finding suggested that the profiles of miRNAs were related to the physiological functions of mammary gland in the colostrum and peak lactation periods. The biological features of these miRNAs may help to clarify the molecular mechanisms of lactation and the development of caprine mammary gland.

Journal ArticleDOI
TL;DR: The results from an embryo-cytoplasm-maternal model for diploid seeds showed that OC was primarily determined by genetic effects (VG) and GE (VGE), which together accounted for 86.19% of the phenotypic variance (VP).
Abstract: As seed oil content (OC) is a key measure of rapeseed quality, better understanding the genetic basis of OC would greatly facilitate the breeding of high-oil cultivars. Here, we investigated the components of genetic effects and genotype × environment interactions (GE) that govern OC using a full diallel set of nine parents, which represented a wide range of the Chinese rapeseed cultivars and pure lines with various OCs. Our results from an embryo-cytoplasm-maternal (GoCGm) model for diploid seeds showed that OC was primarily determined by genetic effects (VG) and GE (VGE), which together accounted for 86.19% of the phenotypic variance (VP). GE (VGE) alone accounted for 51.68% of the total genetic variance, indicating the importance of GE interaction for OC. Furthermore, maternal variance explained 75.03% of the total genetic variance, embryo and cytoplasmic effects accounted for 21.02% and 3.95%, respectively. We also found that the OC of F1 seeds was mainly determined by maternal effect and slightly affected by xenia. Thus, the OC of rapeseed was simultaneously affected by various genetic components, including maternal, embryo, cytoplasm, xenia and GE effects. In addition, general combining ability (GCA), specific combining ability (SCA), and maternal variance had significant influence on OC. The lines H2 and H1 were good general combiners, suggesting that they would be the best parental candidates for OC improvement. Crosses H3 × M2 and H1 × M3 exhibited significant SCA, suggesting their potentials in hybrid development. Our study thoroughly investigated and reliably quantified various genetic factors associated with OC of rapeseed by using a full diallel and backcross and reciprocal backcross. This findings lay a foundation for future genetic studies of OC and provide guidance for breeding of high-oil rapeseed cultivars.

Journal ArticleDOI
TL;DR: The analysis highlights the unreliability of ethnicity classification based on patient self-reports and recommends the routine use of pan-ethnic carrier screening panels in reproductive medicine and the use of an ancestry model would allow better estimation of carrier rates and residual risks.
Abstract: Current professional society guidelines recommend genetic carrier screening be offered on the basis of ethnicity, or when using expanded carrier screening panels, they recommend to compute residual risk based on ethnicity. We investigated the reliability of self-reported ethnicity in 9138 subjects referred to carrier screening. Self-reported ethnicity gathered from test requisition forms and during post-test genetic counseling, and genetic ancestry predicted by a statistical model, were compared for concordance. We identified several discrepancies between the two sources of self-reported ethnicity and genetic ancestry. Only 30.3% of individuals who indicated Mediterranean ancestry during consultation self-reported this on requisition forms. Additionally, the proportion of individuals who reported Southeast Asian but were estimated to have a different genetic ancestry was found to depend on the source of self-report. Finally, individuals who reported Latin American demonstrated a high degree of ancestral admixture. As a result, carrier rates and residual risks provided for patient decision-making are impacted if using self-reported ethnicity. Our analysis highlights the unreliability of ethnicity classification based on patient self-reports. We recommend the routine use of pan-ethnic carrier screening panels in reproductive medicine. Furthermore, the use of an ancestry model would allow better estimation of carrier rates and residual risks.

Journal ArticleDOI
TL;DR: The levels of linkage disequilibrium indicate that genomic selection could be implemented with the high density SNP panel, and the moderate to high consistency of gametic phase between breed-groups and overlapping population structure support the pooling of the animals in a mixed training population for genomic predictions.
Abstract: Knowledge about the genetic diversity of a population is a crucial parameter for the implementation of successful genomic selection and conservation of genetic resources. The aim of this research was to establish the scientific basis for the implementation of genomic selection in a composite Terminal sheep breeding scheme by providing consolidated linkage disequilibrium (LD) measures across SNP markers, estimating consistency of gametic phase between breed-groups, and assessing genetic diversity measures, such as effective population size (Ne), and population structure parameters, using a large number of animals (n = 14,845) genotyped with a high density SNP chip (606,006 markers). Information generated in this research will be useful for optimizing molecular breeding values predictions and managing the available genetic resources. Overall, as expected, levels of pairwise LD decreased with increasing distance between SNP pairs. The mean LD r2 between adjacent SNP was 0.26 ± 0.10. The most recent effective population size for all animals (687) and separately per breed-groups: Primera (974), Lamb Supreme (380), Texel (227) and Dual-Purpose (125) was quite variable. The genotyped animals were outbred or had an average low level of inbreeding. Consistency of gametic phase was higher than 0.94 for all breed pairs at the average distance between SNP on the chip (~4.74 kb). Moreover, there was not a clear separation between the breed-groups based on principal component analysis, suggesting that a mixed-breed training population for calculation of molecular breeding values would be beneficial. This study reports, for the first time, estimates of linkage disequilibrium, genetic diversity and population structure parameters from a genome-wide perspective in New Zealand Terminal Sire composite sheep breeds. The levels of linkage disequilibrium indicate that genomic selection could be implemented with the high density SNP panel. The moderate to high consistency of gametic phase between breed-groups and overlapping population structure support the pooling of the animals in a mixed training population for genomic predictions. In addition, the moderate to high Ne highlights the need to genotype and phenotype a large training population in order to capture most of the haplotype diversity and increase accuracies of genomic predictions. The results reported herein are a first step toward understanding the genomic architecture of a Terminal Sire composite sheep population and for the optimal implementation of genomic selection and genome-wide association studies in this sheep population.

Journal ArticleDOI
TL;DR: The rate of LD decay for the three Canadian pig breeds indicates that genomic selection can be successfully implemented within breeds with the current 60 K SNP panel, and the use of a multi-breed training population involving Landrace and Yorkshire to estimate the genomic breeding values of crossbred animals should be further evaluated.
Abstract: Knowledge on the levels of linkage disequilibrium (LD) across the genome, persistence of gametic phase between breed pairs, genetic diversity and population structure are important parameters for the successful implementation of genomic selection. Therefore, the objectives of this study were to investigate these parameters in order to assess the feasibility of a multi-herd and multi-breed training population for genomic selection in important purebred and crossbred pig populations in Canada. A total of 3,057 animals, representative of the national populations, were genotyped with the Illumina Porcine SNP60 BeadChip (62,163 markers). The overall LD (r 2) between adjacent SNPs was 0.49, 0.38, 0.40 and 0.31 for Duroc, Landrace, Yorkshire and Crossbred (Landrace x Yorkshire) populations, respectively. The highest correlation of phase (r) across breeds was observed between Crossbred animals and either Landrace or Yorkshire breeds, in which r was approximately 0.80 at 1 Mbp of distance. Landrace and Yorkshire breeds presented r ≥ 0.80 in distances up to 0.1 Mbp, while Duroc breed showed r ≥ 0.80 for distances up to 0.03 Mbp with all other populations. The persistence of phase across herds were strong for all breeds, with r ≥ 0.80 up to 1.81 Mbp for Yorkshire, 1.20 Mbp for Duroc, and 0.70 Mbp for Landrace. The first two principal components clearly discriminate all the breeds. Similar levels of genetic diversity were observed among all breed groups. The current effective population size was equal to 75 for Duroc and 92 for both Landrace and Yorkshire. An overview of population structure, LD decay, demographic history and inbreeding of important pig breeds in Canada was presented. The rate of LD decay for the three Canadian pig breeds indicates that genomic selection can be successfully implemented within breeds with the current 60 K SNP panel. The use of a multi-breed training population involving Landrace and Yorkshire to estimate the genomic breeding values of crossbred animals (Landrace × Yorkshire) should be further evaluated. The lower correlation of phase at short distances between Duroc and the other breeds indicates that a denser panel may be required for the use of a multi-breed training population including Duroc.

Journal ArticleDOI
TL;DR: Mapping chicken leptin together with a cluster of five syntenic genes provided the final proof for its identification as the true chicken ortholog and suggests other similar clusters of genes in GC-rich genomic regions are missing from the current genome assembly (Galgal5), which should be resolved in future assemblies of the chicken genome.
Abstract: Misidentification of the chicken leptin gene has hampered research of leptin signaling in this species for almost two decades. Recently, the genuine leptin gene with a GC-rich (~70%) repetitive-sequence content was identified in the chicken genome but without indicating its genomic position. This suggests that such GC-rich sequences are difficult to sequence and therefore substantial regions are missing from the current chicken genome assembly. A radiation hybrid panel of chicken-hamster Wg3hCl2 cells was used to map the genome location of the chicken leptin gene. Contrary to our expectations, based on comparative genome mapping and sequence characteristics, the chicken leptin was not located on a microchromosome, which are known to contain GC-rich and repetitive regions, but at the distal tip of the largest chromosome (1p). Following conserved synteny with other vertebrates, we also mapped five additional genes to this genomic region (ARF5, SND1, LRRC4, RBM28, and FLNC), bridging the genomic gap in the current Galgal5 build for this chromosome region. All of the short scaffolds containing these genes were found to consist of GC-rich (54 to 65%) sequences comparing to the average GC-content of 40% on chromosome 1. In this syntenic group, the RNA-binding protein 28 (RBM28) was in closest proximity to leptin. We deduced the full-length of the RBM28 cDNA sequence and profiled its expression patterns detecting a negative correlation (R = − 0.7) between the expression of leptin and of RBM28 across tissues that expressed at least one of the genes above the average level. This observation suggested a local regulatory interaction between these genes. In adipose tissues, we observed a significant increase in RBM28 mRNA expression in breeds with lean phenotypes. Mapping chicken leptin together with a cluster of five syntenic genes provided the final proof for its identification as the true chicken ortholog. The high GC-content observed for the chicken leptin syntenic group suggests that other similar clusters of genes in GC-rich genomic regions are missing from the current genome assembly (Galgal5), which should be resolved in future assemblies of the chicken genome.

Journal ArticleDOI
TL;DR: This project has patched the hole in the genetic map of Eurasia by demonstrating complexity of genetic structure of Northern Eurasians, existence of East-West and North-South genetic gradients, and assessed different inputs of ancient populations into modern populations.
Abstract: The history of human populations occupying the plains and mountain ridges separating Europe from Asia has been eventful, as these natural obstacles were crossed westward by multiple waves of Turkic and Uralic-speaking migrants as well as eastward by Europeans. Unfortunately, the material records of history of this region are not dense enough to reconstruct details of population history. These considerations stimulate growing interest to obtain a genetic picture of the demographic history of migrations and admixture in Northern Eurasia. We genotyped and analyzed 1076 individuals from 30 populations with geographical coverage spanning from Baltic Sea to Baikal Lake. Our dense sampling allowed us to describe in detail the population structure, provide insight into genomic history of numerous European and Asian populations, and significantly increase quantity of genetic data available for modern populations in region of North Eurasia. Our study doubles the amount of genome-wide profiles available for this region. We detected unusually high amount of shared identical-by-descent (IBD) genomic segments between several Siberian populations, such as Khanty and Ket, providing evidence of genetic relatedness across vast geographic distances and between speakers of different language families. Additionally, we observed excessive IBD sharing between Khanty and Bashkir, a group of Turkic speakers from Southern Urals region. While adding some weight to the “Finno-Ugric” origin of Bashkir, our studies highlighted that the Bashkir genepool lacks the main “core”, being a multi-layered amalgamation of Turkic, Ugric, Finnish and Indo-European contributions, which points at intricacy of genetic interface between Turkic and Uralic populations. Comparison of the genetic structure of Siberian ethnicities and the geography of the region they inhabit point at existence of the “Great Siberian Vortex” directing genetic exchanges in populations across the Siberian part of Asia. Slavic speakers of Eastern Europe are, in general, very similar in their genetic composition. Ukrainians, Belarusians and Russians have almost identical proportions of Caucasus and Northern European components and have virtually no Asian influence. We capitalized on wide geographic span of our sampling to address intriguing question about the place of origin of Russian Starovers, an enigmatic Eastern Orthodox Old Believers religious group relocated to Siberia in seventeenth century. A comparative reAdmix analysis, complemented by IBD sharing, placed their roots in the region of the Northern European Plain, occupied by North Russians and Finno-Ugric Komi and Karelian people. Russians from Novosibirsk and Russian Starover exhibit ancestral proportions close to that of European Eastern Slavs, however, they also include between five to 10 % of Central Siberian ancestry, not present at this level in their European counterparts. Our project has patched the hole in the genetic map of Eurasia: we demonstrated complexity of genetic structure of Northern Eurasians, existence of East-West and North-South genetic gradients, and assessed different inputs of ancient populations into modern populations.

Journal ArticleDOI
TL;DR: In this work, CLOCK gene variants were associated with sleep duration and BMI suggesting that the effects of these polymorphisms on circadian rhythmicity may affectSleep duration and body weight regulation in Africans Americans.
Abstract: Circadian rhythms regulate key biological processes and the dysregulation of the intrinsic clock mechanism affects sleep patterns and obesity onset. The CLOCK (circadian locomotor output cycles protein kaput) gene encodes a core transcription factor of the molecular circadian clock influencing diverse metabolic pathways, including glucose and lipid homeostasis. The primary objective of this study was to evaluate the associations between CLOCK single nucleotide polymorphisms (SNPs) and body mass index (BMI). We also evaluated the association of SNPs with BMI related factors such as sleep duration and quality, adiponectin and leptin, in 2962 participants (1116 men and 1810 women) from the Jackson Heart Study. Genotype data for the selected 23 CLOCK gene SNPS was obtained by imputation with IMPUTE2 software and reference phase data from the 1000 genome project. Genetic analyses were conducted with PLINK We found a significant association between the CLOCK SNP rs2070062 and sleep duration, participants carriers of the T allele showed significantly shorter sleep duration compared to non-carriers after the adjustment for individual proportions of European ancestry (PEA), socio economic status (SES), body mass index (BMI), alcohol consumption and smoking status that reach the significance threshold after multiple testing correction. In addition, we found nominal associations of the CLOCK SNP rs6853192 with longer sleep duration and the rs6820823, rs3792603 and rs11726609 with BMI. However, these associations did not reach the significance threshold after correction for multiple testing. In this work, CLOCK gene variants were associated with sleep duration and BMI suggesting that the effects of these polymorphisms on circadian rhythmicity may affect sleep duration and body weight regulation in Africans Americans.

Journal ArticleDOI
TL;DR: Overdominant QTLs are important factors for heterosis of PH and EH, and a major QTL qPH.A-1.3 may be a desired target for genetic improvement of maize plant height.
Abstract: Plant height (PH) and ear height (EH) are two important agronomic traits in maize selection breeding. F1 hybrid exhibit significant heterosis for PH and EH as compared to their parental inbred lines. To understand the genetic basis of heterosis controlling PH and EH, we conducted quantitative trait locus (QTL) analysis using a recombinant inbreed line (RIL) based design III population derived from the elite maize hybrid Zhengdan 958 in five environments. A total of 14 environmentally stable QTLs were identified, and the number of QTLs for Z1 and Z2 populations was six and eight, respectively. Notably, all the eight environmentally stable QTLs for Z2 were characterized by overdominance effect (OD), suggesting that overdominant QTLs were the most important contributors to heterosis for PH and EH. Furthermore, 14 environmentally stable QTLs were anchored on six genomic regions, among which four are trait-specific QTLs, suggesting that the genetic basis for PH and EH is partially different. Additionally, qPH.A-1.3, modifying about 10 centimeters of PH, was further validated in backcross populations. The genetic basis for PH and EH is partially different, and overdominant QTLs are important factors for heterosis of PH and EH. A major QTL qPH.A-1.3 may be a desired target for genetic improvement of maize plant height.

Journal ArticleDOI
TL;DR: A compendium combining mammalian genes involved in adaptation to cold environment was designed, based on the intersection of positively selected genes from six Arctic and Antarctic species, which revealed that genes from compendium were over-represented in the lists of genes expressed in the lung and liver.
Abstract: Many mammals are well adapted to surviving in extremely cold environments. These species have likely accumulated genetic changes that help them efficiently cope with low temperatures. It is not known whether the same genes related to cold adaptation in one species would be under selection in another species. The aims of this study therefore were: to create a compendium of mammalian genes related to adaptations to a low temperature environment; to identify genes related to cold tolerance that have been subjected to independent positive selection in several species; to determine promising candidate genes/pathways/organs for further empirical research on cold adaptation in mammals. After a search for publications containing keywords: “whole genome”, “transcriptome or exome sequencing data”, and “genome-wide genotyping array data” authors looked for information related to genetic signatures ascribable to positive selection in Arctic or Antarctic mammalian species. Publications related to Human, Arctic fox, Yakut horse, Mammoth, Polar bear, and Minke whale were chosen. The compendium of genes that potentially underwent positive selection in >1 of these six species consisted of 416 genes. Twelve of them showed traces of positive selection in three species. Gene ontology term enrichment analysis of 416 genes from the compendium has revealed 13 terms relevant to the scope of this study. We found that enriched terms were relevant to three major groups: terms associated with collagen proteins and the extracellular matrix; terms associated with the anatomy and physiology of cilium; terms associated with docking. We further revealed that genes from compendium were over-represented in the lists of genes expressed in the lung and liver. A compendium combining mammalian genes involved in adaptation to cold environment was designed, based on the intersection of positively selected genes from six Arctic and Antarctic species. The compendium contained 416 genes that have been positively selected in at least two species. However, we did not reveal any positively selected genes that would be related to cold adaptation in all species from our list. But, our work points to several strong candidate genes involved in mechanisms and biochemical pathways related to cold adaptation response in different species.

Journal ArticleDOI
TL;DR: Analysis of the expression patterns of ClLAX, ClPIN and ClABCB genes under salt, drought, cold treatment and grafting response helps to understand the possible roles of auxin transporter genes in watermelon adaptation to environmental stresses.
Abstract: Auxin plays an important role in regulating plant growth and development as well as in the response of plants to abiotic stresses. Auxin is transported by three kinds of major protein families, including the AUXIN RESISTANT 1/LIKE AUX1 (AUX⁄LAX) influx carriers, the PIN-FORMED (PIN) efflux carriers and the ATP binding cassette B/P-glycoprotein/Multidrug-resistance (ABCB/MDR/PGP) efflux/condition carriers. The biological function of several auxin transporter genes has been well characterized in Arabidopsis thaliana. However, their function in response to exogenous auxin and abiotic stresses in watermelon (Citrullus lanatus. L) remained unknown. Here, the latest updated watermelon genome was used to characterise the ClLAX, ClPIN and ClABCB family genes from watermelon. The genome-wide analysis of the ClLAX, ClPIN and ClABCB family genes, including chromosome localisation, gene structure, and phylogenic relationships, was carried out. Seven ClLAXs, 11 ClPINs and 15 ClABCBs were mapped on 10 watermelon chromosomes. The expression profiles of the ClLAX, ClPIN and ClABCB genes under exogenous indole-3-acetic acid and various abiotic stresses (salt, drought, and cold stresses) treatments were performed by quantitative real-time PCR (qRT-PCR). The transcriptional level of majority ClLAX, ClPIN and ClABCB genes were changed by abiotic stresses in both shoots and roots. We also analysed the expression levels of ClLAX, ClPIN and ClABCB genes in graft response. Analysis of the expression patterns of ClLAX, ClPIN and ClABCB genes under salt, drought, cold treatment and grafting response helps us to understand the possible roles of auxin transporter genes in watermelon adaptation to environmental stresses.

Journal ArticleDOI
TL;DR: A new African subgroup was described in the SWIO region with mitochondrial genetic evidence that A. m.
Abstract: The South West Indian Ocean (SWIO) archipelagos and Madagascar constitute a hotspot of biodiversity with a high rate of endemism. In this area, the endemic subspecies A. m. unicolor has been described in Madagascar. It belongs to the African lineage, one of the four described evolutionary lineages in honey bees. Despite a long beekeeping tradition and several recorded European introductions, few studies have been carried out on the diversity and proportion of honey bee subspecies. In order to identify and define which evolutionary lineages and potential sub-lineages are present in the SWIO, the COI-COII intergenic region and the ND2 gene of the mtDNA were sequenced in honey bee colonies from three archipelagos. An extensive sampling (n = 1184 colonies) was done in the Mascarene (La Reunion, Mauritius, Rodrigues), Seychelles (Mahe, Praslin, La Digue) and Comoros (Grande Comore, Moheli, Anjouan, Mayotte) archipelagos. Islands genetic diversity was compared to newly sampled populations from Madagascar, continental African and European populations. African lineage haplotypes were found in all islands (except for Rodrigues). Madagascar, Comoros and Seychelles had 100% of A lineage, 95.5% in La Reunion and 56.1% in Mauritius. Among all African colonies detected in the SWIO, 98.1% (n = 633) of COI-COII haplotypes described the presence of the subspecies A. M. unicolor. Both genetic markers revealed i) a new private AI mitochondrial group shared by the SWIO archipelagos and Madagascar distant from continental populations; ii) the private African haplotypes for each island suggested diversity radiation in the archipelagos; iii) the detection of the Comoros archipelago as a possible contact area between insular and continental African populations. The exotic European C and M lineages were only detected in the Mascarene archipelago, but striking differences of proportion were observed among islands. Merely 4.6% of European colonies were found in La Reunion whereas Mauritius cumulated 44%. Here, among the 84 observed COI-COII haplotypes, 50 were newly described including 13 which were private to the SWIO archipelagos and Madagascar. Similarly, 24 of the 34 found ND2 haplotypes were novel which included six haplotypes particular to the SWIO populations. A new African subgroup was described in the SWIO region with mitochondrial genetic evidence that A. m. unicolor is the indigenous subspecies of the archipelagos surrounding Madagascar.

Journal ArticleDOI
TL;DR: Chromosomal variability found in T. obscurus are due to rearrangements of the type fusion/fission and reciprocal translocations in heterozygous, and may be contributing to reproductive isolation between populations analyzed.
Abstract: Holocentric chromosomes occur in approximately 750 species of eukaryotes. Among them, the genus Tityus (Scorpiones, Buthidae) has a labile karyotype that shows complex multivalent associations during male meiosis. Thus, taking advantage of the excellent model provided by the Buthidae scorpions, here we analyzed the chromosomal distribution of several repetitive DNA classes on the holocentric chromosomes of different populations of the species Tityus obscurus Gervais, 1843, highlighting their involvement in the karyotypic differences found among them. This species shows inter- and intrapopulational karyotype variation, with seven distinct cytotypes: A (2n = 16), B (2n = 14), C (2n = 13), D (2n = 13), E (2n = 12), F (2n = 12) and G (2n = 11). Furthermore, exhibits achiasmatic male meiosis and lacks heteromorphic sex chromosomes. Trivalent and quadrivalent meiotic associations were found in some cytotypes. In them, 45S rDNAs were found in the terminal portions of two pairs, while TTAGG repeats were found only at the end of the chromosomes. In the cytotype A (2n = 16), the U2 snRNA gene mapped to pair 1, while the H3 histone cluster and C 0 t-1 DNA fraction was terminally distributed on all pairs. Mariner transposons were found throughout the chromosomes, with the exception of one individual of cytotype A (2n = 16), in which it was concentrated in heterochromatic regions. Chromosomal variability found in T. obscurus are due to rearrangements of the type fusion/fission and reciprocal translocations in heterozygous. These karyotype differences follow a geographical pattern and may be contributing to reproductive isolation between populations analyzed. Our results also demonstrate high mobility of histone H3 genes. In contrast, other multigene families (45S rDNA and U2 snRNA) have conserved distribution among individuals. The accumulation of repetitive sequences in distal regions of T. obscurus chromosomes, suggests that end of chromosome are not covered by the kinetochore.

Journal ArticleDOI
TL;DR: The results suggest that the T allele is associated with increased LM and elite athletic success, which has implications for athletic populations, as well as conditions characterised by low LM such as sarcopenia and cachexia.
Abstract: FTO gene variants have been associated with obesity phenotypes in sedentary and obese populations, but rarely with skeletal muscle and elite athlete phenotypes. In 1089 participants, comprising 530 elite rugby athletes and 559 non-athletes, DNA was collected and genotyped for the FTO rs9939609 variant using real-time PCR. In a subgroup of non-resistance trained individuals (NT; n = 120), we also assessed structural and functional skeletal muscle phenotypes using dual energy x-ray absorptiometry, ultrasound and isokinetic dynamometry. In a subgroup of rugby athletes (n = 77), we assessed muscle power during a countermovement jump. In NT, TT genotype and T allele carriers had greater total body (4.8% and 4.1%) and total appendicular lean mass (LM; 3.0% and 2.1%) compared to AA genotype, with greater arm LM (0.8%) in T allele carriers and leg LM (2.1%) for TT, compared to AA genotype. Furthermore, the T allele was more common (94%) in selected elite rugby union athletes (back three and centre players) who are most reliant on LM rather than total body mass for success, compared to other rugby athletes (82%; P = 0.01, OR = 3.34) and controls (84%; P = 0.03, OR = 2.88). Accordingly, these athletes had greater peak power relative to body mass than other rugby athletes (14%; P = 2 x 10-6). Collectively, these results suggest that the T allele is associated with increased LM and elite athletic success. This has implications for athletic populations, as well as conditions characterised by low LM such as sarcopenia and cachexia.

Journal ArticleDOI
TL;DR: The results suggest that Ne has decreased since the development of sheep breeds in Europe and this reduction in Ne has been accelerated in the last three decades.
Abstract: Investments in genetic selection have played a major role in the New Zealand sheep industry competitiveness. Selection may erode genetic diversity, which is a crucial factor for the success of breeding programs. Better understanding of linkage disequilibrium (LD) and ancestral effective population size (Ne) through quantifying this diversity and comparison between populations allows for more informed decisions with regards to selective breeding taking population genetic diversity into account. The estimation of N e can be determined via genetic markers and requires knowledge of genetic distances between these markers. Single nucleotide polymorphisms (SNP) data from a sample of 12,597 New Zealand crossbred and purebred sheep genotyped with the Illumina Ovine SNP50 BeadChip was used to perform a genome-wide scan of LD and N e . Three methods to estimate genetic distances were investigated: 1) M1: a ratio fixed across the whole genome of one Megabase per centiMorgan; 2) M2: the ratios of genetic distance (using M3, below) over physical distance fixed for each chromosome; and, 3) M3: a genetic map of inter-SNP distances estimated using CRIMAP software (v2.503). The estimates obtained with M2 and M3 showed much less variability between autosomes than those with M1, which tended to give lower N e results and higher LD decay. The results suggest that N e has decreased since the development of sheep breeds in Europe and this reduction in Ne has been accelerated in the last three decades. The N e estimated for five generations in the past ranged from 71 to 237 for Texel and Romney breeds, respectively. A low level of genetic kinship and inbreeding was estimated in those breeds suggesting avoidance of mating close relatives. M3 was considered the most accurate method to create genetic maps for the estimation of LD and Ne. The findings of this study highlight the history of genetic selection in New Zealand crossbred and purebred sheep and these results will be very useful to understand genetic diversity of the population with respect to genetic selection. In addition, it will help geneticists to identify genomic regions which have been preferentially selected within a variety of breeds and populations.

Journal ArticleDOI
TL;DR: This work evaluated different GP approaches to simultaneously model genotype-by-year (GY) effects and breeding values using multi-year data in terms of predictive ability and confirmed that using data from multiple selection stages provides valuable GY information and helps increasing predictive ability.
Abstract: The use of multiple genetic backgrounds across years is appealing for genomic prediction (GP) because past years’ data provide valuable information on marker effects. Nonetheless, single-year GP models are less complex and computationally less demanding than multi-year GP models. In devising a suitable analysis strategy for multi-year data, we may exploit the fact that even if there is no replication of genotypes across years, there is plenty of replication at the level of marker loci. Our principal aim was to evaluate different GP approaches to simultaneously model genotype-by-year (GY) effects and breeding values using multi-year data in terms of predictive ability. The models were evaluated under different scenarios reflecting common practice in plant breeding programs, such as different degrees of relatedness between training and validation sets, and using a selected fraction of genotypes in the training set. We used empirical grain yield data of a rye hybrid breeding program. A detailed description of the prediction approaches highlighting the use of kinship for modeling GY is presented. Using the kinship to model GY was advantageous in particular for datasets disconnected across years. On average, predictive abilities were 5% higher for models using kinship to model GY over models without kinship. We confirmed that using data from multiple selection stages provides valuable GY information and helps increasing predictive ability. This increase is on average 30% higher when the predicted genotypes are closely related with the genotypes in the training set. A selection of top-yielding genotypes together with the use of kinship to model GY improves the predictive ability in datasets composed of single years of several selection cycles. Our results clearly demonstrate that the use of multi-year data and appropriate modeling is beneficial for GP because it allows dissecting GY effects from genomic estimated breeding values. The model choice, as well as ensuring that the predicted candidates are sufficiently related to the genotypes in the training set, are crucial.