Citation: Fulton JE, Arango J, Ali RA, Bohorquez EB, Lund AR, et al. (2014) Genetic Variation within the Mx Gene of Commercially Selected Chicken Lines Reveals Multiple Haplotypes, Recombination and a Protein under Selection Pressure. PLoS ONE 9(9): e108054. doi:10.1371/journal.pone.0108054
Editor: Ana Paula Arez, Instituto de Higiene e Medicina Tropical, Portugal
Received: April 22, 2014; Accepted: August 18, 2014; Published: September 22, 2014
Copyright: © 2014 Fulton et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are included within the paper.
Funding: EB was supported in part by a grant from the North Carolina Agriculture Foundation. NCAF had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Analyses conducted by MK and CA were supported in part by a grant from Hy-Line International. Hy-Line International provided support in the form of salaries for authors JEF, JA, ARL, PS and NPO, and did have a role in the study design, data collection and analysis, decision to publish, and preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.
Competing interests: This work was a collaboration between scientists at Hy-Line International and North Carolina State University. Hy-Line International provided samples for sequencing and also performed additional SNP confirmation analysis and trait association analysis. JEF, JA, ARL, PS and NPO are employees of Hy-Line International. There are no patents, products in development or marketed products to declare. This does not alter the authors' adherence to PLOS ONE policies on sharing data and materials.
Abstract
The Mx protein is one of the best-characterized interferon-stimulated antiviral mediators. Mx homologs have been identified in most vertebrates examined; however, their location within the cell, their level of activity, and the viruses they inhibit vary widely. Recent studies have demonstrated multiple Mx alleles in chickens and some reports have suggested a specific variant (S631N) within exon 14 confers antiviral activity. In the current study, the complete genomes of nine elite egg-layer type lines were sequenced and multiple variants of the Mx gene were identified. Within the coding region and upstream putative promoter region, 36 SNP variants were identified, producing a total of 12 unique haplotypes. Each elite line contained from one to four haplotypes, with many of these haplotypes being found in only one line. Observation of changes in haplotype frequency over generations, as well as recombination, suggested some unknown selection pressure on the Mx gene. Trait association analysis with either individual SNP or haplotypes showed a significant effect of Mx haplotype on several egg production related traits, and on mortality following Marek’s disease virus challenge in some lines. Examination of the location of the various SNP within the protein suggests synonymous SNP tend to be found within structural or enzymatic regions of the protein, while non-synonymous SNP are located in less well defined regions. The putative resistance variant N631 was found in five of the 12 haplotypes with an overall frequency of 47% across the nine lines. Two Mx recombinants were identified within the elite populations, indicating that novel variation can arise and be maintained within intensively selected lines. Collectively, these results suggest the conflicting reports in the literature describing the impact of the different SNP on chicken Mx function may be due to the varying context of haplotypes present in the populations studied.
Introduction
The Myxovirus-resistance (Mx) proteins are interferon-induced, dynamin-like, large GTPases that were first identified because of their association with influenza virus resistance in laboratory mice [1]. Since this initial description, Mx homologues have been described in multiple species. In most animals, at least two Mx genes have been described; however, not all of these different Mx genes have documented antiviral activity (reviewed in [2]). Avian species appear to have just one Mx gene. There is conflicting evidence in the literature on the antiviral properties of the avian Mx proteins. Initial studies of the Mx proteins expressed by chickens and ducks failed to demonstrate antiviral activity [3,4]. Subsequent studies reported the existence of multiple Mx alleles among different genetic lines of chickens and mouse cell lines transfected with different chicken Mx alleles showed antiviral activity [5]. Fourteen amino acid variants were identified within the Mx protein from multiple chicken breeds, with antiviral activity seemingly linked to one amino acid variant at position 631 (S631N) [5,6].
Polymorphisms of the Mx gene have been reported in multiple breeds of chickens, including Australorp, Fayoumi, Japanese native chickens, Indonesian native chickens, White Leghorns, Broilers and inbred laboratory lines [5,7–10]. Most of these reports have focused primarily on exon 14 (13th coding exon) and the S631N variant, though other non-synonymous variants have been identified in other exons, and multiple haplotypes have been recognized.
The association between the 631 variant and antiviral activity was investigated in various breeds and with different systems. In vivo work by Ewald et al. [11] using commercial meat-type (broiler) chicks suggested that those with the N631 variant were more resistant to viral challenge than those with the S631 variant. However, other laboratories using primary chicken embryo fibroblasts, transfected cells, chicken embryos, or chicks found no difference in the resistance to influenza virus infection regardless of which 631 variant was expressed [10,12–15]. Furthermore, Schusser et al. [14] demonstrated that neither of these Mx variants (N631 or S631) had GTPase activity, which is essential for antiviral activity [16]. Collectively, these results suggest that any antiviral activity expressed by the chicken Mx is likely more complicated than just one amino acid position, especially given the complex structural interactions involved in Mx biology and the number of polymorphisms reported for the chicken Mx gene.
The significance of variation in the Mx gene and its potential role in resistance to avian influenza is intriguing. We report here variation in the promoter, 5' untranslated, and coding regions of the Mx gene in 9 elite chicken lines representing the three different breeds used by Hy-Line International for commercial egg production. These chicken lines contribute to over 40% of the commercial egg layer production birds in the world and thus represent a considerable proportion of global commercial egg production.
Materials and Methods
Genetic Material
A DNA archive consisting of multiple generations of males (1996–2011) from 9 different chicken lines was utilized for all studies. These lines are the elite pure lines used to produce commercially utilized egg-production chickens. DNA was obtained from blood from 135–1,264 birds from multiple generations of each line. Genotype and phenotype information from 7,964 samples was utilized for all subsequent trait association analyses. These 9 lines encompass three different breeds; six White Leghorn lines (WL) that produce white-shell eggs, one Rhode Island Red derived line (RIR) that produces brown-shell eggs and two White Plymouth Rock derived lines (WPR) that produce brown-shell eggs.
SNP Identification and Genotyping
Initially twelve individuals from each line were sequenced using Sanger sequencing (BigDye3.1, Applied Biosystems) on an ABI3100 (Applied Biosystems), and the resulting sequence data was analyzed using Vector NTI v. 10 (Invitrogen) to determine the number of potential SNP and haplotypes within these lines. Primer pairs (12) used to produce the template amplicons for sequencing (exons only) are as defined in Table 1. Subsequently, DNA pools (10 individuals per pool, 1 pool per line) were sequenced using the Illumina GAIIx at 7–10x coverage [17]. This resequence data was visualized using IGV viewer, version 2.3.8 [18] which allowed visualization of putative SNP within flanking regions, introns and exons of each line.
The confirmation of SNP was accomplished using fluorescence-based competitive allele-specific PCR with KASP chemistry [19]. SNP-specific primer sets were developed using flanking sequence information. Initially genotype information was obtained on individuals from the 1995 or 1996 and 2010 generations. Linkage analysis indicated that specific SNP alleles were found in linkage disequilibrium within the lines. This ultimately defined Mx haplotypes. The minimum number of SNP required to identify haplotypes within each line was then used to genotype the individuals of the remaining generations (1997–2009, and 2011).
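The idea of genotyping only the minimum number of SNP needed to tell the haplotypes of a line apart can be illustrated with a small brute-force search. This is a sketch under stated assumptions, not the procedure used in the study: the function name `minimal_tag_snps` and the toy haplotype strings are hypothetical and are not taken from Table 3.

```python
from itertools import combinations

def minimal_tag_snps(haplotypes):
    """Return the smallest tuple of SNP indices that tells all haplotypes apart.

    haplotypes: dict mapping haplotype name -> string of alleles, one character
    per SNP, all strings the same length. Exhaustive search, so only suitable
    for a modest number of SNP.
    """
    names = list(haplotypes)
    n_snps = len(haplotypes[names[0]])
    for size in range(0, n_snps + 1):
        for subset in combinations(range(n_snps), size):
            # Signature of each haplotype restricted to the chosen SNP
            signatures = {tuple(haplotypes[h][i] for i in subset) for h in names}
            if len(signatures) == len(names):
                return subset
    return None  # only reached if two haplotypes are identical at every SNP

# Toy data: hypothetical haplotypes segregating in one line (not from Table 3)
toy_line = {
    "hapA": "GCTA",
    "hapB": "GCTG",
    "hapC": "ACTG",
}
print(minimal_tag_snps(toy_line))  # (0, 3)
```

In the toy line no single SNP separates all three haplotypes, so two SNP are required. A line fixed for one haplotype returns an empty tuple, since no genotyping is needed to assign its haplotype.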
Phenotypic Information
Mortality traits. Two mortality-related traits were measured. Mortality during grow and lay (LM) was recorded from the sire families which had been placed in multiple field test locations under typical commercial environments. Mortality traits were recorded as sire family means in percent based on 30 daughters per sire, and were measured across generations and genetic lines. The second mortality trait was from multiple generations of a Marek’s disease virus (MDV) [20] challenge test also using the progeny testing model. This phenotype was also measured using 30 daughters of each candidate sire. Chicks were maternal antibody positive and were vaccinated at day of age with HVT/SB1 following standard industry practices. At 7 days of age, chicks were inoculated with 500 pfu of the highly virulent strain of serotype 1 MDV (vv+ isolate 686) and mortality due to MDV (MM) was recorded until 17–18 weeks of age [20].
Performance Traits. The performance traits recorded are typical for commercial egg production lines and include egg production (egg number, EN and lay rate, PD (%)), sexual maturity (SM, age at onset of lay), and egg quality traits [shell strength (PS, g-force); egg weight (EW, g); albumen height (AH, mm); eggshell color (CO, index using the three parameter L-a-b from the Minolta Chromameter system); and external egg defects (Def, in percent of total eggs produced)]. These traits were also calculated as the mean progeny average for each pure-line sire family across generations and genetic lines [21].
Table 1. Primers used for amplification and sequencing
Statistics
Two sets of statistical models were tested for association between Mx genotype and mortality and performance traits for each genetic line. The first model tested the SNP’s allele substitution effects (ASE) for multiple SNP and genetic lines. In this model, the effects of generation (test) and the number of copies of the reference allele for each tested SNP were fit. The second model fit the effects of generation (test) and haplotype for each trait, and was used to test the overall haplotype effect on each trait within line. For those cases with a significant effect, LSM (Least Squares Means) were calculated and separation and test between haplotypes was performed. Analyses were carried out using JMP 11.0 (SAS Institute Inc).
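The first model can be sketched in a few lines. The analyses in this study were run in JMP; the code below is only an illustrative re-expression of the same idea and reports no significance test. It estimates the slope of a trait on the number of copies of the reference allele while absorbing the generation effect by centering within generation, which gives the same slope as fitting generation as a categorical effect. The function name and the toy values are hypothetical.

```python
from collections import defaultdict

def allele_substitution_effect(trait, generation, allele_count):
    """Estimate the change in trait per extra copy of the reference allele.

    Model: trait = generation effect + b * allele_count + error.
    The generation effect is removed by centering trait and allele count
    within each generation; b is then the pooled within-generation slope.
    """
    groups = defaultdict(list)
    for y, g, x in zip(trait, generation, allele_count):
        groups[g].append((x, y))
    sxy = sxx = 0.0
    for pairs in groups.values():
        x_mean = sum(x for x, _ in pairs) / len(pairs)
        y_mean = sum(y for _, y in pairs) / len(pairs)
        sxy += sum((x - x_mean) * (y - y_mean) for x, y in pairs)
        sxx += sum((x - x_mean) ** 2 for x, _ in pairs)
    if sxx == 0:
        raise ValueError("allele count does not vary within any generation")
    return sxy / sxx

# Toy sire-family means (hypothetical): two generations, 0/1/2 allele copies
toy_trait = [10.0, 12.0, 14.0, 12.0, 14.0, 16.0]
toy_generation = [1996, 1996, 1996, 1997, 1997, 1997]
toy_copies = [0, 1, 2, 0, 1, 2]
print(allele_substitution_effect(toy_trait, toy_generation, toy_copies))  # 2.0
```

The toy data were built so that each extra copy adds exactly 2 units within every generation, so the estimate is 2.0 even though the generation means differ.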
Analysis of evolutionary selection
Full length nucleotide coding sequence for each of the 12 haplotypes from the commercial lines identified here, plus 53 additional chicken Mx sequences representing multiple breeds reported in GenBank, were analyzed for evidence of recombination as well as site-by-site selection using the Datamonkey webserver (http://www.datamonkey.org) [22]. Briefly, the best fit model (010010) with AIC of 8117.059265150644, also known as the HKY85 model, was determined, followed by analysis to identify individual sites under diversifying or purifying selection using six different methodologies: Mixed Effects Model of Evolution (MEME), Fast Unconstrained Bayesian AppRoximation (FUBAR), Single Likelihood Ancestor Counting (SLAC), Fixed Effects Likelihood (FEL), and Random Effects Likelihood (REL) using Genetic Algorithm for Recombination Detection (GARD) inferred trees. Accession numbers for these additional 53 sequences are: AB088533, AB088534, AB088535, AB088536, AB244818, AY695797, DQ316779, DQ788613, DQ788614, DQ788615, DQ788616, EF575608, EF575609, EF575610, EF575611, EF575612, EF575613, EF575614, EF575615, EF575616, EF575617, EF575618, EF575619, EF575620, EF575621, EF575622, EF575623, EF575624, EF575625, EF575626, EF575627, EF575628, EF575629, EF575630, EF575631, EF575632, EF575633, EF575634, EF575635, EF575636, EF575637, EF575638, EF575639, EF575640, EF575641, EU348752, GQ390353, HM775376, HQ014737, HQ014738, HQ014739, HQ014740, Z23168.
Structural modeling
The 12 Mx haplotypes identified in the nine elite lines reported here were each analyzed using the RaptorX protein structure prediction server (raptorx.uchicago.edu) which identified protein data bank record 3szr [23] as the most likely structural match. The resulting predicted 3D structures of the 12 chMx haplotypes were visualized using Swiss PDB Viewer 4.1 (http://spdbv.vital-it.ch/).
Animals
The protocols for all experiments involving the collection of blood samples and phenotypic observations used in this study were reviewed and approved by the Institutional Animal Care and Use Committee (IACUC) at Hy-Line International.
Results
Combining the information from the de novo sequencing and the NextGen resequence data of the elite commercial lines revealed multiple SNP within the Mx gene and its immediate upstream region (Table 2). SNP identified within the 400 bp upstream of the start of exon 1 and exonic SNP that changed the amino acid codons were studied in detail. These SNP are listed in Table 2, along with their location within galGal4 from UCSC genome browser (genome.ucsc.edu), their nucleotide change (galGal4 > variant), their codon with amino acid change (if applicable) and the affected predicted protein domain. If the SNP has been previously reported, the source is indicated. Six SNP were identified in the promoter region (4,787 bp upstream of the ATG start in exon 2); these are labeled with the prefix ‘‘P’’. Among these six SNP, one (MxP-55) lies within the interferon-stimulated response element (ISRE) region previously described by Schumacher et al. [24] as essential for Mx gene expression. Of the remaining five, one (MxP-18) falls within a putative TATA-like element, two (MxP-136 and -142) are located within a possible SP-1-like binding site [25,26], and the other two (MxP-158 and -224) are not associated with any known or proposed functional elements (Figure 1). Within the 140 bp that make up the 5' untranslated region, six SNP were identified, two in exon 1 and four in exon 2. These are labeled with the prefix of ‘‘Mx5U’’ (for 5' untranslated) with the numeric label indicating how many bases they are from the beginning of the RNA transcription initiation (Table 2 and Figure 1). Within the actual coding region of the Mx gene, 24 SNP were found within the nine elite lines (Table 2). These SNP were then confirmed by SNP-PCR. There were four additional SNP (indicated by * in Table 2) that were reported in the literature [3,5,8,13] but not found to be segregating within the populations in this study.
The coding-region SNP were named based on the affected nucleotide position relative to the ATG start codon of Mx. Multiple SNP were also found within the introns of Mx. These were not genotyped except when necessary to identify the regions of recombination. In total, three novel SNP were identified in the nine elite lines, one in the 5' UTR, and two nonsynonymous substitutions (dNS) SNP located in the distal stalk region (Table 2).
Haplotypes
Each line was genotyped for 33 SNP on 135 to 200 animals per generation from widely separated generations. This resulted in the identification of 12 Mx haplotypes across all the lines (Table 3), eight of which, to the best of our knowledge, have not been previously reported. The haplotype for the reference genome (galGal4, UCSC) is provided for comparison.
Mx Variation
Each elite line contained from one to four haplotypes (Table 4). Within the elite stocks tested here, many of the haplotypes appear to be breed specific, with the exception of haplotype Mx-H04, which was found in both the RIR and WPR breeds. Investigation of the historical haplotype segregation within the nine elite lines from 1995 to 2010 indicated that five of the eight lines that were segregating for Mx haplotypes had significant changes in Mx haplotype frequency during this time.
Table 2. SNP genotyped, their location within the gene, position within galGal version 4, nucleotide change, and codon affected, and the MX protein domain involved.
Mx Recombinants
Close examination of the SNP composition of each haplotype revealed that two haplotypes appeared to be the result of within-line recombination events (Table 3). Haplotype Mx-H02 was found in low frequency in only one line (WL-04). It is a recombination of the two major haplotypes within line WL-04, having the same SNP composition for the promoter region through SNP MxCDS351 (exon 4) as Mx-H01 and the same SNP composition from SNP MxCDS992 (exon 8) to the end of the gene as Mx-H05. The seven intervening exonic SNP are identical between the two parental haplotypes. Further genotyping with the numerous intronic SNP that differ between these two haplotypes allowed the actual recombination region to be narrowed down to a 450 bp region within intron 5 (data not shown). The de novo occurrence of this haplotype was tracked to a female from the 2003 generation. It has gradually increased in frequency since that time, reaching 0.04 by 2010. The low number of individuals with this haplotype is insufficient to determine if there are any trait associations, though the continued increase in frequency is suggestive of a selective advantage.
A second recombinant was identified in line WL-01 and appears to be the result of recombination between the two major haplotypes within that line (Table 3). Haplotype Mx-H12 is identical to Mx-H08 from the promoter through SNP MxCDS1248 (exon 10) and identical to haplotype Mx-H01 from SNP MxCDS1643 (exon 14) through to the end of the gene. The intervening exonic SNP are identical between the two parental haplotypes. The use of intronic SNP that differ between the two parental haplotypes narrowed the identification of the actual recombinant region to a 414 bp region of intron 11. Haplotype Mx-H12 has been maintained at a low frequency (<0.04) since the archive DNA collection was initiated in 1996, thus the original progenitor could not be identified.
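The reasoning used to recognize both recombinants (a haplotype that matches one parental haplotype up to some SNP and the other parental haplotype afterwards, with the crossover localized only by SNP at which the parents differ) can be sketched as follows. The function name and the toy allele strings are hypothetical; they are not the Table 3 haplotypes, and this is an illustration rather than the method used in the study.

```python
def recombination_interval(left_parent, right_parent, candidate):
    """Test whether candidate equals left_parent up to a single crossover
    and right_parent afterwards.

    All inputs are equal-length allele strings ordered along the gene.
    Returns (last_left, first_right), the indices of the informative SNP
    that bracket the crossover, or None if one crossover between these
    parents cannot explain the candidate. Only SNP at which the parents
    differ are informative.
    """
    # Every allele must be present in at least one parent at that position
    if any(c not in (a, b)
           for a, b, c in zip(left_parent, right_parent, candidate)):
        return None
    informative = [i for i, (a, b) in enumerate(zip(left_parent, right_parent))
                   if a != b]
    # Label each informative SNP by the parent it matches
    pattern = "".join("L" if candidate[i] == left_parent[i] else "R"
                      for i in informative)
    n_left = pattern.count("L")
    single_crossover = pattern == "L" * n_left + "R" * (len(pattern) - n_left)
    if 0 < n_left < len(pattern) and single_crossover:
        return informative[n_left - 1], informative[n_left]
    return None

# Toy example: parents differ at SNP 0, 1, 5, 6 and share SNP 2-4
parent_1 = "AACCGTT"
parent_2 = "GGCCGAA"
print(recombination_interval(parent_1, parent_2, "AACCGAA"))  # (1, 5)
```

In the toy example the crossover can only be placed somewhere between SNP 1 and SNP 5, because SNP 2 to 4 are identical in both parents. This mirrors why additional intronic SNP that differ between the parental haplotypes were needed to narrow the recombination regions.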
Trait Associations
Trait Associations with SNP (ASE). Those Mx SNP that had a significant association with phenotype are summarized in Table 5, along with the size of the effect and which SNP allele was favorable. Allele substitution effects were found in 4 lines. For two mortality traits, allele specific effects were found with five SNP (MxP-55, MxCDS62, MxCDS122, MxCDS694, and MxCDS1015) in four lines (WL-02, WL-03, WPR-01 and RIR-01). The most consistent association was for MxCDS122, which showed significance in three lines, and in one line (WL-03) the ASE was significant for progeny mortality in both the MDV challenge test and during the grow/lay period.
The ASE on performance traits also identified several SNP with significant effects in three lines. Most of the significant associations were for egg production (either egg number or lay rate) and egg shell color (seen in two lines). Significant ASE for egg weight and albumen height was seen in two lines, whereas shell puncture resistance and egg defects each had only one instance of significant association.
Trait Associations with Haplotypes. Significant haplotype effects found in each line are summarized by trait in Table 6. The favorable haplotype is indicated first, and the size of the effect for the favorable vs alternate haplotype is given. Significant associations with mortality (MDV challenge and during the grow/lay period) were found in one line (WL-02). The size of the effect for mortality due to Marek’s Disease Virus was a decrease in progeny mortality of 5.21% for haplotype H11 vs H05 that was found in the sire and a decrease in progeny mortality of 1.8% during the grow/lay period in commercial environments.
Figure 1. Location of SNP within the promoter and untranslated region of chicken Mx. galGal4 Mx promoter region starting 400 nt upstream of the RNA transcription initiation site through the first 140 nt of the RNA transcript (excluding 4,647 nt from intron 1) are shown above. The previously described ISRE (-52 to -63) as well as other potential functional elements (ISRE2, -282 to -293; SP1-like element, -135 to -142; TATA box-like element, -19 to -12) are underlined. The ‘‘GAAA’’ motif found repeated in many IFN regulated gene promoter regions is shown in bold. SNP found in the 9 elite lines that differ from galGal4 are shown under the reference sequence. Additionally, the RNA transcription initiation and Mx 5'UTR, comprised of exon 1 and the first 92 nt of exon 2, are shown in italics. doi:10.1371/journal.pone.0108054.g001
Trait association between haplotypes and performance traits of progeny showed at least one significant association (p<.05) in four of the lines. Haplotype association with shell color was found in two lines and was highly significant (p<.0001) in both lines. Haplotype 11 showed consistent advantage for four traits (two mortality and two performance traits).
Evidence of Selection
Full-length coding sequences of the 12 Mx haplotypes identified herein were aligned with 53 additional chicken Mx sequences obtained from GenBank. Overall, these sequences resulted in the identification of 72 SNP within the Mx cds (data not shown). These sequences were then analyzed for individual codons with evidence of either purifying (synonymous substitutions (dS) > dNS) or diversifying (dNS > dS) selection using various models. Twenty codons were identified by at least one of the six models, 11 with evidence for diversifying selection and nine with evidence of purifying selection (Table 7). Of these 20, 16 codons correspond with SNP identified in the nine elite lines, and encompass five of the seven SNP associated with performance traits (Table 5). Of the two remaining SNP, one (MxP-55) is located in the promoter and therefore was not analyzed for codon selection, and the second (MxCDS694) corresponds with codon 232, which has evidence of being under purifying selection via MxCDS696.
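The distinction between synonymous (dS) and non-synonymous (dNS) SNP that underlies these analyses can be made concrete with the standard genetic code. The sketch below classifies a single-base change within a codon; it is an illustration only and does not reproduce the Datamonkey selection tests. The codon `AGC` used for serine at position 631 is an assumption made for the example (the text specifies only a G to A change producing S631N), and the function name is hypothetical.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def classify_snp(codon, offset, alt_base):
    """Classify a single-base substitution within a codon.

    codon: reference codon (DNA, coding strand); offset: 0, 1 or 2;
    alt_base: the variant base. Returns (ref_aa, alt_aa, "dS" or "dNS").
    """
    alt_codon = codon[:offset] + alt_base + codon[offset + 1:]
    ref_aa, alt_aa = CODON_TABLE[codon], CODON_TABLE[alt_codon]
    return ref_aa, alt_aa, "dS" if ref_aa == alt_aa else "dNS"

# Assumed codon AGC (Ser); G>A at the second position gives AAC (Asn)
print(classify_snp("AGC", 1, "A"))  # ('S', 'N', 'dNS')
# A third-position change in the same assumed codon is silent
print(classify_snp("AGC", 2, "T"))  # ('S', 'S', 'dS')
```

A second-position G to A change in a serine codon of the form AGC or AGT always yields asparagine, which is consistent with the S631N variant, whereas many third-position changes leave the amino acid unchanged.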
SNP Association With Mx Structural Elements
The RaptorX protein structure prediction server [27] was used to infer the location of SNP within potential structural regions. The crystal structure of the human MxA protein reported by Gao et al. [23] was identified as the best fit. Based on this tertiary structure, a similar 3D structure was predicted for the chicken Mx protein (Figure 2). It should be noted that the structure reported by Gao et al. [23] starts at Tyr45 within the hsMxA sequence and has four regions within the G-domain and one in the loop L4s that were not resolved. Also, as the chicken Mx has approximately 40 additional N-terminal amino acids not found in the mammalian Mx proteins [28], the structure predicted by the RaptorX server starts with Ser84. The chicken Mx protein is predicted to have a similar number of alpha helices, beta sheets, and loop regions (Figure 2 and 3). Examination of the location of these dS and dNS SNP within the Mx protein structure demonstrate that these changes tend to be distributed across the whole sequence, but with a tendency for the dNS SNP to be located at the N-terminal end and the dS SNP more concentrated in the middle domain (MD) (Figure 3).
Examination of these different SNP within the context of functional domains demonstrated several dS and dNS SNP surrounding the GTPase active site within the G-domain (Figure 3 and 4A). While this structure was not fully resolved for MxA [23], six SNP that were found to be associated with performance traits or under selection appear to be located within the G-domain and are clustered around the GTPase active site (Table 5 and Figure 4A). Of these six sites, two were associated with both performance traits and selection (codon 117 and 232). Interestingly codon 232 contains two SNP. SNP MxCDS694 is associated with a dNS change and was found to be associated with performance traits, whereas SNP MxCDS696 does not result in an amino acid change and was associated with purifying selection (Table 5 and 7).
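Because the coding-region SNP are named by nucleotide position relative to the ATG start codon, the codon each one falls in follows from simple arithmetic. The short sketch below assumes that the A of the ATG is position 1; that convention is an assumption, although it is consistent with the codon numbers quoted in the text. The function name is hypothetical.

```python
def codon_position(cds_pos):
    """Map a 1-based coding-sequence position (A of ATG = 1) to
    (codon number, position within the codon), both 1-based."""
    return (cds_pos - 1) // 3 + 1, (cds_pos - 1) % 3 + 1

for name in ("MxCDS694", "MxCDS696"):
    pos = int(name[len("MxCDS"):])
    print(name, codon_position(pos))
# MxCDS694 (232, 1)
# MxCDS696 (232, 3)
```

Both SNP map to codon 232, at its first and third positions respectively, which fits the observation that MxCDS694 changes the amino acid while MxCDS696 is silent.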
Table 3. Mx haplotypes identified in the 9 elite lines as determined by 6 promoter, and 34 exon SNP
Table 4. Mx haplotype frequency changes over time by line
The end of the G-domain is defined by the conserved P384 that forms hinge 2 (Figure 2) and marks the start of the second of three bundle signaling element (BSE) regions [23]. The three BSEs contain a high number of hydrophobic residues, encompassing between 35–42% of amino acids in this region and appear to interact (Figure 4B). As was described for hsMxA, there appeared to be more interactions between α2B and α3B than with α1B, with L401, which was found to be under purifying selection (Table 7), interacting with the leucine residues of the leucine zipper in α3B (Figure 4B) [23].
After the central BSE region, the protein forms a large stalk region comprised of the middle domain (MD) and the GTP effector domain (GED) (Figure 4C). From the loop region (L1BS) that forms the transition between BSE2 and stalk there are 4 α-helices and 3 loop regions (α1NS, L1S, α1CS, L2S, α2S, L3S, and α3S) that make up the MD portion of the stalk. This is followed by loop L4S, which serves as the transition between the MD and GED regions of the stalk. The GED then contains an additional 2 α-helices and 1 loop (α4S, L5S, and α5S) before a final loop that connects the stalk with BSE3 (L2BS) (Figure 4C). This region plays key roles in Mx oligomerization [23,29] and virus specificity [30]. Comparing the amino acid sequences of chicken and hsMxA (Figure 3), one observes that many of the residues reported to be important for oligomerization are clustered in α1NS, L1S, and L2S of the MD as well as the C-terminal end of α4S, L5S, and α5S of the GED. These regions correspond with the majority of chMx codons identified as being under purifying selection (Figure 4 and Table 7).
Alternatively, four codons in the stalk region were identified as being under diversifying selection (Table 7). Three of these dNS sites are located in the α2S, L3S, α3S region of the MD, with the one remaining site (S631N) located in α4S of the GED (Figure 3 and 4C). Only two (A548V and S631N) of these four sites were found to differ among the nine elite lines examined as part of this study (Table 7). The other two sites were identified based on sequence alignments including all full-length chMx sequences.
Discussion
The Mx genes, and the large GTPase protein they encode, are among the best-studied interferon-stimulated antiviral effector molecules. Their identity, and even their name, is based on their ability to inhibit virus replication, specifically influenza virus. Roughly 10 years after the Mx genes of chickens and ducks were first identified and reported to have no antiviral activity [3,4], studies by Ko et al. [5] examined 15 different breeds and identified 25 SNP resulting in 19 different haplotypes. These 19 haplotypes were then cloned and expressed in mammalian cell lines to assess their antiviral activity. The results of these analyses suggested that some chicken Mx alleles may have antiviral activity and this putative activity appeared to be conferred by the SNP MxCDS1892 (note: Ko et al. reported this SNP as 2032 as they numbered from the start of the mRNA), resulting in a change from serine to asparagine at amino acid 631 [5].
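The two numbering schemes for this SNP can be reconciled with a one-line conversion. The sketch below assumes that the offset between mRNA-based and ATG-based numbering equals the 140 nt 5' untranslated region reported in the Results; the source does not state the offset explicitly, but the difference between 2032 and 1892 is exactly 140. The names used are hypothetical.

```python
UTR5_LENGTH = 140  # nt of 5' UTR (exon 1 plus the first 92 nt of exon 2)

def mrna_to_cds(mrna_pos, utr5_length=UTR5_LENGTH):
    """Convert a 1-based mRNA position to a 1-based position counted
    from the A of the ATG start codon (assumed offset = 5' UTR length)."""
    return mrna_pos - utr5_length

cds_pos = mrna_to_cds(2032)
codon = (cds_pos - 1) // 3 + 1
print(cds_pos, codon)  # 1892 631
```

Under that assumption, position 2032 in the mRNA numbering corresponds to MxCDS1892, which lies in codon 631, in agreement with the S631N designation.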
Table 5. Mx SNP with a significant allele substitution effect, and allele with favorable effect by trait.
Table 6. Mx haplotypes with significant effect.
Table 7. Evidence of sites within the chicken Mx under selection.
Figure 2. Predicted chicken Mx structure. The chicken Mx sequences were each analyzed using the RaptorX protein structure prediction server. These results identified the crystal structure of human MxA (PDB ID: 3SZR) as the most closely related to the chicken Mx sequence. These results were then visualized using PolyMol, and the regions of the HsMxA (aa#s) and chicken Mx (aa#s) compared. As before the GTPase Domain is shown in orange, the bundle signaling elements are in red, and the stalk region, which is comprised of the central dynamin region and the GTPase effector domain, is shown in green and blue respectively. The conserved proline residue that forms the hinge between the G-domain and the second BSE is shown in black. doi:10.1371/journal.pone.0108054.g002
Since this initial observation, several laboratories have surveyed various poultry populations and reported over 72 potential SNP either in the literature or in the GenBank database (data not shown). However, most efforts have focused on MxCDS1892, which is often referred to as the ‘‘resistance allele’’ [9]. Surveys of various native, commercial, and laboratory strains of chickens have reported rates of the ‘‘resistant allele’’ ranging from 59.2% to 72.4% and have suggested that the native breeds have a higher frequency of the ‘‘resistant allele’’ than commercial production birds [9,31,32]. Limited information, if any, was presented on haplotype information within these breeds.
This current study surveyed and calculated the frequency for SNP found within nine elite commercial egg production lines and analyzed them for association with various mortality and performance traits. Out of the 36 SNP identified in these genetic lines, seven SNP were significantly associated with one or more traits; however, interestingly MxCDS1892 was not among those seven. Examination of each of these seven SNP and the favorable allele for each trait indicates that, within a given line, the same SNP may be associated with multiple traits but have different favorable alleles for each; making it difficult, if not impossible, to understand its true biological significance.
There are multiple reports of Mx variation in different chicken lines. These previous studies attempted to correlate ‘‘functional’’ variants to observations of resistance or susceptibility to viral infection. While this approach is often a first step in understanding how a specific sequence is associated with a trait of interest, it does not account for the context of the variants [33]. In actuality SNP are not independent of one another, due to linkage disequilibrium. The association of a haplotype, or functional block of sequence, is the proper approach to determine associations with complex phenotypes such as viral response. The increased significance of the haplotype approach in association studies has been shown with ApoE variants and its association with Alzheimer’s disease, in which the haplotype structure analysis identified the causative protein variant [34] and in transmembrane xenobiotic transporters with two or more amino acid variants [35]. These examples are structurally analogous to Mx, where the haplotype structure is vital to the consideration of functional variants’ association with viral response [23].
Figure 3. Amino acid alignment of the HsMxA and galGal4. The different Mx functional domains are represented in colored text (BSE = red, G-domain = orange, Middle stalk domain = green, GTPase effector domain = blue) with the N-terminal region not represented in the structure shown in italics, and the loop regions that connect BSE2 to MD (L1BS) and MD to GED (L4S) in plain text. Positions in orange bold represent the conserved GTPase enzymatic domain and underlined orange text denotes the GTP binding site. Secondary structural elements as described by Gao et al. [23] are indicated above the HsMxA sequence. Alpha helices are represented by red and striped bars. Beta strands are represented by arrows. Amino acids with similar numbers above the alignment indicate positions described to interact during oligomerization. HsMxA positions labeled with ‘‘#’’ indicate amino acids that form hydrogen bonds with the backbone of the α3B, and ‘‘!’’ denotes additional positions described to be involved in oligomerization. ‘‘+’’ above the hsMxA indicates amino acids described by Mitchell et al. [30] as under diversifying selection among primate MxA sequences. Amino acid positions in the chMx associated with dNS changes are reflected by the alternate amino acid under the chMx sequence. Positions associated with dS nt changes are indicated by ‘‘^’’. chMx positions with evidence of diversifying (dNS) or purifying (dS) selection are indicated with ‘‘+’’ or ‘‘-’’ under the chMx sequence. doi:10.1371/journal.pone.0108054.g003
Mx has at least 4 functional domains, each playing a key role in the protein’s ability to exert an antiviral function. The GTPase activity in the N-terminal G-domain has long been recognized as required for Mx activity [16]; however, the exact mechanism by which GTPase activity disrupts virus replication is still unknown. In studies by Schusser et al. [14], chicken Mx was cloned and expressed from chicken embryo fibroblast cells from White Leghorn type chickens genotyped as homozygous for the resistance allele (MxCDS1892-A) or the susceptible allele (MxCDS1892-G). In addition to reporting no difference in antiviral activity, they also reported no detectable GTPase activity. In the current study, White Leghorn type chickens have at least 4 haplotypes with MxCDS1892-A and 3 with MxCDS1892-G, at least 5 different SNP combinations within the G-domain, and no alleles that differ only at MxCDS1892. Currently it is unclear how many, if any, of the different chicken Mx haplotypes have GTPase activity, or which variants within the G-domain may affect GTPase activity. Curiously, the SNP identified as under selection and/or associated with performance traits appear to be located around the edge of the GTPase active site.
In addition to the GTPase activity, Mx functions as part of a large complex oligomer dependent on key secondary and tertiary structural elements. These oligomers are made up of 16 Mx dimers that form a large ring around viral ribonucleoprotein complexes, wherein the G-domain’s enzymatic activity delivers its antiviral effects [23]. Studies by Gao et al. [23,29] have begun to elucidate the key amino acids critical for the formation of this complex quaternary structure for human MxA and have determined that most of these residues are located within the MD and GED regions. The GED region, and specifically loop 4, also appears to play a role in defining Mx viral specificity [30]. Across the 9 elite lines examined here, 9 SNP were identified in this region, 4 dNS and 5 dS. The majority of the dS SNP appear to be in close proximity to residues described to be important for oligomerization of human MxA, and even include a dS SNP (MxCDS1248) that was associated with performance traits. The GED is also where the “resistance allele” is located (MxCDS1892). The overall significance of these SNP variants for Mx function, either individually or within a haplotype, is still unclear. Given the numbers of SNP identified across genetic lines, evaluation of chicken Mx functionality will require better consideration of the haplotypes instead of SNP in isolation.
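To make the dS/dNS distinction concrete, the sketch below maps a coding-sequence position to its codon and classifies a single-base change as synonymous (dS) or nonsynonymous (dNS) using the standard genetic code. The helper names and the example codons are hypothetical; in particular, the codon AGC is assumed purely for illustration, and the actual chMx codon sequence is not taken from this paper.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_index(cds_pos):
    """Map a 1-based CDS position to (1-based codon number, 0-based offset)."""
    return (cds_pos - 1) // 3 + 1, (cds_pos - 1) % 3


def classify_snp(ref_codon, offset, alt_base):
    """Return (class, reference amino acid, alternate amino acid)."""
    alt_codon = ref_codon[:offset] + alt_base + ref_codon[offset + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    return ("dS" if ref_aa == alt_aa else "dNS"), ref_aa, alt_aa


# CDS position 1892 falls in codon 631, at the second codon position.
codon_number, offset = codon_index(1892)

# Hypothetical reference codon AGC (Ser) with a G-to-A change at that position.
g_to_a = classify_snp("AGC", offset, "A")

# Hypothetical third-position change that leaves the amino acid unchanged.
silent = classify_snp("GGT", 2, "C")
```

Under the assumed codon, the G-to-A change at the second codon position replaces serine with asparagine and is therefore classed as dNS, whereas the third-position example is dS.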
Figure 4. Ribbon structure of the three main functional domains of the Mx protein. (A) Ribbon structure of the predicted chMx G-domain. The amino acids associated with the GTPase active site are shown in light gray and those associated with GTP binding in dark gray. The amino acids found to be under selection and/or associated with traits are depicted in black (dS) or yellow (dNS). Amino acid 232 had 2 SNP associated with it, one dS and one dNS, and is shown in red, white and blue. The amino acid and position numbers are next to each selected site, with the alternate amino acid indicated in parentheses if applicable. Positions that were associated with both traits and selection are denoted with “*”. (B) Ribbon structure of the interacting BSE elements based on the predicted structure. Position 401, with evidence of purifying selection, is shown in black along with the conserved hydrophobic residues associated with the leucine zipper (red) interactions. (C) Ribbon structure of the predicted stalk domain. The amino acids found to be under selection and/or associated with traits are depicted in black (dS) or yellow (dNS). The structure was rotated 180° (top vs bottom) in order to visualize all the affected positions in the stalk. doi:10.1371/journal.pone.0108054.g004
Historical analysis of the haplotype frequencies within the lines evaluated herein indicated that there has been a significant shift in the haplotypes present within six of these lines. Simultaneously, these lines are under intensive selection for numerous traits related to egg production, general animal health, and resistance to MDV. The change in frequency of specific haplotypes is correlated with genetic progress in these lines, suggesting specific advantage of certain haplotypes. The associations found between Mx haplotype and various production traits are interesting. Many common avian viral diseases are known to cause mild to severe reduction in egg production, decrease appetite, depress the immune system and affect the physiology of the reproductive tract. Anti-viral properties of Mx variants could be providing enhanced resistance to viruses routinely encountered throughout the lifecycle of a bird, consequently providing a slight overall improvement in performance.
Review of previous reports of chicken Mx sequence diversity and its functional role in viral resistance provides few conclusive answers. The current work has focused on developing a comprehensive understanding of the significance of sequence diversity of the Mx gene in multiple lines of chickens with multigeneration genotypes and extensive production trait information. Among these lines, additional sequence variants were identified that had not been described previously and, more importantly, new discrete haplotypes were observed whose frequency appears to be under selection across multiple generations. Thus it is apparent that in future studies of chicken Mx the complete haplotype of the gene should be considered as the functional unit rather than a single SNP.
In addition to the haplotype, it is also important to understand these differences in the context of their location within the three-dimensional structure of the mature protein. The important functional components of the mature protein can be identified either in structural modeling studies or by evaluating the selection pressures on individual residues as described. These variant sites may provide insight into how Mx functions in the response to virus. The degree of variation contained within the chicken Mx, particularly within commercial stocks, runs counter to preconceptions that commercial stocks are highly inbred with limited variability. These levels of diversity within the commercial lines provide vast opportunities for subsequent functional studies. The identification of two novel recombinants within these lines indicates that novel variation does arise and can be maintained within highly selected commercially utilized genetic lines.
Collectively, these data represent the most exhaustive survey of genetic diversity within the Mx gene of commercial layer-type chickens. In addition to the identification of novel SNP, these data report the association of Mx SNP with both disease resistance and performance traits, and highlight the need for a better understanding of the haplotypes formed by all of the SNP. Mx is a large, complex protein with multiple functional domains. Each domain plays a role in the Mx oligomer and its ultimate function in the host. Understanding how these various SNP and haplotypes interact with each other to function properly in the cell will be key to our understanding of the role Mx plays in the interferon-mediated response to viral infections in chickens.
Author Contributions
Conceived and designed the experiments: JF JA EB CA PS NO MK. Performed the experiments: JF JA EB RA AL CA PS MK. Analyzed the data: JF JA CA PS NO MK. Contributed reagents/materials/analysis tools: JF JA CA PS NO MK. Wrote the paper: JF JA EB CA PS NO MK.
References
1. Horisberger MA, Hochkeppel HK (1985) An interferon-induced mouse protein involved in the mechanism of resistance to influenza viruses. Its purification to homogeneity and characterization by polyclonal antibodies. J Biol Chem 260: 1730–1733.
2. Watanabe T (2007) Polymorphisms of the chicken antiviral MX gene. Cytogenet Genome Res 117: 370–375.
3. Bernasconi D, Schultz U, Staeheli P (1995) The interferon-induced Mx protein of chickens lacks antiviral activity. J Interferon Cytokine Res 15: 47–53.
4. Bazzigher L, Schwarz A, Staeheli P (1993) No enhanced influenza virus resistance of murine and avian cells expressing cloned duck Mx protein. Virology 195: 100–112.
5. Ko JH, Jin HK, Asano A, Takada A, Ninomiya A, et al. (2002) Polymorphisms and the differential antiviral activity of the chicken Mx gene. Genome Res 12: 595–601.
6. Ko JH, Takada A, Mitsuhashi T, Agui T, Watanabe T (2004) Native antiviral specificity of chicken Mx protein depends on amino acid variation at position 631. Anim Genet 35: 119–122.
7. Livant EJ, Avendano S, McLeod S, Ye X, Lamont SJ, et al. (2007) MX1 exon 13 polymorphisms in broiler breeder chickens and associations with commercial traits. Anim Genet 38: 177–179.
8. Balkissoon D, Staines K, McCauley J, Wood J, Young J, et al. (2007) Low frequency of the Mx allele for viral resistance predates recent intensive selection in domestic chickens. Immunogenetics 59: 687–691.
9. Sartika T, Sulandari S, Zein MS (2011) Selection of Mx gene genotype as genetic marker for Avian Influenza resistance in Indonesian native chicken. BMC Proc 5 Suppl 4: S37.
10. Wang Y, Brahmakshatriya V, Lupiani B, Reddy S, Okimoto R, et al. (2012) Associations of chicken Mx1 polymorphism with antiviral responses in avian influenza virus infected embryos and broilers. Poult Sci 91: 3019–3024.
11. Ewald SJ, Kapczynski DR, Livant EJ, Suarez DL, Ralph J, et al. (2011) Association of Mx1 Asn631 variant alleles with reductions in morbidity, early mortality, viral shedding, and cytokine responses in chickens infected with a highly pathogenic avian influenza virus. Immunogenetics 63: 363–375.
12. Benfield CT, Lyall JW, Tiley LS (2010) The cytoplasmic location of chicken Mx is not the determining factor for its lack of antiviral activity. PLoS ONE 5: e12151.
13. Benfield CT, Lyall JW, Kochs G, Tiley LS (2008) Asparagine 631 variants of the chicken Mx protein do not inhibit influenza virus replication in primary chicken embryo fibroblasts or in vitro surrogate assays. J Virol 82: 7533–7539.
14. Schusser B, Reuter A, von der Malsburg A, Penski N, Weigend S, et al. (2011) Mx is dispensable for interferon-mediated resistance of chicken cells against influenza A virus. J Virol 85: 8307–8315.
15. Sironi L, Williams JL, Moreno-Martin AM, Ramelli P, Stella A, et al. (2008) Susceptibility of different chicken lines to H7N1 highly pathogenic avian influenza virus and the role of Mx gene polymorphism coding amino acid position 631. Virology.
16. Pitossi F, Blank A, Schroder A, Schwarz A, Hussi P, et al. (1993) A functional GTP-binding motif is necessary for antiviral activity of Mx proteins. J Virol 67: 6726–6732.
17. Kranis A, Gheyas AA, Boschiero C, Turner F, Yu L, et al. (2013) Development of a high density 600K SNP genotyping array for chicken. BMC Genomics 14: 59.
18. Thorvaldsdottir H, Robinson JT, Mesirov JP (2013) Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform 14: 178–192.
19. Semagn K, Babu R, Hearne S, Olsen M (2014) Single nucleotide polymorphism genotyping using Kompetitive Allele Specific PCR (KASP): overview of the technology and its application in crop improvement. Molecular Breeding 33: 1–14.
20. Fulton JE, Arango J, Arthur JA, Settar P, Kreager KS, et al. (2013) Improving the outcome of a Marek’s disease challenge in multiple lines of egg type chickens. Avian Dis 57: 519–522.
21. Wolc A, Arango J, Settar P, O’Sullivan NP, Dekkers JC (2011) Evaluation of egg production in layers using random regression models. Poult Sci 90: 30–34.
22. Delport W, Poon AF, Frost SD, Kosakovsky Pond SL (2010) Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics 26: 2455–2457.
23. Gao S, von der Malsburg A, Dick A, Faelber K, Schroder GF, et al. (2011) Structure of myxovirus resistance protein A reveals intra- and intermolecular domain interactions required for the antiviral function. Immunity 35: 514–525.
24. Schumacher B, Bernasconi D, Schultz U, Staeheli P (1994) The chicken Mx promoter contains an ISRE motif and confers interferon inducibility to a reporter gene in chick and monkey cells. Virology 203: 144–148.
25. Yin CG, Zhang CS, Zhang AM, Qin HW, Wang XQ, et al. (2010) Expression analyses and antiviral properties of the Beijing-You and White Leghorn myxovirus resistance gene with different amino acids at position 631. Poult Sci 89: 2259–2264.
26. Kasai Y, Chen H, Flint SJ (1992) Anatomy of an unusual RNA polymerase II promoter containing a downstream TATA element. Mol Cell Biol 12: 2884–2897.
27. Kallberg M, Wang H, Wang S, Peng J, Wang Z, et al. (2012) Template-based protein structure modeling using the RaptorX web server. Nat Protoc 7: 1511–1522.
28. Berlin S, Qu L, Li X, Yang N, Ellegren H (2008) Positive diversifying selection in avian Mx genes. Immunogenetics 60: 689–697.
29. Gao S, von der Malsburg A, Paeschke S, Behlke J, Haller O, et al. (2010) Structural basis of oligomerization in the stalk region of dynamin-like MxA. Nature 465: 502–506.
30. Mitchell PS, Patzina C, Emerman M, Haller O, Malik HS, et al. (2012) Evolution-guided identification of antiviral specificity determinants in the broadly acting interferon-induced innate immunity factor MxA. Cell Host Microbe 12: 598–604.
31. Li XY, Qu LJ, Yao JF, Yang N (2006) Skewed allele frequencies of an Mx gene mutation with potential resistance to avian influenza virus in different chicken populations. Poult Sci 85: 1327–1329.
32. Seyama T, Ko JH, Ohe M, Sasaoka N, Okada A, et al. (2006) Population research of genetic polymorphism at amino acid position 631 in chicken Mx protein with differential antiviral activity. Biochem Genet 44: 437–448.
33. Clark AG (2004) The role of haplotypes in candidate gene studies. Genet Epidemiol 27: 321–333.
34. Fullerton SM, Clark AG, Weiss KM, Nickerson DA, Taylor SL, et al. (2000) Apolipoprotein E variation at the sequence haplotype level: implications for the origin and maintenance of a major human polymorphism. Am J Hum Genet 67: 881–900.
35. Leabman MK, Huang CC, DeYoung J, Carlson EJ, Taylor TR, et al. (2003) Natural variation in human membrane transporter genes reveals evolutionary and functional constraints. Proc Natl Acad Sci U S A 100: 5896–5901.
36. Li XY, Qu LJ, Hou ZC, Yao JF, Xu GY, et al. (2007) Genomic structure and diversity of the chicken Mx gene. Poult Sci 86: 786–789.