Whiteflies are well known as severe agricultural pests, devastating all sorts of green ornamental and vegetable plants around the world. Whitefly species such as the sweet potato Bemisia tabaci Gennadius pierce and suck the sap, causing direct and indirect damages to various plant species. Bemisia damages the plants at all various moments of their life cycle. Females lay eggs on the inner surface of young leaves. Eggs will give birth to nymphs and adults that feed on the original leaf, releasing a sweet liquid (honeydew), which makes both the leaf and the fruit very sticky. Then a black fungus develops (fumago) and ultimately alters leaf photosynthesis and fruit cosmetic value. Among more than twenty bioforms, the biotype B of B. tabaci is known to cause squash silverleaf; this is how it received its most common name, “silverleaf whitefly” . B expansion now worryingly shifts to another Bemisia biotype (Q), which turns out to become even more and extremely invasive throughout the whole world .
Hence, multiple studies have been initiated in Bemisia to understand better the biology of the different biotypes. The B and Q-biotypes of B. tabaci have the same development pattern, but are rather different in many other aspects including bacterial endosymbiont composition, dispersal behavior, fecundity, insecticide resistance and plant-host preference, among others . They are morphologically indistinguishable share all common characteristics of the developmental processes from eggs to adults through four nymphal instar stages that are usually parasitized by wasps [4–5]. The mechanisms of defense in Q whiteflies have received immediate attention particularly since the biotype is known to develop a strong resistance to a high and large variety of insecticides, including mainly the neonicotinoids [6–10]. Still, we know so far very little about complete molecular mechanisms involved in this process.
Meanwhile, a large protein family called Chemo-Sensory Proteins (CSPs) has been described in various physiological systems of insects including Hemiptera [11–16]. CSPs have been originally described in regenerating legs of the cockroach Periplaneta americana and mature olfactory organs of Drosophila melanogaster, suggesting a function in development and olfaction depending on species [17–20]. CSPs have been proposed to play a role in olfaction as a sort of odor binding protein by delivering hydrophobic sensory molecules to sensory neurons in locusts and ants [21–22]. However, a role in relation with development has been seriously brought up in bees following the result of CSP knock out experiments. In Apis mellifera, the lack of functional CSP has been shown to cause abnormal head development . This is in agreement with the occurrence of CSPs during various stages of the insect development as found in moths [24–25]. This is also in agreement with the very broad tissue expression profile characteristic of this very peculiar protein family [26–32].
In addition to olfaction and development, numerous other physiological functions have been suggested by various studies of the CSP family. The fact that a CSP protein (Mp10) is produced by the saliva of the green peach aphid Myzus persicae and triggers plant physiological defenses has suggested a role for CSP as effector protein rather than in the transportation of hydrophobic odorant chemical molecules . Other similarly fancy experiments in moths have suggested a role of CSP as wetting agent to reduce the surface tension of aqueous sugar solutions and thereby the pressure involved in sucking nectar . In contrast, the observation that expression of two CSP genes is up regulated during the response to bacterial infection in D. melanogaster has suggested some immunological function for this protein family . In agreement with this observation, we have shown in a totally independent study that fourteen CSP genes in the silkworm moth Bombyx mori are significantly up regulated in various nonchemosensory tissues in response to insecticide avermectins . These two studies together from Bombyx to Drosophila are in strong agreement with a role of CSPs in xenobiotic degradation and insect defense [35, 36].
In whiteflies, we previously reported a typical behavior of B. tabaci chemosensory protein-1 (BtabCSP1) in response to high dose of neonicotinoid Thiamethoxam . We also reported genetic variation between B and Q biotypes for BtabCSP2, suggesting the occurrence of biotype-specific CSP mechanisms within aleyrods . These two studies in Bemisia opened the question of what is the function of biotype-specific CSP in insecticide resistance, whether all CSPs from aleyrods have a role in relation with insecticide chemicals, what is the mode of action for those who have the ability to counteract insecticide, and what is the mode of action of others. Here, we described genes and RNAs encoding three different chemosensory proteins in the B and Q biotypes of the aleyrodid whitefly B. tabaci (BtabCSP1, BtabCSP2 and BtabCSP3). We linked these three Bemisia CSPs with other functionally characterized CSPs from other insects and tested their behavior during the insect development and in response to different time-doses of insecticide thiamethoxam. We found in B and Q that CSP1 and CSP2 mainly expressed in adults, while CSP3 expressed in third-instar nymphs. We then observed an extreme fluctuation of the BtabCSP1 gene expression over time points after insecticide treatments specifically in biotype Q, while CSP2 and CSP3 expression levels did not change very significantly. Curiously, however, BtabCSP2 expression fluctuated over time when insects were exposed to the plant leaf; it switched on in B when it switched off in Q. We thus tried to link the Bemisia CSPs with the expression of possibly related genes. Finally, we attempted to find interaction of CSP1, CSP2 and CSP3 with a true volatile or non-volatile cognate chemical ligand.
BtabCSP1, BtabCSP2 and BtabCSP3 characterize Bemisia tabaci Q and B biotypes
Analyzing the EST database from the whitefly B.tabaci , three different sequences significantly related to CSPs were identified and called BtabCSP1, BtabCSP2 and BtabCSP3, respectively (Table 1). BtabCSP1, BtabCSP2 and BtabCSP3 are proteins of 107–110 amino acids with an isoelectric point of 6.61–8.84. Totally, thirty-four amino acids are strictly conserved between the three CSPs including the four cysteines characteristic of the CSP family. The middle part of the BtabCSP proteins (amino acid motif 40–70) is particularly well conserved. BtabCSP1, BtabCSP2 and BtabCSP3 rather differ in the N and C-terminal sequences (Fig 1).
The hydropathy plots of the three proteins are also very similar, showing high levels of hydrophobicity. However, the relative flexibility plots are rather different. Two flexible domains are predicted to occur in the protein BtabCSP1. The flexible domain in BtabCSP2 and BtabCSP3 is predicted to span over different regions (S1A Fig). The protein structure homology models built using MbraCSPA6 as template are very similar, except for C-terminal α-helix (S1B Fig). This was seen as a first indication that BtabCSP1 have different functions than CSP2 and CSP3.
We previously identified numerous substitution sites on the RNA encoding CSP1 in the B and Q biotypes of B. tabaci . In this study, we found a high degree of RNA variance for CSP2, but not for CSP3 (S2 Fig). Some of these substitution sites in CSP2-RNA sequences (KM078671-KM078679) led to crucial changes in amino acid sequences, particularly in the motif between N-terminal α1 and α2-helices (Figs 1 and S1 and S1 and S2 Tables). However, CSP2 clones mostly differed by the C-terminal tail. In clones from B and Q (I74 and I80), an early stop codon led to the truncation of the three last residues ASA (S2 Fig and S1 and S2 Tables). Comparatively, only a few nucleotide replacements occurred on the RNA encoding CSP3 (KM078694-KM078697) although we cloned a mutation (I62-BtabCSP3) in Q that repeated three times the motif TKES in the N-terminal region (S2 Fig and S2 Table).
We thus aimed to identify the three CSP gene structures in selective PCR experiments. Using specific CSP1, CSP2 and CSP3 primers generated genomic DNA (gDNA) fragments of about 1800, 2000 and 5000 bps, respectively. Two gDNA fragments were amplified in Q for CSP2 gene (Fig 2A). Two 2 000 bps-gDNA bands for CSP2 were also detected in Q samples by Southern blot and DIG-labeling of PCR products. Gene diversity was observed also for CSP1. CSP1 probe hybridized to three main gDNA bands (Fig 2B). Using gDNA from ten individuals amplified repeatedly two CSP1 PCR bands for Q, suggesting that the number of CSP genes is different between Q and B biotypes of the sweet potato whitefly B. tabaci (S3 Fig). In contrast, PCR and blot using CSP3 probe identified only one major gDNA band of about 5 Kbs (Fig 2A and 2B).
Despite differences in the number of gene copies, sequencing gDNA encoding for CSP1, CSP2 and CSP3 in B and Q showed that the exon-intron structures were strictly conserved in the two biotypes. CSP1, CSP2 and CSP3 are single-intron genes of about 1500, 1880 and 3360 bps length, respectively (KM078680-KM078693). Intron differs in size, but it is always located at the same position (after the first base of codon for amino acid residue at position 45).
Interestingly, we found widespread biotype B and Q sequence differences not only in gDNA encoding CSP1 and CSP2, but also in gDNA encoding CSP3 in the aleyrodid sweet potato whitefly B. tabaci (Fig 2C). The comparison between B and Q revealed a high number of sites where the gDNA sequence encoding for CSP1 from B differs from the corresponding gDNA sequence of the Q-biotype. The same observation was made comparing between B and Q the gDNA sequences encoding for CSP2 and CSP3, respectively (S4 Fig). Q even showed a very peculiar sequence (Q260) in the mid-region of the CSP2 gene , demonstrating the occurrence of biotype-specific landmarks in B. tabaci CSPs.
BtabCSP1, BtabCSP2 and BtabCSP3 represent three different orthology groups of CSPs
Analyzing amino acid sequences, a particularly high degree of similarity was found not only between whitefly and aphid CSPs, but also between CSPs from Hemiptera and various counterparts mainly from coleopteran, dipteran, lepidopteran and phthirapteran species (S5 Fig). Bemisia CSP1 was mainly related to aphid, fly and mosquito CSPs (about 50% identity, 70% similarity). Bemisia CSP2 found orthologs not only in aphids and flies, but also in beetles, mealy bugs and moths. Bemisia CSP3 was similar (about 70%) to CSPs from Aedes, Culex, Ceratitis, Delia, Drosophila, Musca, Stomoxys and many bark beetle, flour beetle, Pediculus and true bug species (S5 Fig).
A phylogenetic analysis (RAxMLGUI) based on amino acid sequences of BtabCSP1, BtabCSP2, BtabCSP3 and CSPs from other insect species confirmed a clear relationship between BtabCSPs and CSPs from hemipteran and dipteran species (Figs 3 and S5). Using crustacean CSPs as outgroups, all of the CSPs used for the analysis fell on a common “insect” branch with high bootstrap values (94%). Interestingly, we noted that BtabCSP1 fell outside the main group of CSPs similarly to developmental AmelGB10389 gene from honeybees, very distantly related to CSPs described in the olfactory system (Fig 3). In addition, on the phylogenetic amino acid tree, BtabCSP2 clearly grouped with effector protein Mp10 and developmental genes such as AgamCSP5 and MvicOSD, suggesting that these four CSPs play a similar role eventually as effector proteins in both developmental and immunological systems (Fig 3). In contrast to BtabCSP1 and BtabCSP2, BtabCSP3 fell much closer to a large group including in particular locust olfactory CSPs/OSDs, the bee pheromone olfactory ASP3c gene (CAJ01448) and two known developmentally regulated CSP genes, p10 and HvirCSP1 (Fig 3). However, BtabCSP3 did not make a clear orthology group with any of the other CSPs. It even made a singleton despite close relatedness to TcitOSD (CAJ01484), a CSP from the brown citrus aphid, Toxoptera citricida (Fig 3). This strongly suggested a mechanism very specific to whiteflies for this peculiar type of CSP.
BtabCSP1, BtabCSP2 and BtabCSP3 differentially express across various developmental stages of biotypes B and Q
We then compared the two biotypes B and Q of B. tabaci on expression of the three CSP genes identified (BtabCSP1, BtabCSP2 and BtabCSP3).
Under control conditions, CSP1 and CSP2 showed preferential expression in biotype Q, while CSP3 gene expression was preferentially expressed in the biotype B. Slightly higher expression in males was detected for CSP1 and CSP2. No significant differences in CSP3 expression were found comparing males and females (S6 Fig).
Detailed comparison of the gene expression profiles at seven developmental stages of the whitefly showed that CSP1 gene expression significantly increased by a factor of 10–30 at the adult stage in the two biotypes (Fig 4). CSP2 gene expression gradually increased starting at the third-instar stage to reach a 300-fold increase at the adult stage specifically in the biotype Q. Such an increase in CSP2 gene expression across the different developmental stages of B. tabaci was not observed in B biotype (Fig 4). In the two biotypes B and Q, CSP3 gene expression remarkably increased (by a factor of 20–40) in third-instar nymphs (Fig 4).
BtabCSP1, BtabCSP2 and BtabCSP3 differentially express in response to age and thiamethoxam insecticide exposure
Hence, CSP1, CSP2 and CSP3 genes appeared to be differentially regulated across various developmental stages, with CSP1 and CSP2 highly expressed in adults, while CSP3 highly
Fig 3. Phylogenetic analysis of amino acid sequences from B. tabaci CSP1, CSP2, CSP3 and orthologs in other insect species (RAxMLGUI). Aech: Acromyrmex echinatior, Apis: Acyrthosiphon pisum, Alin: Adelphocoris lineolatus, Aaeg: Aedes aegypti, Adar: Anopheles darlingi, Agam: Anophales gambiae, Amel: Apis mellifera, Aluc: Apolygus lucorum, Blun: Biphyllus lunatus, Bter: Bombus terrestris, Bimp: Bombus impatiens, Bmor: Bombyx mori, Cflo: Camponotus floridanus, Cson: Culicoides sonorensis, Dper/Dvir/Dyak: Drosophila persimilis/virilis/yakuba, Dpon: Dendroctonus ponderosae, Gmor: Glossina morsitans morsitans, Hsal: Harpegnatus saltator, Harm: Heliothis armigera, Lmig: Locusta migratoria, Mrot: Megachile rotundata, Mvic: Megoura viciae, Mdir: Metopolophium dirhodum, Mper: Myzus persicae, Npub: Nylanderia pubens, Nlug: Nilaparvata lugens, Nrib: Nasonovia ribis nigri, Phum: Pediculus humanus corporis, Sgre: Schistocerca gregaria, Scal: Stomoxys calcitrans, Tpit: Thaumetopoea pityocampa, Tcit: Toxoptera citricida, Tcas: Tribolium castaneum. Sequences are all published in NCBI. B. tabaci CSP1, CSP2 and CSP3 sequences are from JQ678989-ADG56568, KM078671- KM078679 and KM078694- KM078697, respectively. CSP sequences from Daphnia pulex (DpulCSP1 and DpulCSP2; ABH88167, ABH88166) are used as an outgroup. The triangles indicate the CSP proteins that have been functionally characterized. Black triangles: developmental genes, grey triangles: effector genes, white triangles: olfactory genes.
expressed in nymphs. This observation suggested different functions for Bemisia CSPs in a good agreement with such diversity in RNA variance, gene copy number and predicted structure.
We focused about a function in relation with insecticides and xenobiotics on the basis of the results from Sabatier et al. , Xuan et al.  and Liu et al. [37, 38].
Comparison of the gene expression profiles at five exposure times (1, 4, 24, 48 and 72 h) for five doses (0, 6.25, 12.5, 25 and 50 μg/ml) of insecticide thiamethoxam showed that CSP1 gene expression significantly increased in a time- and dose-dependent manner specifically in the biotype Q (Fig 5). After 1 and 4 h of insecticide treatment, no changes in CSP1 gene expression were noted for all five doses tested in the two biotypes. However, we observed in Q gradual increase in CSP1 gene expression using gradual concentrations of thiamethoxam after 24 h (Fig 5). No effects were seen in Q-whiteflies after 48 h, but stimulatory effects were observed after 72 h exposure to thiamethoxam dosed at 12.5 and 50 μg/ml, triggering an increase by a factor of about 10–25 in CSP1 gene expression (Fig 5). We did not observe such a time/dose effect of thiamethoxam on CSP1 expression in B (Fig 5). We observed no time/dose effect of thiamethoxam on CSP2 and CSP3 expression, neither in B nor Q (Fig 5).
However, we observed in the two biotypes an effect of age on CSP2 expression (100–200 folds increase), while no age-related effects were seen for CSP1 and CSP3 (Fig 5). Age-related effects on CSP2 were clearly opposite in B and Q; CSP2 up-regulated at 4 and 24 h in B, while it clearly up-regulated at 1, 48 and 72 h in Q (Fig 5), clearly illustrating biotype-specific changes with age in CSP2 expression in the sweet potato whitefly B. tabaci.
In order to understand the reason behind the modulation of CSP1, CSP2 and CSP3, we subsequently analyzed gene expression for some possibly related genes such as cytochrome P450 oxidase CYP6CM1 (GQ214539), CYP6CX1 (GQ292715), ecdysone receptor EcR (EF174329), trehalase (JX024261) and juvenile hormone binding protein JHBP (see S7 Fig). We identified B. tabaci juvenile hormone binding protein (BtabJHBP) through the analysis of the whitefly EST database . The EST clone BT-TYLCV-014-1-G6-T3_G06 showed between 30 and 60% identity with members of the JHBP superfamily from various insect species including mainly Acyrthosiphon pisum (XP_001949424) and Reticulitermes flavipes (ADM18966) in a blastx analysis (not shown). We saw that BtabCSP3 and BtabJHBP had coordinate expression in nymphs (see Figs 4 and S7). In addition, we saw no correlation between BtabCSP expression and expression of trehalase and EcR, but we saw increased CYP6CM1 expression in Q and reduced CYP6CX1 expression in B in response to sublethal dose of insecticide thiamethoxam molecule as found for CSP1 (see Figs 5 and S7) .
BtabCSP1, BtabCSP2 and BtabCSP3 proteins differentially bind linoleic acid and cinnamaldehyde
To complete our systemic investigation of CSPs of whitefly, we did a full comprehensive study analyzing not just CSP1, but also functions of CSP2 and CSP3 (Figs 6 and 7 and S8–S11, Tables 2 and 3 and S3 Table).
We first used pure recombinant CSP protein samples, a selected repertoire of ligands and binding assay fluorescence spectroscopy (1-NPN) to analyze the functional properties of BtabCSP1, for which we found increased expression in Q following exposure to thiamethoxam (see Fig 5). In a preliminary binding assay, BtabCSP1 did not display certain binding ability with 25% pure thiamethoxam. Binding of thiamethoxam to CSP1 was found to be rather weak (IC50 = 326.26 μM, Ki = 285.43 μM), so there was not obvious relevance between insecticide binding to CSP protein and thiamethoxam induction (S9 Fig and Table 2).
To CSP1 protein, chemicals such as α-pentyl-cinnamaldehyde and S(-)-limonene showed a binding constant of 12.63±1.3 and 82.21±3.47 μM, respectively (Figs 6 and S9 and Table 2). The IC50 values obtained with other chemicals such as trans-2-hexen-1-ol, linalool, 1-hexanol, geraniol, Z3-hexen-1-ol, α-terpineol, nerolidol, n-hexaldehyde, n-heptane, 2-heptanone, (1S)- (-)-camphor, 6-methyl-5-hepten-2-one, β-ionone, octanoic acid, decanoic acid, cinnamic acid, lauric acid, amyl acetate, methyl salicylate, caryopyllene, myrcene, R(+)-limonene, ocimene, αterpinene and m-xylene were comprised between 14.59±1.75 and 95.08±3.72 μM (Figs 6 and S9 and Table 2). No binding or IC50>100 μM was observed for other ligands including plant odor volatiles. However, in our selected set of 56 chemicals, we observed strong binding affinity of CSP1 to a eighteen carbons long fatty acid chain having three double bonds, i.e. linoleic acid (LA-CSP1: IC50 = 6.70±0.52, Ki = 5.78±0.39 μM; Figs 6 and S9, Table 2 and S3 Table).
We then set out to know the binding ability of BtabCSP2 and BtabCSP3 to most of all chemicals, but more precisely with linoleic acid and α-pentyl-cinnamaldehyde, respectively (Figs 6 and S8–S10 and Table 2). Importantly, no or low binding affinity of CSP2 and CSP3 was found for linoleic acid chemical (LA-CSP2: IC50 >100 μM; LA-CSP3: IC50 = 48.45±3.42, Ki = 43.48 ±2.86 μM; Figs 6 and S10 and Table 2). Even more importantly, both CSP2 and CSP3 proteins displayed significantly much higher binding affinity for the common direct contact toxic chemical of plant oil, α-pentyl-cinnamaldehyde (IC50 = 6.50–11.13±0.37–1.24, Ki = 5.82–10.28 ±0.31–1.17 μM; Figs 6 and S10, Table 2 and S3 Table), showing that CSP1 and CSP2 (and CSP3) from B. tabaci whiteflies have two completely different functions, although both related to detoxification and insect defense.
Finally, we studied the molecular structure of bound CSP1, CSP2 and CSP3 from B. tabaci using docking molecular techniques (Figs 7 and S11 and Table 3). Molecular modelling using multiple alignments that contained all seven CSP templates (1K19, 1KX8, 1KX9, 1N8U, 1N8V, 2JNT, 2GVS) failed to yield molecular models with the large cavity as observed in templates 1N8U and 1N8V. Molecular docking of linoleic acid and α-pentyl-cinnamaldehyde on these models gave binding modes where the ligands bound exclusively on the surface of the proteins
Fig 7. Docking analysis for the functional structure of B. tabaci CSP1, CSP2 and CSP3. Docking results showing (A) binding of linoleic acid to BtabCSP1, (B) binding of α-pentyl-cinnamaldehyde to BtabCSP2 and (C) binding of α-pentylcinnamaldehyde to BtabCSP3. Shown on the left are the cartoon representations of the bound ligands to the CSP models and on the right the Ligplot charts showing the residues that interact with the ligands. Amino acid sequences of mature peptides (after removal of peptide signal) and α-helical profiles for each CSP are provided below each cartoon representation. Residue numberings in these profiles are those that are featured in the LigPlot charts.
with poor energies. Molecular modelling of BtabCSP1, BtabCSP2 and BtabCSP3 with the holo forms of the CSP as templates yielded models that displayed the presence of a large cavity that
could accommodate linoleic acid and α-pentyl-cinnamaldehyde with much improved binding energies (Figs 7 and S11).
The docking showed that linoleic acid could be accommodated into the large cavity of BtabCSP1 after the ligand adopted a U shape (Fig 7). Indeed the aliphatic chain is too long to be accommodated in the cavity if it were to remain straight. The ligand establishes mainly hydrophobic contacts with cavity residues. Noteworthy, a hydrogen bond between the carboxylic group of linoleic acid and the C = O of the backbone of Ala50 was observed in this preferred binding mode (Fig 7). The predicted Kd value was 14.9±8.6 μM (Table 3). We observed several possible orientations of the same U-shaped linoleic acid in this large cavity with binding energy levels that were grossly equivalent. The same kind of binding modes were observed for linoleic acid in BtabCSP2 and BtabCSP3, but the predicted Kd values showed between three fold to six fold weaker binding affinities (Table 3).
Docking results for α-pentyl-cinnamaldehyde showed that all three CSPs could bind the ligand in the large cavity. The interactions are mainly hydrophobic, but clearly involved different sets of residues as shown for BtabCSP2 and BtabCSP3 in Fig 7. BtabCSP3 was predicted to have a higher affinity to the ligand with predicted Kd of 6.45±5.7 μM, which is about two fold higher than for BtabCSP2 and BtabCSP1 (Table 3).
We identified and analyzed the gene structures of three chemosensory proteins from the whitefly Bemisia tabaci (BtabCSPs). Four to twenty-one CSPs exist in the various insect species investigated so far [39–41]. Similarly, our PCR and Southern blot analysis of genomic DNA in the sweet potato whitefly B. tabaci is indicative of a high diversity in CSPs (undiscovered homologs, recently duplicated genes and/or pseudogenes). Therefore, BtabCSP1, BtabCSP2 and BtabCSP3 most probably do not represent the full complement of the CSP gene family in Bemisia. They represent, however, homologous genes sharing 37–43% identity with each other and probably the most abundant CSP-encoding RNAs expressed in the sequence tag database from adults and nymphs in B. tabaci (see Figs 1 and 2 and S1–S4 and Table 1) [4, 37, 38, 42].
Interestingly, the analysis of genomic DNA encoding CSP1, CSP2 and CSP3 allows us to clearly distinguish between the biotypes B and Q in the Bemisia complex (see Fig 2). The Qbiotype is known to have a higher adaptive capacity and in particular a stronger insecticide resistance capacity than the B-biotype [43–44]. Correlatively, we find that BtabCSP1, BtabCSP2 and BtabCSP3 are all three single-intron genes similarly to most of all CSPs, but clearly display biotype-specific hallmarks, particularly BtabCSP2 (see Figs 2 and S2 and S3) . This suggests perhaps a link between the genetic repertoire of CSPs and the insecticide resistance capacities of the insect. Q-whiteflies retain a very specific genetic sequence (Q260) originating from bacteria that may be crucial for this process . Furthermore, Q apparently contains more CSP genes than B (see Figs 2 and S3).
Also interestingly, we show a lot of biotype-specific variants of CSP1 and CSP2 transcripts (see S2 Fig and S1 and S2 Tables) . This probably represents variation at the RNA level. A recent study from Xuan et al. in the silkworm moth B. mori have clearly demonstrated that variants of CSP transcripts do not represent variation at the genomic level, but rather tissue specific RNA mutations resulting in the synthesis of multiple abundant CSP protein isoforms mainly differing by the terminal tails [45–46]. In addition, in the sweet potato whitefly B. tabaci, we show a high number of base substitution sites on the genomic DNA comparing CSP1, CSP2 and CSP3 between the two biotypes. We also show a different PCR profile analyzing CSP-encoding gDNA in Q and B individuals (see S3 Fig). Finally, we show that intron has been inserted or deleted in specific CSP genes during the evolutionary process of the two specific Bemisia biotypes (see S4 Fig). Confirming the results from Liu et al. , all these observations strongly indicate that B and Q B. tabaci biotypes are not truly unique species, subspecies or twin species but rather represent two very distinct species, each governed by very specific physiological systems [47–48].
Curiously, we find a lower degree of RNA and gDNA mutations for whitefly CSP3 compared to CSP1 and CSP2 (see S2–S4 Figs and S1 and S2 Tables). This is in agreement with the hypothesis that the degree of mutation depends on the function the protein has in a specific tissue of the insect . A not striking but significant amino acid sequence similarity is found between these whitefly CSPs and some other CSPs from many various insect species (see Figs 3 and S5). A phylogenetic analysis based on amino acid sequences and implemented in RAxMLGUI shows insertion of BtabCSP1, BtabCSP2 and BtabCSP3 in a very large group of CSPs referring to many various functions. A BtabCSP1 ortholog in Hymenoptera (ASP3c, CAJ01448) found to be abundantly expressed in the antennae of adult honeybees and wasps is also found in high amounts in the hemolymph of drones [37, 50–52]. Another ortholog in bees (AmelGB10389) is involved in tegument development . Orthologs in locusts are not only up-regulated during phase transition, but also highly expressed in antennae and legs at the adult stage [12, 53]. Other orthologs in flies are up-regulated in olfactory organs in adult alate swarmers [54–55], similarly to some orthologs from the alfalfa plant bug Adelphocoris lineolatus (AlinCSP2) . Still, the same such CSP orthologs (pherokines) are found in hemolymph of virus-infected flies , suggesting a very general function for all of these CSPs. However, in our exhaustive study of two whitefly B. tabaci biotypes, we show at first glance that CSP1 and CSP2 preferentially express in Q, while CSP3 preferentially expresses in B (see S6 Fig), suggesting that at least these three CSPs in Bemisia might possibly be involved in some very different biotype-specific functions.
Interestingly, in our analysis of CSP functions in B. tabaci, we show that CSP1 and CSP2 are specific to the adult stage, suggesting a function in reproduction and/or plant odor detection at least for these two CSPs. In contrast, the extremely high levels of CSP3 expression at the third instar stage rather suggests a function in the preparation into adult stages, in particular in the development of sensory organs such as the antennae, head, eyes and various legs (see Fig 4). Many CSPs in other insect species have been shown to be associated with developing sensory organs [17, 23–25]. In addition, specifically in B. tabaci, a CSP from nymphs such as CSP3 could also play a function in wax lipid secretion and/or insect defense against parasitoids [57– 58]. A function of CSPs in insect defense is strongly suggested by the analysis of genes abundantly expressed in the whitefly B. tabaci EST database from infected tissues . No CSP-ESTs are found in the cDNA library of salivary glands in agreement with transcriptomic studies . CSP3-ESTs are found in cDNAs from the instar library, but CSP1 and CSP2-ESTs are specifically found in the libraries of mature adults fed on tomato plants infected by the geminiviruses TYLCV (tomato yellow leaf curl virus) and TOMOV (tomato mottle virus) (see Table 1).
Hence, we set out to correlate the function of Bemisia CSPs to insect defense and in particular to whitefly resistance to neonicotinoid insecticide thiamethoxam. We measured transcript levels of CSP1, CSP2 and CSP3 genes with the treatments of five time points and five doses as well as the expression of genes involved in insecticide resistance . The lifetime of adult whiteflies is of one month in the laboratory conditions. We have only used a population of young adults for our experiments, rejecting the possibility of biological mortal ratio of insects. There are probably variations due to various inter-individual physiological conditions, so we calibrated quantitative measurements of gene expression to a negative control (0 μg/ml) for each time point and for each CSP. The results confirmed previous data from Liu et al. .
There might be that CSP transcripts are regulated by many internal and external factors such as age, biotype, temperature and humidity, background odor, plant chemicals, light, circadian rhythm and season rhythm, etc., but for two time points, we show a clear dose effect of thiamethoxam on CSP1, particularly in biotype Q. CSP1 expression up-regulated 10–20 fold following exposure to sublethal doses of thiamethoxam at 24 and 72 h, but not at 48 h (see Fig 4). Perhaps it takes 48 h after involving CSP1 for full-system recovery to occur; perhaps it takes different durations of exposition to insecticide to recruit multiple resistance genes in a variety of defense mechanisms [61–62]. Time and dose dependent induction of detoxifying genes is usually observed at 24 h . Hundreds genes are differentially expressed at 48 and 72 h post-infected Bemisia in response to fungal infection . Similarly, we show only late responses to exposition of thiamethoxam molecule for CSP1 (see Fig 4). This delay could be due to the time needed for the thiamethoxam molecule to penetrate through the cuticle of adult whiteflies. However, it could also simply be that CSP1 is a late-response gene enrolled during prolonged exposure to xenobiotics. Sublethal doses of insecticide (6.25–50 μg/ml) are known to have variously delayed effects on key biological traits of the insect such as feeding behavior, honeydew excretion and fecundity [65–66]. Therefore, different CSPs could well be involved at different exposure time to protect various tissue-specific physiological pathways.
Nevertheless, in contrast to Q-CSP1, we note only little if any effects of thiamethoxam molecule on CSP2 and CSP3 from Bemisia (see Fig 4), suggesting that these two genes are not typically involved in the insect neonicotinoid response. BtabCSP2 and BtabCSP3 could be involved in the immunological response to another class of insecticides and/or simply have other functions than immunological defense.
It is noticeable in the sweet potato whitefly B. tabaci that relative expression levels of CSP1 and CSP3 were stable over time in series of negative controls (0 μg/ml). Intriguingly, however, we found that CSP2 was turned on in biotype B when it was turned off in biotype Q (see Fig 4). The phenomenon behind such an odd variation in CSP2 gene expression was rather mysterious. The vials were capped with an absorber cotton stopper, so there could not be an air contaminant or any signal released by B to inhibit Q. It could be instead a hormone or some toxic plant chemical that has differential effects on B and Q [67–68]. This was a very interesting point to characterize the CSP2 function by using same methods with CSP1 and CSP3.
In our study in Bemisia whiteflies we see some coordinate expression between CSPs and other gene families such as CYPs and JHBPs over development and/or insecticide exposure (see Figs 4 and 5 and S7). Coordinate expression of CSP1 and CYP6CM1 in Bemisia Q is in agreement with coordinate expression of CSPs and CYPs over insecticide exposure shown to be tissue-specific in Bombyx moths . Two CSPs, BmorCSP7 and BmorCSP10, and JH esterase had coordinate expression . Taken together, the two studies suggest some relationships between CSP, CYP and JH related genes. The interactions among the CSP, CYP and JHBP genes probably need to be cautiously addressed by recombinant proteins interaction in vitro, RNAi, in situ hybridization and/or immunohistochemistry in moths, whiteflies and other insect species. It may be that some CSPs work with CYPs to mediate xenobiotics response, while some others play a key role in JH signaling pathways.
We have checked for the mode of action of BtabCSP1, BtabCSP2 and BtabCSP3 using protein expression, purification and fluorescence spectroscopic binding assay (see Figs 6 and S8– S10, Table 2 and S3 Table). Using BtabCSP1, for which we observed significantly increased expression in response to insecticide thiamethoxam molecule in Q biotype, we find no binding for volatile odorant molecules including terpenoids (see Figs 6 and S8 and S9, Table 2 and S3 Table), contrasting results obtained using various CSPs from mosquitoes and bugs [55, 69]. Using a similar binding assay, we find that BtabCSP1 rather involves in the binding of long fatty acid chains such as C18-linoleic acid as other proteins in the odorant binding protein and Niemann-Pick type C2 protein families [70–71]. The three 3D structures and binding data available for CSPs are in agreement with binding to various hydrophobic ligands including fatty acid molecules of various sizes and shapes [72–76]. Linoleic acid (delta 9,12 double bonds) is an unsaturated long carbon chain fatty acid molecule present in most of all insects in all tissues for lipid metabolism and fatty acid biosynthesis [77–78]. In the pheromone gland tissue, it is a key derivative of the fatty acyl intermediates from biosynthetic pathways for specific sex pheromone chemicals . In addition, it is known to be involved in immunological response. Genes involved in linoleic acid metabolism are up regulated upon insecticide exposure along with CYP genes [80–81]. One function for linoleic acid is to help produce lipids and phospholipid-derived hormones that are directly involved in anti-xenobiotic response . The molecule even can retain by itself an anti-xenobiotic activity . Fatty acids such as linoleic acid molecule are therefore certainly crucial not only for various processes such as pheromone biosynthesis, hormone synthesis, neural sensory development and/or adult tissue growth, but also for immunological defense including general responses against insecticide or some other toxic xenobiotic chemicals.
Importantly, along our work on Bemisia CSP1 we show that two additional BtabCSPs do not bind fatty acid, but rather interact directly with specific toxic xenobiotic chemical. BtabCSP2 and BtabCSP3 proteins have a completely different functional task than CSP1. Using same binding assay method of CSP1, we clearly show that linoleic acid binds neither to CSP2 nor CSP3. CSP2 and CSP3 attach smaller, more volatile cyclic component structures such as α-pentyl-cinnamaldehyde with a much more higher affinity (see Figs 6, S8–S10, Table 2 and S3 Table). Interestingly, α-pentyl-cinnamaldehyde present in large amounts in plant essential oils is known not only for its pleasant cinnamon fragrance, but also for its very high insecticide activity [84–86]. This strongly suggests that, while CSP1s might be involved in the transport of fatty acids necessary to counteract neonicotinoid infection, CSP2s and CSP3s might rather be involved in the transport and direct degradation of much more volatile contact insecticide chemicals such as cinnamaldehydes and structurally related compounds.
We report molecular data in support of speciation and Q/B biotype-specific physiological pathways in the sweet potato/silver leaf whitefly B. tabaci. Our study shows that a particular CSP gene (BtabCSP1) expresses at the adult stage and responds to thiamethoxam intoxication specifically in the Q-biotype whitefly known to be highly resistant to neonicotinoid insecticides. Our study also shows that the BtabCSP1 gene encodes for a protein that binds particular long chain fatty acid, linoleic acid, among a large set of volatile and non-volatile ligands. The docking and 3D analysis of CSP1 with linoleic acid is supplied (see Figs 7 and S11 and Table 3), illustrating a profile of seven α-helices and detailed binding mechanism in the lipid solubilization process. BtabCSP1 bound to linoleic acid in a cavity formed by α3, α4 and α5 helices (see Figs 7 and S11). Residue numberings hereafter are as per the indications given in Figs 7 and S11. In the front of Cys55, linoleic acid molecule is sandwiched between Leu51-Ile 70-Phe81 and Ala66-Phe92-Lys95-Phe98 with the acid group anchored by Ala50 in the core of the protein. The extremity of the hydrophobic lipid chain is in close contact with the side chains of Phe81 and Phe98 and backbone of Lys95 (see Fig 7). This structural data is in good agreement with general distribution and behavior of most of all CSPs [20, 24–32, 36]. Particularly CSP1 expression profiling strongly supports a general function in the transport of elongated lipids and fatty acids. In Bemisia, CSP1 ubiquitously expressed in adult tissues from antennae to wing tips [20, 24–32, 36]. We note that BtabCSP1 is particularly highly expressed in the wings (see S12 Fig). In the wings, fatty acids such as linoleic acid and proteins such as CSP1 may be able to determine the wettability and the ability to fly. In most other tissues including antennae and legs, the CSP-C18 complex may be essential for immunological response under xenobiotic stress such as exposure to insecticide (see Figs 5–7). Additionally, in most of all tissues, specific CSP-xenobiotic complexes are certainly crucial for specific responses of insect defense. While results from Sabatier et al. in the Drosophila fruit fly are in favor of an important role of CSPs in toxin degradation , our results in the Bemisia whitefly are in favor of a very important role of CSPs not only in the innate immune response, but also in adaptive (or acquired) immunity to a specific infectious agent such as a highly toxic host plant oil chemical molecule.
In this study, we also supply the docking and 3D analysis of CSP2 and CSP3 bound to cinnamaldehyde (see Fig 7 and S11 and Table 3). Similarly to CSP1, CSP2 and CSP3 are made of seven α-helices. The ring of the insecticide sits in the middle of Ile14-Ile19-Tyr29-Leu49 and Leu60-Ile76-Glu94-Tyr97 in CSP2 and CSP3, respectively. The aldehyde group interacts with Ile14 in CSP2, while it rather connects with Leu56 and Trp90 in CSP3. The extremity of the short aliphatic chain curves in the two proteins, folding in a net of amino acid residues including Tyr11-Glu42-Glu45-Leu46 (CSP2) and Phe32-Phe36-Leu39-Trp90-Leu93 (CSP3). BtabCSP2 and BtabCSP3 bound to the aromatic aldehyde in the central cavity of the prism, the N- and C-terminal α-helical tails remaining free (see Figs 7 and S11) . This can be perhaps helpful to envision new strategies to interfere with fatty acid metabolism, cell signaling and/or insecticide resistance in an insect pest. Most of all BmorCSP genes are up regulated in the gut and other various tissues of the female silkworm infected by insecticide . Our study presently shows that additional CSPs in Bemisia (BtabCSP2 and BtabCSP3) are able to directly interact with a strong potent contact insecticide volatile molecule, strongly suggesting the importance of CSPs at many different levels of the insect defense mechanism. It seems that this mechanism has reached an extremely high level of sophistication in insects, particularly in sweet potato whitefly B. tabaci biotype Q.
Materials and Methods
B. tabaci B- and Q-biotypes were collected from eggplant fields in Shandong province (North/ East-Penninsula, China). Shandong Academy of Agricultural Sciences gave permission to conduct field study on site (permit number SAAS 20110720). The field study did not involve endangered or protected species, but rather a very serious pest of agricultural crops worldwide.
Insect rearing and tissue collection
Each B. tabaci biotype was identified using specific genetic markers, i.e. Cleaved Amplified Polymorphic Sequences (CAPs) of cytochrome oxidase I (COI) . The two biotypes, B and Q, were then reared separately on mature cotton plants maintained in climatic incubators (26°C temperature, 60% humidity, 16/8 h day/night). Males were distinguished from females based on size and shape of the abdominal tip. The different development stages (crawlers, 2nd to 4th nymphs, pupae, adults) were distinguished microscopically based on size, shape, position of wings and color (red colored eyes). For each developmental stage present on the same infested plant, hundreds of individuals were collected in an eppendorf tube, kept on ice for 10 min and freeze-dried in nitrogen before storage at -70°C. Tissues were collected separately from a hundred of 1-2-days-old Q-adults under microscope and immediately frozen until RNA extraction.
Molecular cloning of genomic DNA and cDNA
Antennal, abdominal, thoracic, leg, head and wing RNAs were extracted using Trizol for onestep Reverse Transcriptase PCR experiments. Whole body was used for RNA extraction in further molecular studies. Genomic DNA (gDNA) was extracted using DNeasy blood and tissue kit (Qiagen) for pools of thirty individuals in each biotype. Total RNA was extracted from the whole body of pooled individuals using the TrizolTM method (Life Technologies) and treated with DNase I (Fermentas). Complementary DNA (cDNA) was synthesized using RevertAidTM First Strand cDNA Synthesis Kit (Fermentas). Primers used to clone cDNA and gDNA of BtabCSP1, BtabCSP2 and BtabCSP3 were as follows:
BtabCSP1s: 5’-atgcaggttttgactttagtt-3’ (MQVLTLV),
BtabCSP1as: 5’-ttacagggtttgtccgaag-3’ (FGQTL),
BtabCSP2s: 5’-atgttcaaagttctcgtggt-3’ (MFKVLVV),
BtabCSP2as: 5’-ttaggcggatgccttgag-3’ (LKASA),
BtabCSP3s: 5’-atgaacaagattgtattggc-3 (MNKIVLA),
BtabSCP3as: 5’-ctagactttgatacccaatt-3’ (KLGIKV).
Additional primers tuned to specific intron fragments were used to sequence full-length gDNA clones:
PCR was done in a TaKaRa PCR Thermal Cycler Dice, programmed as follows: 94°C for 5 min; 35 cycles of 94°C for 30 s, 52°C for 30 s and 72°C for 1 min; final extension at 72°C for 7 min. PCR products were separated by electrophoresis in a 1% agarose gel stained with ethidium bromide. After staining, cDNA and gDNA PCR products of interest were purified using EasyPure Quick Gel Extraction Kit (TransGen Biotech) and cloned into pEasy T1 vector (TransGen Biotech). For each biotype-specific cDNA or gDNA PCR product, ten clones were subjected to sequencing on an ABI3700 sequencer instrument using the RR Dye Deoxy terminator cycle sequencing kit (Perkin Elmer) and Sp6/T7 sequencing primers.
PCR products were amplified from specific BtabCSP1, BtabCSP2 and BtabCSP3 plasmids as templates, purified and labeled with Digoxigenin-11-dUTP using DIG-High Prime (DIG High Prime DNA Labeling and Detection Starter Kit I; Roche). Genomic DNA sequences corresponding to BtabCSP1, BtabCSP2 and BtabCSP3 were amplifed in biotypes B and Q using HiFi Taq (TransGen Biotech). PCR products and labeled λ-EcoR T14 digest DNA markers were separated using 0.8% agarose gel electrophoresis at low voltage for 10 h. After denaturation and neutralization, PCR products and markers were transferred from gel to nylon membrane by capillary transfer using 20× SSC for 20 h. Following transfer, nylon membrane was backed in a vacuum oven maintained at 80°C for 2 h. Membrane DNA was then prehybridized in DIG Easy Hyb buffer at 42°C for 2 h and hybridized with specific DIG BtabCSP probe in DIG Easy Hybridization buffer (12 rpm, 39°C, overnight). After hybridization, membrane was incubated in 2× SSC/0.1%SDS (2 × 5min) and stringency wash solution (0.5× SSC/0.1% SDS) at 65°C for 2 × 15min on a shaker agitator. Immunological detection was done according to DIG High Prime DNA Labeling and Detection Starter Kit I (Roche).
The 9110 Sequences from the EST whitefly database were downloaded from GenBank (www. ncbi.nlm.nih.gov) and analyzed using the blastx algorithm of the NCBI database. In this sequence similarity study, eighteen clones showed specific hits to OS-D superfamily (Table 1). Sequences orthologous to BtabCSP1, BtabCSP2 and BtabCSP3 in other insect species were searched by screening the NCBI database using the BLASTP tool (http://blast.ncbi.nlm.nih. gov/). Only the sequences having total score above 120, query coverage above 90%, e-value below 1e-33 and a maximum identity superior to 50% were selected for phylogenetic analysis.
For phylogenetic analysis, BtabCSP and orthologous sequences were aligned in MUSCLE Version 3.8.31. The resulting PHYLIP file (SEAVIEW; http://pbil.univ-lyon1.fr/software/ seaview3) was used as template for further processing in RAxMLGUI . In RAxMLGUI, the phylogenetic tree was built using parsimony with the inclusion of maximum likelihood (PROTGAMMA model, BLOSUM62, 100 boostraps). The two CSP sequences (ABH88167, ABH88166) from the water flea Daphnia pulex (Arthropoda, Crustacea, Branchiopoda, Cladocera, Daphniidae) were used as outgroup.
Quantitative real-time PCR (qRT-PCR)
One-step Reverse transcriptase PCR (TAKARA Bio. Inc) was used to check for tissue-distribution of CSP1 following the method described in Liu et al. . qRT-PCR was used to measure the changes of CSP1, CSP2 and CSP3 gene expression during development and over insecticide exposure. In a typical qRT-PCR experiment, cDNA templates (20 ng/μl) were used in a total volume of 20 μl containing SYBR Green I PCR Master Mix (Roche) and 200 nM of each primer, using 96-well optical-grade PCR plates and matched optical-grade membrane of StepOne Plus ABI system. qPCR primers were specifically designed to generate an amplicon of about 100–150 bps:
qPCR-BtabCSP1s: 5’-gacaactacaccaagtgcc-3’ (DNYTKC), qPCR-BtabCSP1as: 5’-gtagaacttgatgaccttacg-3’ (RKVIKFY), qPCR-BtabCSP2s: 5’-caacgtcgacgatattttgg-3’ (NVDDIL), qPCR-BtabCSP2as: 5’-cttttctgtgcatttggagc-3’ (CSKCTE), qPCR-BtabCSP3s: 5’-tgatggatgaacacacatgc-3’ (LMDEHTC), qPCR-BtabCSP3as: 5’-gaatgaccttttcggatcca-3’ (SGSEKVIR).
We used actin (housekeeping gene) as an internal invariant endogenous control for qRT-PCR (BtabActin-sense: 5’-tcttccagccatccttcttg-3’, BtabActin-antisense: 5’-cggtgatttccttctgcatt-3’; BT-H-019-1-C5-T3_C05). We also compared in each body sample the expression values of CYP6M1, CYP6X1, JHBP, EcR, USP and trehalase genes referred to BtabActin as control (= 1). Primers were as follows (GQ214539, GQ292715, G6-T3_G06, EF174329, EF174330, JX024261):
qPCR-BtabCYP6CM1s: 5’-tccacgagggaattttagct-3’, qPCR-BtabCYP6CM1as: 5’-gcagcgtctcatcaataactt-3’, qPCR-BtabCYP6CX1V1s: 5’-gtgccctacatctcgcctatca-3’, pPCR-BtabCYP6CX1V1as: 5’-ccagccactctttcaccattcc-3’, pPCR-BtabEcRs: 5’-actcaacgacgagttgttctcc-3’, pPCR-BtabEcRas: 5’-gctacattttcttcttctggtgc-3’, pPCR-BtabTres: 5’-caacaatgggacgcacaaa-3’, pPCR-BtabTreas: 5’-ctcacctccgcctccaatag-3’, pPCR-BtabJHBPs: 5’-aacggaaaccaaagactgag-3’, pPCR-BtabJHBPas: 5’-ggaacacttggttgagtaca-3’.
qRT-PCR products were sequenced to attest primer specificity. Each sample in B and Q was analyzed in triplicates. Three biological samples were used for each time point at each developmental stage and in the experiments on the effects of insecticide on the expression of CSP genes (see below). Reactions were run in a StepOne Plus system with initial denaturing step consisting of 2 min at 50°C and 10 min at 95°C, followed by 40 cycles of 15 s at 95°C-30 s at 60°C and terminated by a dissociation curve program from 60 to 95°C with a heating rate of 0.1°C and a continuous fluorescence acquisition. Expression levels were determined using the relative 2(-Delta Delta C(T)) method . Data were statistically analyzed by one-way Anova in SPSS.
Thiamethoxam is one major neonicotinoid insecticide, known to act on the nicotinic acetylcholine receptors (nAChR) within the insect central nervous system. To expose B. tabaci to the neonicotinoid insecticide thiamethoxam (25% WG, Syngenta, China), we chose the cotton leaf-dip method [37, 90]. One week<-old adult forms from the laboratory colonies of B- and Q-biotypes of B. tabaci were collected separately and exposed in the laboratory conditions (26°C temperature, 60% humidity, 16/8 h day/night) to 0, 6.25, 12.5, 25 and 50 μg/ml of thiamethoxam, respectively. Significant mortality rate was observed only for higher doses (>100 μg/ml) at different exposure time. A 100 μg/ml dose of insecticide killed most of the whiteflies after 72 h exposure.
The experiment started (time 0) when insecticide treated and non-treated plant discs (2.5 cm diameter) were individually placed on agar base in 20-ml glass vials.Twenty-eight whiteflies belonging either to B or Q-biotype were aspirated into each vial. Vials were capped with an absorbent cotton stopper and inverted. Insecticide-treated whiteflies fed in treated vials on cotton plant leaf discs dipped for 10 s in formulated thiamethoxam diluted with distilled water. Whiteflies kept in vials containing cotton plant leaf discs of the same diameter dipped for 10 s in distilled water only were used as controls (0 μg/ml of thiamethoxam). After 1, 4, 24, 48 and 72 h post-exposure, all individuals in each category were collected in an eppendorf tube in treated and untreated groups of whiteflies in each of the two biotypes, kept on ice for 10 min and freeze-dried in nitrogen before storage at -70°C.
Protein expression and purification
Recombinant CSP1, CSP2 and CSP3 proteins were expressed using ChampionTM pET SUMO protein expression system (Invitrogen) according to the Invitrogen protocol. The coding region was amplified by PCR using two specific primers:
For protein expression, specific PCR products were ligated into Pet SUMO vector, before to use the construct to transform competent One Shot Mach1-T1 E.coli cells. Sequencing a dozen of clones identified constructs with correct orientation and gene sequence. The CSP1, CSP2 and CSP3 constructs were all three sub-cloned into BL21 (DE3) One Shot cells. Single colonies were grown in 10 ml LB medium containing 50 μg/ml kanamycin at 37°C with shaking (200 rpm) overnight. The next day, 50 ml LB medium containing 50 μg/ml kanamycin with 1 ml of the overnight culture was grown at 37°C with shaking (200 rpm) until the optical density reached 0.4–0.6. CSP1 and CSP3 proteins were induced with 1mM isopropyl β-D-thiogalactoside (IPTG) for 3 h at 37°C, while CSP2 protein was induced with IPTG for 3 h at 32°C. In the three experiments, the cells were harvested by centrifugation at 5000 rpm for 10 min, resuspended in lysis buffer (20 mM sodium phosphate, 500 mM NaCl, 10 mM imidazole, pH 8.0), lysed by sonication (10 s, 5 passes) and centrifuged again at 12 000 rpm for 15 min at 4°C. Proteins from the clear supernatant and the precipitated pellet were then collected separately and analysed by SDS-PAGE. Western blot was done to attest the identity of the expressed protein. The proteins separated by SDS-PAGE were transferred to nitrocellulose blotting membranes (Pall Corporation) at 1mA/cm2 for 2 h using semi-dry transfer system JY-ZY3 (Beijing JunyiDongfang Electrophoresis Equipment CO. LTD). Blocking was done in TBST (10 mM TrisHCl, 0.15 M NaCl, 0.05% Tween-20) overnight at 4°C. Heliothis CSP1b antibody was used as primary antibody at dilution of 1:1000 as described in Xuan et al. . Anti-rabbit IgG labeled with HRP was used as secondary antibody at a dilution of 1:3000. Protein was detected using the HRP-DAB chromogenic substrate detection system (Tiangen) as described by the manufacturer.
SUMO-BtabCSP1, BtabCSP2 and BtabCSP3 fusion proteins were purified through Ni-NTA column and gel-filtration on a Superdax 75 column conditioned in the AKTA Purifier 10 system (GE Healthcare Biosciences). The N-terminal peptide containing the 6xHis tag and SUMO was cleaved using SUMO protease (Invitrogen) following the manufacturer’s recommendations and the cleaved protein purified using Ni-NTA column. Resulting peak protein fractions were desalted by HiTrap desalting column (GE Healthcare Biosciences) and concentrated using 3-kDa cutoff filters (Millipore). The final concentration of pure recombinant CSP1, CSP2 and CSP3 protein samples was assessed by the Bradford method using BSA as standard.
Fluorescence binding assay
Protein-ligand binding affinity was determined for the three recombinant proteins. Emission fluorescence spectra were recorded on a Jasco FP-6500 Series fluorescence spectrofluorometer. To measure the affinity of the fluorescence ligand of 1-NPN to CSP1, CSP2 and CSP3, a 2 μM solution of protein in 50 mM Tris buffer (pH 7.4) was titrated with aliquots of 1 mM methanol solutions of ligand to a final concentration of 2–30 μM. Fluorescence was measured using an exciting wavelength of 337 nm. Emission spectrum was recorded between 350 and 500 nm. Ligand affinity was measured in competitive fluorescence binding assay using 2 μM of 1-NPN as fluorescent reporter and 2–50 μM of a competitor ligand solution. Binding constants of competitors were calculated from IC50 values using the equation: Ki = [IC50]/ (1+[1-NPN]/ K1-NPN), [1-NPN] and K1-NPN being the free concentration of 1-NPN and the dissociation constant of the complex CSP/1-NPN, respectively .
Molecular modeling of CSP1, CSP2 and CSP3
3D models for BtabCSP1, BtabCSP2 and BtabCSP3 were built using Modeller . For each of these three targets, appropriate structural templates were obtained manually from the PDB after a Blast search. Seven structural templates were found in PDB (1K19, 1KX8, 1KX9, 1N8U, 1N8V, 2JNT, 2GVS). These shared pairwise sequence identities between 37.1% and 40% with BtabCSP1, between 42% and 50% with BtabCSP2 and between 45.3% and 54.1% with BtabCSP3. Two of them (1N8U, 1N8V) were in the holo form and contained a bound ligand while the others were in the apo form and did not contain a ligand (1K19, 1KX8, 1KX9, 2JNT, 2GVS). The former differed sensibly to the apo form for they have a well-identified large cavity thus showing that CSP proteins undergo structural changes upon binding to ligands. Both apo and holo forms were used for the modelling. They were aligned with the BtabCSP targets using multiple sequence alignment in the Geneious software and the alignments were used by Modeller using the standard automodel procedure and both VTFM and MD optimization procedures for building and refining 3D models for each of the BtabCSP. In order to take into account inherent flexibility of the protein, 100 models for each BtabCSP were built. These display structural variations namely at the side chains levels hence mimicking protein flexibility. These were used in the subsequent docking detailed below.
The binding of linoleic acid and α-pentyl-cinnamaldehyde to BtabCSP1, BtabCSP2 and BtabCSP3 was further assessed using molecular docking simulations. For each CSP, docking of the ligands were performed on the 100 models using AutodockVina 4.2 with standard docking parameters, a grid that englobed all the protein and ligands that were left completely flexible . For each CSP, the most populated cluster of ligand poses across all the 100 protein models was selected as the most representative binding mode of the ligand. From these, average predicted binding energy (in kcal/mol) were calculated and corresponding average predicted Kd values (in μM) were derived where Kd = exp(ΔG/RT) with T = 300 K. In order to take into account the flexibility of the protein, this Kd valued was further weighted by the fraction free/ bound where free is the amount of models among 100 where the most populated ligand pose was not observed and bound is the amount of models among 100 where the most populated ligand pose was observed. Qualitative analysis of the residues involved in the interaction with the ligands was performed using the LigPlot+ software .
This article was originally published in PLoS ONE 11(5): e0154706. doi:10.1371/journal.pone.0154706. This is an Open Access article distributed under the terms of the Creative Commons Attribution License.