Skip to main content

ORIGINAL RESEARCH article

Front. Vet. Sci., 11 December 2024
Sec. Parasitology

Phylogeny and divergence time estimation of the subfamily Amphipsyllinae based on the Frontopsylla diqingensis mitogenome

Ju PuJu PuXiaoxia LinXiaoxia LinWenge Dong
Wenge Dong*
  • Yunnan Provincial Key Laboratory for Zoonosis Control and Prevention, Institute of Pathogens and Vectors, Dali University, Dali, China

Fleas are primarily parasites of small mammals and serve as essential vectors of the transmission of plague. The subfamily Amphipsyllinae (Siphonaptera: Leptopsyllidae) consists of 182 species across 13 genera, widely distributed worldwide. Only two species of Amphipsyllinae have been sequenced for complete mitogenomes to date. It hinders the taxonomy and evolutionary history studies of fleas. In this study, we first sequenced the Frontopsylla diqingensis mitogenome and performed comparative mitogenomic analyses with the two other species (Frontopsylla spadix and Paradoxopsyllus custodis) in Amphipsyllinae available in the NCBI database. The evolutionary process of Amphipsyllinae was comprehensively analyzed in terms of nucleotide composition, codon usage, nucleotide diversity, tRNA secondary structure, nucleotide skew, phylogeny tree, and divergence time. Nucleotide diversity and tRNAs of three species of fleas of Amphipsyllinae have differences among different species. The effective number of codon (ENC)-plot, neutrality curve, PR2, and correspondence analysis (COA) showed that the codon preference of Amphipsyllinae was influenced mainly by natural selection. For phylogenetic trees and divergence time of the order Siphonaptera, our results showed two concatenated data matrices, namely, PCG: (((Ceratophyllidae + Leptopsyllidae) + ((Vermipsyllidae + Hystrichopsyllidae) + Ctenophthalmidae)) + (Pulicidae + Pygiopsyllidae)); PCGRNA: ((((Ceratophyllidae + Leptopsyllidae) + ((Vermipsyllidae + Hystrichopsyllidae) + Ctenophthalmidae)) + Pulicidae) + Pygiopsyllidae). We concluded that P. custodis and Macrostylophora euteles from GenBank are the same species by phylogenetic trees and sequence alignment, and supported the monophyly of Amphipsyllinae. Amphipsyllinae diverged in the Cenozoic, approximately 73.37–40.32 million years ago (Mya). The majority of the species within the intraordinal divergence into extant lineages occurred after the K-Pg boundary. The common ancestor of the extant order Siphonaptera diverged during the Cretaceous. Our findings supported those of Zhu et al. (1). This study provides new insights into the evolutionary history and taxonomy of the order Siphonaptera.

1 Introduction

The order Siphonaptera, commonly known as fleas, feed on host blood and serve as vectors of plague, making them significant medical insects worldwide. Recent molecular studies have demonstrated they are a monophyletic group (1). The origin of Siphonaptera can be traced back to the Mesozoic Jurassic (2). However, Zhu et al. (1) suggested that the common ancestor of Siphonaptera began to diverge in the Cretaceous. Due to the scarcity of fossil records, the early evolution of fleas is rarely known, and the earliest fossils of fleas recorded and identified were in the Eocene and Miocene (Palaeopsylla klebsiana, Palaeopsylla dissimilis, Peusianapsylla baltica, and Peusianapsylla groehni from Baltic amber) (38). Fleas are species-rich and complex in morphology, and their origin and taxonomic status have been controversial. In the 19th century, some researchers classified fleas as beetles based on their morphological characteristics (9). In the 20th century, some researchers suggested that fleas were closely related to the Orders Mecoptera and Diptera, constituting the group Antliophora (10). In the 21st century, it was found that the orders Siphonaptera and Mecoptera were sister groups (11, 12), but some researchers supported that the order Siphonaptera and the family Nannochoristidea (Mecoptera) were sister groups (13). Wu et al. (14) presumed that some genera and species of the families Macropsyllidae, Stephanocircidae, and Pygiopsyllidae (currently a subfamily of Hystrichopsyllidae) at the base of tree parasitized the older host (Basidioidea and Monoporida), while the family Ceratophyllidae at the top of tree parasitized the younger host (Lagomorpha, Rodentia, Eulipotyphla, etc.,). Such an alignment differs from that studied by Hopkins and Rothschild (14). Currently, more than 2,500 species of Siphonaptera are regarded as valid species or subspecies—Lewis et al. (15) conjectured that the genera and species described are approximately 250–300 species. The drastic changes in the natural environment and climate in recent years have resulted in some host species being critically endangered or even extinct. Some flea species with high host specificity, such as those in Ceratophyllidae, Ischnopsyllidae, Stephanocircidae, and Pygiopsyllidae families, may also be at risk of extinction. Fleas are temporary hosts or important vectors for many zoonotic diseases, such as Yersinia pesti, Rickettsia typhi, and Bartonella henselae (1618). The evolutionary history, taxonomic status, and transmission of plague of fleas are worth studying.

The head of the subfamily Amphipsyllinae (Siphonaptera: Leptopsyllidae) is integricipit, with an antennal fossa that may be proximal to the top of the head and lacks cleft. Usually, there is no genal comb; if there is a genal comb, there is only one or two spines backward. Amphipsyllinae with 5 tribes, 13 genera, and 182 species (subspecies) distributed mainly in Palaearctic (mostly) and Neopelagic (14). However, (15, 19) classified Amphipsyllinae into 23 genera and 177 species, with the vast majority parasitizing mammals, and only a few parasitizing birds. Previously, the subfamily Leptopsyllinae, characterized by a genal comb, was often included in the family Hystrichopsyllidae, while Amphipsyllinae lacking a genal comb was included in Ceratophyllidae. It was later found that Amphipsyllinae and Leptopsyllinae were more closely related, and the two subfamilies lumped into one family (Leptopsyllidae). So far, the complete mitogenomes have been determined for only three Amphipsyllinae species. Mitogenomes with simple structure, maternal inheritance, and fast evolution were used to study the phylogeny, molecular evolution, insect taxonomy, and population genetics of insects (2023). The typical insect mitogenome contains 37 genes (13 protein-coding genes, 22 tRNA genes, and 2 rRNA genes) and one non-coding region. Thirteen of these protein-coding genes are respiratory chain components in the inner mitochondrial membrane. Transfer tRNA (tRNAs) usually possess typical cloverleaf structures, consisting of an amino acid acceptor arm, anticodon arm, dihydrouracil arm (DHU), and TψC arm. Typically, the amino acid acceptor arm is 7 bp, the DHU arm is 4 bp, TψC arm is 5 bp, and the anticodon arm is 5 bp. rRNAs (rrnL and rrnS) are primarily used in phylogenetic studies (24). Non-coding regions, also called control regions, are the regulatory regions for replication and transcription initiation of mitogenomes and have the fastest rate of evolution (25, 26).

In this study, (a) the mitogenome of Frontopsylla diqingensis Li and Hsieh, 1974, was determined and analyzed for the first time, with its morphological characteristics described in detail; (b) in combination with the mitogenome sequences of Amphipsyllinae species available in GenBank (Frontopsylla spadix Jordan and Rothschild, 1921: NC073018 and Paradoxopsyllus custodis Jordan, 1932: OQ627398), we analyzed the structural features and variation of the mitogenomes of Amphipsyllinae to further clarify the evolutionary differences among them; and (c) the phylogenetic relationships and origin of Siphonaptera were evaluated, and the divergence times among various groups of Siphonaptera were estimated.

2 Materials and methods

2.1 Specimen collection, morphological identification, DNA extraction, and mitogenome sequencing

In May 2012, Ochotona thibetana Milne and Edwards, 1871, (Lagomorpha, Ochotonidae) was captured with mouse cages in Diqing Tibetan Autonomous Prefecture, Yunnan Province, China, and taken to the laboratory for ectoparasite collection. All specimens of F. diqingensis were collected on O. thibetana and stored in an Eppendorf tube with 95% ethanol at −80°C for subsequent identification and DNA extraction. F. diqingensis and O. thibetana were deposited at Dali University. All specimens were trapped and preserved with the approval of the Animal Ethics Committee of Dali University under the approval number MECDU-201912-20.

The morphological identification is primarily based on “Fauna Sinica insecta Siphonaptera” (14). For specimens to be photographed, we first placed them in distilled water to clean them and to stretch the wrinkled fleas. The specimens were sealed and dried, then photographed and identified in detail for morphological features using an OLYMPUS (SZ2-ILST) micrographic system. Five specimens were sent to Shanghai Winnerbio Technology Co., Ltd. (Shanghai, China). The tissues were cryoprocessed, ground, and mixed with cetyltrimethylammonium bromide (CTAB). CTAB is a cationic active agent that lysed membrane proteins and lipids, as well as depolymerized nuclear proteins. CTAB-nucleic acid complexes are soluble and stable at high salt (>0.7 mM) concentrations. Still, at low salt concentrations (0.1–0.5-mM NaCl), CTAB-nucleic acid complexes precipitate due to reduced solubility, while the majority of the proteins, polysaccharides, etc. remain dissolved in the solution. The CTAB-nucleic acid complex is separated from sugars, proteins, etc., by centrifugation, and nucleic acids can be precipitated by ethanol, while CTAB dissolved in ethanol and removed. DNA concentration and purity were measured using a spectrophotometer and electrophoretically tested for completeness. Sequencing was performed on the Illumina NovoSeq 6,000 platform (Illumina, Inc.). During Illumina sequencing, a sequencing library was constructed using about 1 μg of genomic DNA. According to the manufacturing protocol, the DNA samples were sheared into 400–500-bp fragments using a Covaris M220 Focused Acoustic Shear (Covaris, Inc.). The sheared fragments were used to prepare Illumina sequencing libraries. The prepared library was then used for paired-end Illumina sequencing (2 × 150 bp) on an Illumina NovaSeq 6,000 platform.

2.2 Mitochondrial sequence assembly, annotation, and analysis of F. diqingensis

MitoZ 2.3 (27) was used to assemble the mitogenome, and BWA v0.7.17 (28) and Samtools v0.1.20 (29) were used to assess the confidence of the assembly data (sequencing depth ≥ 100×). Annotation was performed using Geneious Prime 11.0 (30). Protein-coding genes and rRNA genes were identified by BLAST2.16 (31) and MITOS Web Server (32). The tRNA gene was predicted using tRNAscan SE2.0 (33) and ARWEN (34), and then manually checked and corrected. The annotated mitogenome sequence of F. diqingensis has been deposited to GenBank (accession number: PP083946).

CodonW was used to analyze the codon preference of three species. Skew was calculated as AT skew = (A − T)/(A + T) and GC skew = (G − C)/(G + C). ClustalX (35) and MEGA (36) were used to calculate the conserved and variant sites of tRNA genes; DnaSP v5 (37) was used to analyze the nucleotide diversity, variable site, conserved site, and singleton site of 13 protein-coding genes.

2.3 Phylogenetic analysis

Boreus elegans Carpenter, 1935 (38) was used as the outgroup. Bayesian inference (BI) (39) tree and Maximum likelihood (ML) (40) tree were constructed based on two concatenated data matrices (PCG: 13 protein-coding genes; PCGRNA: 13 protein-coding genes and 2 rRNA gene sequences) of 23 species (Supplementary Table S1). Sequence alignment was initially performed using MAFFT (41) software and then optimized using MACSE v 2.06 (42) software. MACSE uses the classic “Needleman–Wunsch” algorithm to correctly identify pseudogenization events and maintain the ancestral codon structure. Since comparing PCG sequences directly using MACSE was too slow, the PCG sequences were first compared using MAFFT and then optimized using MACSE. Significant gaps and ambiguous sites were omitted using Gblocks (43). The best-fit nucleotide substitution model for the tree was determined using ModelFinder (44). The best-fit nucleotide substitution model for constructing ML and BI trees is shown in Supplementary Table S2. IQ-TREE (45) was used to construct the ML tree. The number of bootstrap repetitions in the maximum likelihood analysis was set to 1,000, and the support rate of the node branch uses the maximum likelihood bootstrap proportion (BSP). When BSP ≥ 70%, the node has a high support rate. MrBayes 3.2.6 (39) was used to construct the BI tree. The BI tree underwent 3,000,000 generations, with trees sampled every 1,000, each with four independent Markov Chain Monte Carlo (MCMC). When Bayesian posterior probabilities (BPP) ≥ 95%, the posterior probability had a higher support rate. Figtree (46) was used to visualize and edit the evolutionary tree.

2.4 Divergence time estimation

Based on the reconstructed phylogenetic tree, the divergence time of 23 species of 15 genera in 7 flea families was estimated using Bayesian Evolutionary Analysis Sampling Trees (BEAST 2.0) (47), with data from 13 PCGs. Due to limited flea fossil records and available sequence data, fossil calibrations (amber specimens, see Introduction section) were employed to confirm true Siphonaptera, which convey verifiable knowledge of the existence of specific lineages of Siphonaptera over time. (1) The genus Pulex Linnaeus, 1758. Although this flea fossil was placed into the Miocene, the age range of Dominican amber was controversial, with estimates running from 20 to 15 Mya (Iturralde–Vinent and MacPhee, 1996) to 45–30 Mya (Schlee, 1990). It was set to 35–15 Mya to reflect this uncertainty (1, 48). (2) Ctenophthalmidae: 40–35 Mya was used for calibration points, and the root node was not calibrated. First, the best model for accurate phylogenetic estimation was selected using ModeFinder (44). The most suitable replacement model, GTR, was set in BEAUti, along with the adoption of a strict molecular clock model. The tree prior model used the Yule process, and the prior distribution was set to normal to estimate the divergence time. The Markov chain (MCMC chain) ran for 40,000,000 generations, with trees sampled every 1,000 generations. The convergence of the parameter estimates was performed using Tracer. The ESS value of the parameters was >200. The top 10% of the tree was discarded as burn-in. Trees were edited and visualized using tvBOT (49).

3 Results

3.1 Morphological characteristics and mitogenome organization of F. diqingensis

From a careful examination of specimens of F. diqingensis, the following characteristics were used for identification: The sternum IX (rear arm) is not developed and narrow. It bears six bristles. There is a row of seven bristles above it, and the spacing between the two bristles is very short. The top of the immovable process is up to the midpoint of the movable process, which is higher than F. spadix Jordan and Rothschild, 1921. The immovable process is conical, and its upper anterior angle is mostly a right angle. The concave degree of the upper part of the trailing edge of the movable process is usually deeper. There are three sub-thorns in the middle and lower part of the trailing edge of the movable process. There are three spiniforms in the middle and lower part of the trailing edge of the movable protrusion. The shape of slermum VII (rear arm) differs from F. spadix. The tail boundary of spermatheca is unclear (Figure 1).

Figure 1
www.frontiersin.org

Figure 1. Morphological characteristics of (A) (♂) and (B) (♀) of Frontopsylla diqingensis from China.

The coding region of F. diqingensis mitogenome (GenBank accession number: PP083946) is 16,153 bp in size (the non-coding region is not complete, which will not be discussed in this article) (Figure 2). The nucleotide composition and skew are shown in Table 1. Among the 37 genes, 4 PCGs (nad1, nad4, nad4L, and nad5), 8 tRNA genes (trnY, trnC, trnQ, trnV, trnL1 (tag), trnP, trnH, and trnF), and 2 rRNA genes (rrnL and rrnS) were encoded on the N-chain, while the remaining 23 genes (nad2, cox1, cox2, atp8, atp6, cox3, nad3, nad6, cob, trnI, trnM, trnW, trnL2 (taa), trnK, trnD, trnG, trnA, trnR, trnN, trnS1 (tct), trnE, trnS2 (tga), and trnT) were encoded on the J-chain. The AT-skew of the F. diqingensis mitogenomes was −0.027, and the GC-skew was −0.228. There are a total of 13 intergenic spacer regions (106 bp), with the largest intergenic spacer region being 32 bp, followed by 21 bp. There are 11 overlap regions (28 bp), with the largest overlap region being 7 bp and the smallest overlap region being 1 bp.

Figure 2
www.frontiersin.org

Figure 2. Organization of the Frontopsylla diqingensis mitogenome. tRNA genes were shown with the single-letter abbreviations of their corresponding amino acids. The two leucines are denoted by L1 (tag) and L2 (taa), and the two serines are denoted by S1 (tct) and S2 (tag).

Table 1
www.frontiersin.org

Table 1. Organization of the Frontopsylla diqingensis mitogenome.

3.2 Mitogenome composition of F. diqingensis

The total length of the 13 protein-coding genes was 11,121 bp, the longest was nad5 (1,716 bp), and the shortest was atp8 (162 bp). All 13 protein-coding genes use the typical ATN as the start codon (ATG: 7, ATA: 3, ATT: 2, and ATC: 1), and the stop codons are TAA, TAG, and incomplete T (Table 1). The 13 protein-coding genes of F. diqingensis encode a total of 3,707 amino acids. The most frequently used code was UUA (RSCU = 4.46), and the lowest frequency used code was CCG (RSCU = 0.1) (Supplementary Table S4). The average length of 22 tRNAs was 65.3 ± 2.59 bp, and the size of a single gene was 62–70 bp. The secondary structure of tRNA is shown in Supplementary Figure S1. Among the 22 tRNA genes, except for trnS1 (tct) lacking D-arm, all other tRNA genes formed typical cloverleaf structures. In addition to the typical Waston-Crick pairing (A-T, G-C), a total of 23 mismatches (G-T: 9, T-G: 7, T–T: 5, and A-A: 2) occurred in the mitogenome folding process of F. diqingensis, of which trnH mismatch was the most occurred (4 times) (Supplementary Figure S1). The total length of the two rRNAs was 2093 bp, both located on the N chain. The AT content of rrnL was 81.8%, and the AT content of rrnS was 80.1%.

3.3 The mitogenomes of different species in the subfamily Amphipsyllinae

F. diqingensis, F. spadix, and P. custodis showed high AT (75.5, 78.8, and 76.8%) contents. The AT-skew of F. diqingensis was −0.027, and the GC-skew was −0.228. The AT-skew of F. spadix was −0.036 and the GC-skew was −0.214. The AT-skew of P. custodis was −0.008, and the GC-skew was −0.259. The AT content of the first, second, and third codons of protein-coding genes of three species of flea was analyzed, and it was found that the AT content in the first and second codons was much smaller than that in the third codon. It shows that the third codon of the three species in Amphipsyllinae prefers to use A and T bases (Table 2). The protein-coding genes of three species use the typical ATN as the start codon, with ATG being used most frequently (Supplementary Table S5). Except for nad2 and nad4 with incomplete T or TA as the stop codon, the remaining 11 protein-coding use the typical TAN as the stop codon, with TAA used most frequently (Supplementary Table S5).

Table 2
www.frontiersin.org

Table 2. Nucleotide composition of 3 species of the subfamily Amphipsyllinae.

The tRNA secondary structures of F. diqingensis, F. spadix, and P. custodis in Amphipsyllinae have little difference among different species, and all of them form typical cloverleaf structure (except trnS1 (tct)). The 22 tRNAs had 1,458 sites, of which 1,312 were conserved, accounting for 90.0% of the total sites. There were 146 variable sites, accounting for 10.0% of the total sites. There were 14 mutation sites of trnC and 1 mutation site of trnQ (Supplementary Table S6). The majority of these mutation sites were at the junction of arms and rings (Supplementary Figure S2).

The nucleotide diversity index (π) of 13 protein-coding genes in Amphipsyllinae was 0.3842. There were 3,991 variable sites (V), 7,147 conserved sites (C), and 3,890 singleton sites (S).

3.4 The relative synonymous codon usage (RSCU) of the subfamily Amphipsyllinae

The relative synonymous codon usage of three species of Amphipsyllinae was analyzed and found that the codon usage preference patterns of F. diqingensis, F. spadix and P. custodis were very similar (Supplementary Table S4, Supplementary Figure S3). F. diqingensis, F. spadix, and P. custodis had 26, 27, and 27 high-frequency codons (RSCU >1), respectively. It can be seen from RSCU that there is a specific preference for codon usage, but the reason for its preference is not apparent. We use parity rule 2 (PR2), effective number of codon (ENC), neutrality curve, and corresponding analysis (COA) to analyze the reasons that affect codon usage preference. PR2 showed that mutation, selection, and other factors affected codon usage through qualitative analysis and found that protein-coding genes of three species of fleas were not located at the central site (0.5, 0.5). Among them, F. diqingensis and F. spadix have a T/C preference, and P. custodis has an A/C preference (Figure 3). The ENC values of protein-coding genes of 3 species in Amphipsyllinae ranged from 30.11 to 46.54. The majority of them were below the standard curve (the atp8 gene is not included because CodonW calculates the NC value based on the principle of ENC (50) calculation, and when the gene does not contain amino acids with eight synonymous genes, the ENC value of this sequence is not calculated). Among them, 13 protein-coding genes Nc ≤ 35 have a total of 15 genes, F. diqingensis has 4 genes, F. spadix has 9 genes, and P. custodis has 2 genes (Supplementary Figure S4). In the neutrality curve, GC12 of three species has no significant linear relationship with GC3s (the coefficient of determination R2 is used mainly to quantify the strength of the linear relationship between two variables. The closer the R2 value is to 1, the stronger the linear relationship between GC12 and GC3s; the closer the R2 value is to 0, the weaker the linear relationship between GC12 and GC3s). Therefore, the absolute value of the regression coefficient cannot represent the degree of mutation pressure on protein-coding genes in the species (Figure 4). In the COA analysis, three species of fleas are scattered together. There is no situation in which one species is clustered alone, indicating that F. diqingensis, F. spadix, and P. custodis have similar codon usage preferences (Figure 5).

Figure 3
www.frontiersin.org

Figure 3. Codon usage preference analysis of PCGs in the subfamily Amphipsyllinae by PR2, the y-axis represents the content of A/(A + T) at the third position of the codon, and the x-axis represents the content of G/(G + C) at the third position of the codon. (A) Frontopsylla diqingensis; (B) Frontopsylla spadix; (C) Paradoxopsyllus custodis (A blue dot indicates a protein-coding gene).

Figure 4
www.frontiersin.org

Figure 4. Codon usage preference analysis of PCGs in the subfamily Amphipsyllinae by neutral curve. The y-axis represents the average GC content of the first base and the second base (GC12) of the codon, and the x-axis represents the content of G and C at the third position of codon. (A). Frontopsylla diqingensis; (B). Frontopsylla spadix; (C). Paradoxopsyllus custodis. (A blue dot indicates a protein-coding gene).

Figure 5
www.frontiersin.org

Figure 5. Correspondence analysis (COA) of PCGs in the subfamily Amphipsyllinae.

3.5 The evolutionary relationships of the subfamily Amphipsyllinae

We obtained four trees from two concatenated data matrices (PCG and PCGRNA) and two tree-construction methods (ML and BI) (Figure 6). Four trees are highly similar; the difference is that the taxonomic status of Hystrichopsylla weida qinlingensis Zhang et al. (1984), Dorcadia ioffi Smit (1953), and Aviostivalius aklossi bispiniformis Li and Wang (1958), is ambiguous. In Figure 6A, PCG: The 23 flea species are divided into two major lineages, forming three clades: (Pulicidae + Pygiopsyllidae) forms one clade, while ((Ceratophyllidae + Leptopsyllidae) + ((Vermipsyllidae + Hystrichopsyllidae) + Ctenophthalmidae)) forms two clades that are more closely related. In Figure 6B, PCGRNA: The 23 flea species are divided into two major lineages, forming three clades: Pygiopsyllidae form one clade, while ((Ceratophyllidae + Leptopsyllidae) + ((Vermipsyllidae + Hystrichopsyllidae) + Ctenophthalmidae)), and Pulicidae forms two clades that are more closely related. The families Leptopsyllidae, Ctenophthalmidae, and the subfamily Ceratophyllinae are paraphyletic groups. The families Pulicidae and Ceratophyllidae form a monophyletic group. The families Pulicidae and Pygiopsyllidae are closely related to the base of the phylogenetic tree. In Amphipsyllinae, F. diqingensis and F. spadix were clustered into one clade (BPP = 1, BSP = 100). P. custodis and Macrostylophora euteles Jordan et Rothschild, 1,911 were clustered into one clade (BPP = 1, BSP = 100). The families Ctenophthalmidae, Vermipsyllidae (one species), and Hystrichopsyllidae (one species) had low node support.

Figure 6
www.frontiersin.org

Figure 6. Phylogenetic relationship of the order Siphonaptera was reconstructed by Bayesian inference (BI) and maximum likelihood (ML) method. (A): PCG; (B): PCGRNA). (“★” indicates topological inconsistency).

3.6 Divergence time estimation of the subfamily Amphipsyllinae

According to the estimated divergence time, Siphonaptera originated in the Early Cretaceous around 119.3 Mya (the divergence times and confidence intervals for nodes/clades are shown in Supplementary Table S7). Clade 1 (Pygiopsyllidae and Pulicidae) begins to diverge from clades 2 (Ctenophthalmidae; Vermipsyllidae; and Hystrichopsyllidae) and 3 (Leptopsyllidae and Ceratophyllidae) at 101.21 Mya. Clade 2 (Ctenophthalmidae; Vermipsyllidae; and Hystrichopsyllidae) and Clade 3 (Leptopsyllidae and Ceratophyllidae) began to diverge at 86.26 Mya. The divergence time between Pygiopsyllidae and Pulicidae is approximately 95.36 Mya, and the relationship between Pygiopsyllidae and Pulicidae is closer. Amphipsyllinae diverged approximately between 73.37 and 40.32 Mya. The Ceratophyllidae family diverged most recently at 73.37 Mya, while F. spadix and F. diqingensis diverged at approximately 40.32 Mya. The majority of the intraordinal divergence into Siphonaptera’s extant lineages occurred after the K-Pg boundary (Figure 7).

Figure 7
www.frontiersin.org

Figure 7. Divergence time estimation of various lineage in the order Siphonaptera (Node labels are the mean values of the estimated ages. The node bars indicate 95% confidence intervals).

4 Discussion

Morphological characteristics are the basis for systematic taxonomy. Accurate identification is essential for reconstructing phylogeny. The majority of the identification maps of flea species in the available data are hand-drawn and lack clear color maps. Therefore, this study presents a clear color morphology map to supplement the hand-drawn maps. Three species of Amphipsyllinae retained the ancestral pattern of mitochondrial gene arrangement of arthropods. AT-skew is negative and is mainly influenced by directional evolution pressure and asymmetric replication. The nucleotide diversity of 13 protein-coding genes in Amphipsyllinae was 0.3842, indicating that nucleotide diversity differences among Amphipsyllinae were significant, and showed high genetic diversity. Gene exchange between species was more frequent. Among them, the nad4 and nad6 genes were the two genes with the highest nucleotide diversity among the 13 protein-coding genes, indicating that these two genes have high nucleotide variability within Amphipsyllinae, and can be used as potential molecular markers for future exploration of population differentiation in Amphipsyllinae (Supplementary Figure S4).

We analyzed the relative synonymous codon usage (RSCU) of 13 protein-coding genes in three species of Amphipsyllinae and evaluated the synonymous codon usage bias (Supplementary Table S4, Supplementary Figure S3). An RSCU >1.6 indicates an overrepresented codon, while an RSCU >1 indicates a strong preference for that codon. An RSCU of 1 indicates no preference, meaning all codons are used equally, and an RSCU <1 indicates a weak preference (51, 52). An RSCU of 0 indicates that the codon has not been used. It was noted that the high-frequency codons predominantly end in A or U, while the codons ending with G or C were either rarely used or not used at all. Specific codons always tend to form codon clusters or repeats (53). There are also unique species preferences for G/C-ending codons (54, 55). We used COA analysis (Figure 5), PR2 (Figure 3), ENC-plot (Figure 8), and neutral curve analyses (Figure 4) to further explore the causes of codon usage bias. In the ENC-plot analysis, there were 15 genes with Nc < 35 in three species of fleas, with Nc < 35 indicating a significant preference for codon usage (56). At the same time, the Nc value is also an essential index for evaluating the expression of endogenous genes. The codon preference of highly expressed genes is greater, and their Nc values are smaller. On the other hand, lowly expressed genes contain a greater variety of rare codons and have larger Nc values (50). Only a few genes in three species of fleas fall above the standard curve, and the majority of the genes fall below it, indicating that the majority of the genes have been influenced by natural selection during evolution. No genes were located at the coordinates (0.5, 0.5) for three species of fleas in the PR2 analyses, suggesting that these genes have been influenced mainly by selection during evolution. Because GC content is usually determined by the mutation process of the whole genome (57), it often reflects the strength of directional mutation pressure (58). In the case of mutation pressure, the mutation at the third position of the codon usually does not alter the type of the amino acid, which is a synonymous mutation. Mutations occurring at the first and second codons are non-synonymous mutations, which affect the function and activity of related genes, but these mutations do not happen frequently. Therefore, the probability of mutation at the third codon position is much greater than that at the first and second codon positions. In the neutral curve analysis, the GC12 and GC3s of protein-coding genes of three species of fleas have no significant linear relationship, and the genes were scattered, indicating that these genes were mainly affected by selection during evolution. In summary, it can be deduced that the main reason for codon bias of 13 protein-coding genes in Amphipsyllinae was selection pressure. Additionally, base composition (59), nature of amino acids (aromatic and hydrophobic) (60), and codon context (61) might also be influenced by codon usage preference. There are more generally accepted theories, such as the “energy compensation” theory, which thought that the synthesis of the G + C base pair required more energy and nitrogen than the A + T base pair in nucleotide synthesis (62). The “selection-mutation-drift” theory suggested that mutations were predisposed, and natural selection reflected codon usage preference in the evolutionary process (63). These theories need to be studied further.

Figure 8
www.frontiersin.org

Figure 8. Codon usage preference analysis of PCGs in the subfamily Amphipsyllinae by ENC-plot, the y-axis represents the Nc value, and the x-axis represents the content of G and C at the third position of the codon. (A) Frontopsylla diqingensis; (B) Frontopsylla spadix; (C) Paradoxopsyllus custodis.

Phylogenetic trees were reconstructed using both BI and ML methods, based on two concatenated data matrices from 23 species of Siphonaptera. The majority of the species in the ingroup are clustered together in the same family or genus, and the taxonomic status is clear. Ceratophyllinae, Leptopsyllinae, and Amphipsyllinae are clustered into one clade. F. spadix and F. diqingensis clustered into one clade (BPP = 1, BSP = 100). Consistent with traditional taxonomy. To one’s surprise, P. custodis and M. euteles of different subfamilies were clustered together (BPP = 1, BSP = 100). We further alignment and found the mtDNA sequence of P. custodis and M. euteles are highly similar (98.8%), cox1 was 98.6%, cob was 98.8%, rrnL was 99.5%, and rrnS was 99.8%. This suggests a possible error in the author’s uploaded sequences, as P. custodis and M. euteles sequences from GenBank may actually represent the same species. As shown in Figure 6, Amphipsyllinae was monophyletic (currently, with relatively poor species, it could affect the accuracy of phylogenetic estimation). Ceratophyllinae and Ctenophthalmidae are paraphyletic. Pulicidae and Pygiopsyllidae are located at the base of the phylogenetic tree, indicating that Pulicidae and Pygiopsyllidae are the earliest divergence groups among the selected 23 species. The divergence time tree (Figure 7) showed that Pygiopsyllidae and Pulicidae were the first groups to diverge (95.36 Mya). However, Wu (14) suggested that the family Ceratophyllidae should be placed above the family Pulicidae (which includes the previous family Tungidae), and additional data are needed to confirm this. Vermipsyllidae, Hystrichopsyllidae, and Ctenophthalmidae were clustered together (BPP = 0.81/96, BSP = 0.40/53), although the node had weak support, except for PCGRNA (BI), which showed strong node support, in line with previous findings (6466). This showed the taxonomic status of the superfamily Hystrichopsylloidea is in a state of confusion. Morphologically, the three families are quite different, but at the molecular level, they belong to the same evolutionary clade, which may be the reason why there are too few species, and it is hoped that more species will be added to resolve their taxonomic status in the future. Whether the superfamilies Vermipsylloidea and Hystrichopsylloidea are separated from the superfamily Ceratophylloidea to become two separate families remains to be investigated (14).

The divergence time of various lineages in Siphonaptera was estimated and showed that it comprises two sequentially derived lineages, of which Pulicidae and Pygiopsyllidae diverged earlier. The subfamily Amphipsyllinae is the earliest diverging of the 3 subfamilies (Ceratophyllinae, Leptopsyllinae, and Amphipsyllinae), which is also reflected in its higher nucleotide diversity index (Figure 7). Figure 7 shows that the common ancestor of the extant fleas diverged during the Cretaceous. The majority of intrordinal divergence into extant flea lineages occurred after the K-Pg boundary, consistent with Zhu et al. (1). However, this timeline is far from the Mesozoic fossil “dinosaur fleas” identified by Huang et al. in Northeast China, as well as from the pterosaur to mammal hosts. Two fleas (100-125Mya) from the Cretaceous period were reported by some scholars, but the specimens were lost, and these records could not be confirmed (67). Even the attribution of certain Mesozoic fossils to Siphonaptera provides some interesting scenarios of flea evolutionary history. For example, Rasnitsyn (2002) considers these three fossils (Strashila incredibilis, Saurophthirus longipes, and Tawinia australis) together as “pre-fleas” because of the prominent lower forehead and relatively short antennae (68). We did not support the aforementioned opinions. Our findings supported the study by Zhu et al. (1). First, dinosaur fleas are morphologically quite different from extant fleas, and the earliest definitive fossil specimen of a flea has so far been found in the Eocene (Palaeopsylla spp. in Baltic Amber) (4). Second, there were conflicts in geographical location (Northeast China belongs to Laurasia, and fleas originated in Gondwana). Finally, Zhu et al. (1) used 205 species of fleas from 16 families to prepare DNA sequence datasets of mitogenomes, nuclear protein coding, and ribosomal genes for constructing phylogenetic trees and divergence time trees. Combined with the ancestral host associations and geographical distribution, it is concluded that the earliest differentiation is Macropsyllidae (Late Cretaceous). In summary, we supported that the common ancestors of fleas diverged in the Cretaceous. We cannot rule out errors in divergence time estimation, because of the initial use of mitogenomes and the scarcity of exact fossils of fleas. Mitochondrial genomes evolve at a faster rate than nuclear genomes, making them more effective for studies at lower taxonomic ranks. However, they have limitations when applied to studies at higher taxonomic ranks. The rapid evolution rate may lead to long-branch attraction (LBA) artifacts, which would affect the accuracy of the phylogeny. The evolutionary history and diversification of the tribe can be better resolved by increasing species representation and geographic coverage.

5 Conclusion

In this study, we sequenced the mitogenome of F. diqingensis for the first time. The composition of mitogenomes in three species from Amphipsyllinae differed among species. The phylogenetic tree supported that the majority of the species in the ingroup are clustered together in the same family or genus. Ceratophyllinae and Ctenophthalmidea are paraphyletic groups, while Pulicidae and Ceratophyllidae are monophyletic. We concluded that P. custodis and M. euteles from GenBank are the same species by phylogenetic trees and sequence alignment, and supported the monophyly of Amphipsyllinae. Divergence time estimation showed that the common ancestor of fleas diverged in the Cretaceous. Amphipsyllinae begins to diverge at approximately 73.37–40.32 Mya. The majority of the species of the intraordinal divergence into extant lineages occurred after the K-Pg boundary. This study provides new insights into the evolutionary history and taxonomy of Siphonaptera.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/nuccore/PP083946.1/.

Ethics statement

The animal study was approved by All methods and procedures used in the capture rodent process were in accordance with the guidelines and regulations approved by the Animal Ethics Committees at Dali University. Approval ID is MECDU-201912–20. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

JP: Conceptualization, Data curation, Formal analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. XL: Formal analysis, Methodology, Software, Writing – review & editing. WD: Data curation, Funding acquisition, Methodology, Resources, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. We acknowledge funding support from the National Natural Science Foundation of China (NO. 32260152 to Wen-Ge Dong).

Acknowledgments

We thank Chen-Fu Zhao for his help in specimen collection.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fvets.2024.1494204/full#supplementary-material

References

1. Zhu, Q, Hastriter, MW, Whiting, MF, and Dittmar, K. Fleas (Siphonaptera) are cretaceous, and evolved with Theria. Mol Phylogenet Evol. (2015) 90:129–39. doi: 10.1016/j.ympev.2015.04.027

PubMed Abstract | Crossref Full Text | Google Scholar

2. Huang, D, Engel, MS, Cai, C, Wu, H, and Nel, A. Diverse transitional giant fleas from the Mesozoic era of China. Nature. (2012) 483:201–13. doi: 10.1038/nature10839

PubMed Abstract | Crossref Full Text | Google Scholar

3. Dampf, A. Palaeopsylla klebsiana n. sp: ein fossiler Floh aus dem baltischen. Bernstein: Teubner (1911).

Google Scholar

4. Peus, F. Über die beiden Bernstein-Flöhe (Insecta, Siphonaptera). Paläontol Z. (1968) 42:62–72. doi: 10.1007/BF02987128

Crossref Full Text | Google Scholar

5. Poinar, G. Fleas (Insecta: Siphonaptera) in Dominican amber. Med Sci Res. (1995):23.

Google Scholar

6. Beaucournu, JC, and Wunderlich, J. A third species of Palaeopsylla Wagner, 1903, from Baltic Amber (Siphonaptera: Ctenophthalmidae). Entomologische Zeitschrift. (2001) 111:296–8.

Google Scholar

7. Beaucournu, JC. Palaeopsylla groehni n. sp., quatrième espèce de puce connue de l'ambre de la Baltique (Siphonaptera, Ctenophthalmidae) (2003) 108:217–20. doi: 10.3406/bsef.2003.16957,

Crossref Full Text | Google Scholar

8. Gao, T, Shih, C, Rasnitsyn, AP, Xu, X, Wang, S, and Ren, D. New transitional fleas from China highlighting diversity of early cretaceous ectoparasitic insects. Curr Biol. (2013) 23:1261–6. doi: 10.1016/j.cub.2013.05.040

PubMed Abstract | Crossref Full Text | Google Scholar

9. Crowson, RA, and Hennig, WJSB. Die Stammesgeschichte der Insekten (1970) 19:393–6. doi: 10.2307/2412281,

Crossref Full Text | Google Scholar

10. Kristensen, NP. The phylogeny of hexapod “orders”. Critic Rev Recent Accounts. (1975) 13:1–44. doi: 10.1111/j.1439-0469.1975.tb00226.x

Crossref Full Text | Google Scholar

11. Chalwatzis, N, Hauf, J, Van De Peer, Y, Kinzelbach, R, and Zimmermann, FK. 18S ribosomal Rna genes of insects: primary structure of the genes and molecular phylogeny of the Holometabola. Ann Entomol Soc Am. (1996) 89:788–803. doi: 10.1093/aesa/89.6.788

Crossref Full Text | Google Scholar

12. Whiting, MF. Assembling the tree of life. Oxford: Oxford University Press (2004).

Google Scholar

13. Tihelka, E, Giacomelli, M, Huang, D, Pisani, D, Donoghue, P, and Cai, C. Fleas are parasitic scorpionflies. Palaeoentomology. (2020) 3:641–53. doi: 10.11646/palaeoentomology.3.6.16

Crossref Full Text | Google Scholar

14. Wu, H. Fauna Sinica Insecta Siphonaptera. Bejing: Science Press (2007).

Google Scholar

15. Lewis, RE. Résumé of the Siphonaptera (Insecta) of the world. J Med Entomol. (1998) 35:377–89. doi: 10.1093/jmedent/35.4.377

PubMed Abstract | Crossref Full Text | Google Scholar

16. Bitam, I, Dittmar, K, Parola, P, Whiting, MF, and Raoult, D. Fleas and flea-borne diseases. Int J Infect Dis. (2010) 14:e667–76. doi: 10.1016/j.ijid.2009.11.011

Crossref Full Text | Google Scholar

17. Hamzaoui, BE, Zurita, A, Cutillas, C, and Parola, P. Fleas and flea-borne diseases of North Africa. Acta Trop. (2020) 211:105627. doi: 10.1016/j.actatropica.2020.105627

PubMed Abstract | Crossref Full Text | Google Scholar

18. Wells, LE, and Elston, DM. What's eating you? Oriental rat flea (Xenopsylla cheopis). Cutis. (2020) 106:124–6. doi: 10.12788/cutis.0072

PubMed Abstract | Crossref Full Text | Google Scholar

19. Ioff, V. Zur Systematik der Flöhe aus der Unterfamilie Ceratophyllinae (1936) 9:428. doi: 10.1007/BF02119894,

Crossref Full Text | Google Scholar

20. Armstrong, KF, and Ball, SL. DNA barcodes for biosecurity: invasive species identification. Philos Trans R Soc Lond Ser B Biol Sci. (2005) 360:1813–23. doi: 10.1098/rstb.2005.1713

PubMed Abstract | Crossref Full Text | Google Scholar

21. Pakendorf, B, and Stoneking, M. Mitochondrial DNA and human evolution. Annu Rev Genomics Hum Genet. (2005) 6:165–83. doi: 10.1146/annurev.genom.6.080604.162249

Crossref Full Text | Google Scholar

22. Simon, C, Buckley, T, Frati, F, Stewart, J, and Beckenbach, A. Incorporating molecular evolution into phylogenetic analysis, and a new compilation of conserved polymerase chain reaction primers for animal mitochondrial DNA. Annu Rev Ecol Evol Syst. (2006) 37:545–79. doi: 10.1146/annurev.ecolsys.37.091305.110018

Crossref Full Text | Google Scholar

23. Cameron, SL, and Whiting, MF. Mitochondrial genomic comparisons of the subterranean termites from the genus Reticulitermes (Insecta: Isoptera: Rhinotermitidae). Genome. (2007) 50:188–202. doi: 10.1139/g06-148

PubMed Abstract | Crossref Full Text | Google Scholar

24. Jost, MC, and Shaw, KL. Phylogeny of Ensifera (Hexapoda: Orthoptera) using three ribosomal loci, with implications for the evolution of acoustic communication. Mol Phylogenet Evol. (2006) 38:510–30. doi: 10.1016/j.ympev.2005.10.004

PubMed Abstract | Crossref Full Text | Google Scholar

25. Clayton, DA. Replication of animal mitochondrial DNA. Cell. (1982) 28:693–705. doi: 10.1016/0092-8674(82)90049-6

Crossref Full Text | Google Scholar

26. Boyce, TM, Zwick, ME, and Aquadro, CF. Mitochondrial DNA in the bark weevils: size, structure and heteroplasmy. Genetics. (1989) 123:825–36. doi: 10.1093/genetics/123.4.825

PubMed Abstract | Crossref Full Text | Google Scholar

27. Meng, G, Li, Y, Yang, C, and Liu, S. MitoZ: a toolkit for animal mitochondrial genome assembly, annotation and visualization. Nucleic Acids Res. (2019) 47:e63. doi: 10.1093/nar/gkz173

PubMed Abstract | Crossref Full Text | Google Scholar

28. Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. ArXiv. (2013) 2:1303. doi: 10.48550/arXiv.1303.3997

Crossref Full Text | Google Scholar

29. Li, H, Handsaker, B, Wysoker, A, Fennell, T, Ruan, J, Homer, N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. (2009) 25:2078–9. doi: 10.1093/bioinformatics/btp352

PubMed Abstract | Crossref Full Text | Google Scholar

30. Kearse, M, Moir, R, Wilson, A, Stones-Havas, S, Cheung, M, Sturrock, S, et al. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. (2012) 28:1647–9. doi: 10.1093/bioinformatics/bts199

PubMed Abstract | Crossref Full Text | Google Scholar

31. Altschul, SF, Madden, TL, Schäffer, AA, Zhang, J, Zhang, Z, Miller, W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. (1997) 25:3389–402. doi: 10.1093/nar/25.17.3389

PubMed Abstract | Crossref Full Text | Google Scholar

32. Bernt, M, Donath, A, Jühling, F, Externbrink, F, Florentz, C, Fritzsch, G, et al. MITOS: improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. (2013) 69:313–9. doi: 10.1016/j.ympev.2012.08.023

PubMed Abstract | Crossref Full Text | Google Scholar

33. Chan, PP, Lin, BY, Mak, AJ, and Lowe, TM. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res. (2021) 49:9077–96. doi: 10.1093/nar/gkab688

PubMed Abstract | Crossref Full Text | Google Scholar

34. Laslett, D, and Canbäck, B. ARWEN: a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics. (2008) 24:172–5. doi: 10.1093/bioinformatics/btm573

PubMed Abstract | Crossref Full Text | Google Scholar

35. Thompson, JD, Gibson, TJ, and Higgins, DG. Multiple sequence alignment using ClustalW and ClustalX. Curr Protoc Bioinformatics. (2002). Chapter 2: Unit 2.3. doi: 10.1002/0471250953.bi0203s00

Crossref Full Text | Google Scholar

36. Kumar, S, Tamura, K, and Nei, M. MEGA: molecular evolutionary genetics analysis software for microcomputers. Comput Appl Biosci. (1994) 10:189–91. doi: 10.1093/bioinformatics/10.2.189

PubMed Abstract | Crossref Full Text | Google Scholar

37. Librado, P, and Rozas, J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. (2009) 25:1451–2. doi: 10.1093/bioinformatics/btp187

PubMed Abstract | Crossref Full Text | Google Scholar

38. Beckenbach, AT. Mitochondrial genome sequences of representatives of three families of scorpionflies (order Mecoptera) and evolution in a major duplication of coding sequence. Genome. (2011) 54:368–76. doi: 10.1139/g11-006

PubMed Abstract | Crossref Full Text | Google Scholar

39. Ronquist, F, and Huelsenbeck, JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. (2003) 19:1572–4. doi: 10.1093/bioinformatics/btg180

PubMed Abstract | Crossref Full Text | Google Scholar

40. Stamatakis, A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. (2006) 22:2688–90. doi: 10.1093/bioinformatics/btl446

PubMed Abstract | Crossref Full Text | Google Scholar

41. Katoh, K, Misawa, K, Kuma, K, and Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. (2002) 30:3059–66. doi: 10.1093/nar/gkf436

PubMed Abstract | Crossref Full Text | Google Scholar

42. Ranwez, V, Harispe, S, Delsuc, F, and Douzery, EJ. MACSE: multiple alignment of coding SEquences accounting for frameshifts and stop codons. PLoS One. (2011) 6:e22594. doi: 10.1371/journal.pone.0022594

PubMed Abstract | Crossref Full Text | Google Scholar

43. Talavera, G, and Castresana, J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. (2007) 56:564–77. doi: 10.1080/10635150701472164

PubMed Abstract | Crossref Full Text | Google Scholar

44. Kalyaanamoorthy, S, Minh, BQ, Wong, TKF, von Haeseler, A, and Jermiin, LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. (2017) 14:587–9. doi: 10.1038/nmeth.4285

PubMed Abstract | Crossref Full Text | Google Scholar

45. Nguyen, LT, Schmidt, HA, von Haeseler, A, and Minh, BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. (2015) 32:268–74. doi: 10.1093/molbev/msu300

PubMed Abstract | Crossref Full Text | Google Scholar

46. Rambaut, A. FigTree, a graphical viewer of phylogenetic trees. Institute of Evolutionary Biology University of Edinburgh (2009).

Google Scholar

47. Bouckaert, R, Vaughan, TG, Barido-Sottani, J, Duchêne, S, Fourment, M, Gavryushkina, A, et al. BEAST 2.5: an advanced software platform for Bayesian evolutionary analysis. PLoS Comput Biol. (2019) 15:e1006650. doi: 10.1371/journal.pcbi.1006650

PubMed Abstract | Crossref Full Text | Google Scholar

48. Lewis, RE, and Grimaldi, DA A pulicid flea in Miocene amber from the Dominican Republic (Insecta, Siphonaptera, Pulicidae). American museum novitates. (1997) no.3205.

Google Scholar

49. Xie, J, Chen, Y, Cai, G, Cai, R, Hu, Z, and Wang, H. Tree visualization by one Table (tvBOT): a web application for visualizing, modifying and annotating phylogenetic trees. Nucleic Acids Res. (2023) 51:W587–w592. doi: 10.1093/nar/gkad359

PubMed Abstract | Crossref Full Text | Google Scholar

50. Wright, F. The 'effective number of codons' used in a gene. Gene. (1990) 87:23–9. doi: 10.1016/0378-1119(90)90491-9

PubMed Abstract | Crossref Full Text | Google Scholar

51. Liu, J, Zhu, D, Ma, G, Liu, M, Wang, M, Jia, R, et al. Genome-wide analysis of the synonymous codon usage patterns in Riemerella anatipestifer. Int J Mol Sci. (2016) 17:17. doi: 10.3390/ijms17081304

PubMed Abstract | Crossref Full Text | Google Scholar

52. Yang, HJ, Yang, ZH, Ren, TG, and Dong, WG. The complete mitochondrial genome of Eulaelaps huzhuensis (Mesostigmata: Haemogamasidae). Exp Appl Acarol. (2023) 90:301–16. doi: 10.1007/s10493-023-00802-6

PubMed Abstract | Crossref Full Text | Google Scholar

53. Behura, SK, and Severson, DW. Codon usage bias: causative factors, quantification methods and genome-wide patterns: with emphasis on insect genomes. Biol Rev Camb Philos Soc. (2013) 88:49–61. doi: 10.1111/j.1469-185X.2012.00242.x

PubMed Abstract | Crossref Full Text | Google Scholar

54. Vicario, S, Moriyama, EN, and Powell, JR. Codon usage in twelve species of drosophila. BMC Evol Biol. (2007) 7:226. doi: 10.1186/1471-2148-7-226

PubMed Abstract | Crossref Full Text | Google Scholar

55. Behura, SK, and Severson, DW. Coadaptation of isoacceptor tRNA genes and codon usage bias for translation efficiency in Aedes aegypti and Anopheles gambiae. Insect Mol Biol. (2011) 20:177–87. doi: 10.1111/j.1365-2583.2010.01055.x

PubMed Abstract | Crossref Full Text | Google Scholar

56. Lal, D, Verma, M, Behura, SK, and Lal, R. Codon usage bias in phylum Actinobacteria: relevance to environmental adaptation and host pathogenicity. Res Microbiol. (2016) 167:669–77. doi: 10.1016/j.resmic.2016.06.003

PubMed Abstract | Crossref Full Text | Google Scholar

57. Plotkin, JB, and Kudla, G. Synonymous but not the same: the causes and consequences of codon bias. Nat Rev Genet. (2011) 12:32–42. doi: 10.1038/nrg2899

PubMed Abstract | Crossref Full Text | Google Scholar

58. Zhang, WJ. Codon analysis and its application in bioinformatics and evolutionary studies. Doctor. (2006)

Google Scholar

59. Arhondakis, S, Auletta, F, Torelli, G, and D'Onofrio, G. Base composition and expression level of human genes. Gene. (2004) 325:165–9. doi: 10.1016/j.gene.2003.10.009

Crossref Full Text | Google Scholar

60. Knight, RD, Freeland, SJ, and Landweber, LF. A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biol. (2001) 2:Research0010. doi: 10.1186/gb-2001-2-4-research0010

PubMed Abstract | Crossref Full Text | Google Scholar

61. Jia, W, and Higgs, PG. Codon usage in mitochondrial genomes: distinguishing context-dependent mutation from translational selection. Mol Biol Evol. (2008) 25:339–51. doi: 10.1093/molbev/msm259

PubMed Abstract | Crossref Full Text | Google Scholar

62. Chen, WH, Lu, G, Bork, P, Hu, S, and Lercher, MJ. Energy efficiency trade-offs drive nucleotide usage in transcribed regions. Nat Commun. (2016) 7:11334. doi: 10.1038/ncomms11334

PubMed Abstract | Crossref Full Text | Google Scholar

63. Bulmer, M. The selection-mutation-drift theory of synonymous codon usage. Genetics. (1991) 129:897–907. doi: 10.1093/genetics/129.3.897

PubMed Abstract | Crossref Full Text | Google Scholar

64. Zhang, Y, Fu, YT, Yao, C, Deng, YP, Nie, Y, and Liu, GH. Mitochondrial phylogenomics provides insights into the taxonomy and phylogeny of fleas. Parasit Vectors. (2022) 15:223. doi: 10.1186/s13071-022-05334-3

PubMed Abstract | Crossref Full Text | Google Scholar

65. Chen, B, Liu, YF, Lu, XY, Jiang, DD, Wang, X, Zhang, QF, et al. Complete mitochondrial genome of Ctenophthalmus quadratus and Stenischia humilis in China provides insights into fleas phylogeny. Front Vet Sci. (2023) 10:1255017. doi: 10.3389/fvets.2023.1255017

PubMed Abstract | Crossref Full Text | Google Scholar

66. Liu, Y, Chen, B, Lu, X, Jiang, D, Wang, T, Geng, L, et al. Complete mitogenomes characterization and phylogenetic analyses of Ceratophyllus anisus and Leptopsylla segnis. Front Vet Sci. (2023) 10:1218488. doi: 10.3389/fvets.2023.1218488

PubMed Abstract | Crossref Full Text | Google Scholar

67. Riek, EF. Lower cretaceous fleas. Nature. (1970) 227:746–7. doi: 10.1038/227746a0

PubMed Abstract | Crossref Full Text | Google Scholar

68. Whiting, M, Whiting, A, Hastriter, M, and Dittmar, K. A molecular phylogeny of fleas (Insecta: Siphonaptera): origins and host associations. Cladistics. (2008) 24:677–707. doi: 10.1111/j.1096-0031.2008.00211.x

Crossref Full Text | Google Scholar

Keywords: Amphipsyllinae, mitogenome, plague, phylogeny, divergence time

Citation: Pu J, Lin X and Dong W (2024) Phylogeny and divergence time estimation of the subfamily Amphipsyllinae based on the Frontopsylla diqingensis mitogenome. Front. Vet. Sci. 11:1494204. doi: 10.3389/fvets.2024.1494204

Received: 15 September 2024; Accepted: 22 November 2024;
Published: 11 December 2024.

Edited by:

Nicola Pugliese, University of Bari Aldo Moro, Italy

Reviewed by:

Hafiz Ishfaq Ahmad, University of Veterinary and Animal Sciences, Pakistan
Daniel Maximo Correa Alcantara, Fundação Oswaldo Cruz, Brazil

Copyright © 2024 Pu, Lin and Dong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wenge Dong, ZG9uZ3dlbmdlMjc0MEBzaW5hLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.