Background
Papillomaviruses (PVs) are small epitheliotropic viruses infecting mammals, reptiles, birds and fish. They are found in healthy skin and mucosa [
1], benign proliferative epithelial lesions, and malignant cancers [
2‐
4]. PVs virions are comprised of a naked capsid containing a circular, double-stranded DNA genome of approximately 8 kb organized into an upstream regulatory region (URR), an early gene region, and a late gene region [
5]. The URR contains regulatory sequences for initiation of viral replication, genome maintenance and regulation of gene expression. The early region contains up to seven open reading frames (ORFs) encoding regulatory proteins (E6, E7, E5, E1, E2 and E4, nested in E2). The late region contains two ORFs encoding the capsid proteins L1 and L2. According to recent analyses, the ancestral PV genome consisted of the
E1,
E2,
L2 and
L1 genes, whereas the PV oncogenes (
E6,
E7 and
E5) were acquired later during PV evolution [
6,
7]. Although the
E6 and
E7 oncogenes in PVs infecting mammals appear to have a common ancestor, several extant PV genome do lack either
E6 or
E7 [
6]
, suggesting repeated loss of these genes [
8]. PV classification is based on the nucleotide sequence similarities within
L1, being the most conserved gene. Sequence differences of more than 10% define a new PV type if the complete genome has been cloned and sequenced. Even though most PV types share less than 60% of
L1 nucleotide identity with PVs from other genera, their assignment to species and genera requires the analysis of phylogenetic position, genome organization, biology and pathophysiology[
9].
Papillomaviridae are divided into the
First- and
Secondpapillomavirinae subfamilies.
Firstpapillomavirinae consists of only one PV,
Sparus aurata Papillomavirus 1 (SaPV1), the only classified fish PV so far. SaPV1 is very divergent from other PVs, and has a unique genome organization containing only the URR,
E1,
E2, L2, and
L1 genes [
10], shared by other PVs genomes isolated from other fish species (GenBank accessions MH510267, MH616908, MH617143, and MH617579). The
Secondpapillomavirinae consist of 52 genera named after the Greek alphabet and variations thereof. Within this clade, genera can be grouped into crown-groups: four well-defined clades spanning Alpha-OmikronPVs, Beta-XiPVs, Lambda-MuPVs, Delta-ZetaPVs, an additional, ill-defined clade of PVs infecting other mammals, and a yet unclassified clade, consisting of PVs infecting birds and turtles [
6].
PVs infecting cetartiodactyls are plentiful, only second in number to PVs infecting primates. They do not constitute a monophyletic group, but are scattered instead into several crown-groups. PVs infecting ruminants belong within the
Delta-,
Xi-,
Epsilon-,
Dyoxi, Dyokappa-, Phi- and
DyolambdaPV genera within the Beta-XiPV and Delta-ZetaPV crown groups. In this manuscript we have focused on the description of a novel PV,
Capra hircus papillomavirus 2 (ChPV2), previously identified in teat-papillomas of a Damascus goat in Turkey [
12]. A previously described goat PV,
Capra hircus papillomavirus 1 (ChPV1) [
11] was classified as the only member of the
Phipapillomavirus, sister taxon of a RtPV1 infecting Timor deer, and closely related to
Xipapillomavirus. Here, we describe the genetic characterization and phylogenetic analysis of the novel goat PV ChPV2.
Discussion
Here we present the genomic and phylogenetic characterization of ChPV2 isolated from a teat wart of a Damascus goat from Turkey. Its genome shows the typical organization of PV genomes and contains seven putative genes E6, E7, E1, E2 E4 as well as L2 and L1. Splice site prediction suggested three different potential E1^E4 splice patterns, one out them encoding for a very small potential protein of only 33 aa that seems unlikely to be expressed. Further experimental evidence is needed to verify transcription and potential splicing and translation of E1^E4 mRNAs.
The ability of PVs to trigger cell proliferation is related to specific activities of the oncogenes the PV genome contains. In oncogenic human papillomaviruses (HPVs) the major oncoproteins are E6 and E7. E6 proteins from oncogenic HPVs interact with a series of host cell proteins through two zinc finger domains containing the CXXC motif [
28‐
32], that binds LXXLL motifs in their cellular counterparts. The paradigm of these interactions is the recruitment of cellular E3 ubiquitin ligase (E6AP) to target p53 to proteasomal degradation [
33,
34]. Again in oncogenic HPVs E6 a short C-terminal PDZ-binding motif (X-T/S-X-V/L) further facilitates interactions with proteins containing PDZ domains [
30,
31,
35‐
37], promoting for instance the induction of epithelial hyperplasia anchorage-independent growth and tumorigenic potential [
36,
38], but it is not necessary for maintenance of the PV genome [
39]. In the novel ChPV2, the E6 protein contains two zinc binding motifs but no PDZ binding motif, suggesting low transforming potential of this protein.
In oncogenic HPVs, E7 interacts with members of the retinoblastoma protein family, (pRB), through conserved regions that contain pRB binding motif (LXCXE) followed by a casein kinase II (CKII) phosphorylation site and a zinc binding domain [
40,
41]. This interaction leads to the dissociation of pRB and E2F transcription factor, thus promoting cell cycle progression that can eventually lead to uncontrolled cell growth and to the development of proliferative lesions. In ChPV2 E7 all components described to be necessary for effective binding of pRB, seem to be present, suggesting effective pRB binding. Whether the presence of these motifs contribute to proliferation of the ChPV2 associated teat warts needs further experimental evidence.
The presence and positions of regulatory elements within the ChPV2-URR and early genes differ from the previously described ChPV1 [
11] (Additional file
1: Table S5, Figure S2). The ChPV2-URR contains a TATA box upstream of the
E6 start codon. Upstream of the TATA box a potential E1BS was identified, that is accompanied by three E2BSs, of which one E2BS is located 25 bp upstream and two sites located 23 and 38 bp downstream of the E1BS. The equidistant proximity of the two E2BS flanking the E1BS, suggests a functional role for ChPV2 genome replication. Whether the third E2 binding site contributes to replication control or transcriptional regulation of the ChPV2 early promotor remains unclear and needs further experimental evidence. It can only be speculated, whether the fourth potential E2 binding site at the 3′ end of the L1 ORF is functionally relevant. Additionally, two imperfect E2BS were identified in close proximity towards each other within the
E7 ORF. Probably they represent an enhancer element of the ChPV2 late promoter as they are located closely upstream of a potential TATA box representing the core element of the ChPV2 late promoter. Because of one mismatch to these E2BS consensus sequence, it remains to be determined experimentally whether these elements are functional or bind E2 with less affinity than the perfect E2BS within the URR.
ChPV2 clusters within the
Beta-XiPV crown group, being confidently basal to
XiPV1.
XiPV1 have been isolated from papillomas of the upper alimentary canal [
42] and cutaneous papillomas of cattle from Great Britain [
43] cutaneous warts of cattle from Japan and squamous papilloma lesions on cattle teats from Japan [
44,
45] as well as cutaneaous papillomatous lesion of cows in Brazil [
46,
47]. Apart from a distinct geographical occurrence and different host species, ChPV2 shares its cutaneous localization and teat involvement with the closely related
XiPV1, further supporting its phylogenetic position. The nucleotide sequence of the ChPV2
L1 gene (Table
1) shows nucleotide identity to
L1 genes of other
XiPVs ranging between 58.3% and 71.8%. According to the conventions for naming and grouping of PVs, these findings supports the definition of ChPV2 as a new PV type as member of
XiPV1.
Interestingly, the only other known goat PV, ChPV1, isolated from healthy skin [
11], is not closely related to ChPV2. The position of the sister taxa ChPV1 and RtimPV1 is well supported and consistently basal to XiPVs. However,
L1 nucleotide identity values are borderline for the current standards, and the taxonomic relationship of these ChPV1 and RtimPV1 with respect to XiPVs has to be elucidated by the ICTV.
The fact that the ChPV2 infected goat shared space with cattle, opens the door to speculate about virus circulation between species. Although ChPV2 is closely related to bovine PVs, sequence similarities between ChPV2 and its closest relatives are probably too small to claim, that ChPV2 arose from an interspecies transmission event. We believe that, ChPV2 and/or closely related Xi PVs, that infect bovids, ovids and cervids might rather represent broad spectrum PVs, that could be able to infect different host species. Therefore, ChPV2 and closely related PVs might rather mimic the case of BPV1, which can infect cattle, sheep, several deer species, horses, zebras and tapirs. Currently, we do not know how prevalent this virus is among goats, or whether it is present in cattle alongside. Further studies are necessary to investigate the relative prevalence of the virus in the different species, as well as the efficiency of transmission within and between species to allow for differentiation between the broad spectrum PV and the interspecies transmission hypothesis.
Both the
E6 and
E7 oncogenes in PVs infecting mammals appear to have a common ancestor [
6]. It has been suggested previously that
E6 may have been lost two separate times within the
XiPV clade [
8]. Among the closest relatives of ChPV2 all PV genomes containing only
E7 cluster together, except for BPV12 that clusters within PVs containing both
E6 and
E7 (Fig.
2 and Additional file
1: Figure S7), supporting the repeated loss of
E6 hypothesis [
6,
8]. However, it is possible that the separate
E6 and
E7, and concatenated
E1E2L2L1 gene trees do not accurately describe the evolutionary history of these PVs. Nonetheless, we did not identify any rogue taxa for the constructed trees.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.