Cervical cancer is one of the most common female malignant tumor, and the incidence rate ranks first among malignant tumor of female reproductive system. According to the data of 2020 from World Health Organization (WHO), there were about 604 thousand new cases and 342 thousand deaths worldwide every year, which seriously threatened women's lives and health. Since professor Hausen of Germany found that Human papillomavirus (HPV) was closely related to the occurrence of cervical cancer in the late 1970s, HPV detection was gradually used for cervical cancer screening. HPVs are a heterogeneous group of nonenveloped, double-stranded, circular DNA viruses of 8000 nucleotides in size that infect the basal layer of the epithelium of skin (cutaneous HPV types) and mucous tissues (mucosal HPV types) [
1,
2]. A phylogenetic tree construction based on the nucleotide sequence of the L1 gene classified the HPV types into five genera – alpha, beta, gamma, mu and nu [
3,
4]. The Alpha papillomaviruses are further classified into high-risk (HR) types and low-risk (LR) types, according to their potential to cause anogenital cancer [
1,
3,
5]. Persistent infection with HR-HPV genotypes triggers the integration of HPV DNA into the host genome that eventually leads to the accumulation of chromosomal damages and genome destabilisation in the infected cells [
6‐
9]. The frequency of viral integration increases with the severity of cervical intraepithelial neoplasia (CIN) of stages I, II, III and the integrated form of the HPV DNA is found in the vast majority of cervical cancer cases [
9,
10]. The WHO published the latest screening and treatment guidelines for cervical precancerous lesions in 2021 and recommend the use of HPV DNA testing as a primary screening for cervical cancer [
11]. At present, polymerase chain reaction (PCR) technology is mainly used to detect HPV in clinic, but this method can not check the activity of HPV and the status of genome integration. According to the previous reports, the integration site of HPV is closely related to the occurrence, progression and recurrence of cancer. The main molecular mechanism may be the HPV insertion will affect the expression of upstream and downstream genes near the integration sites, the protein structure and the genomic structure’s instability [
12,
13]. In recent years, the liquid microarray enrichment technology combined with next-generation sequencing technology can detect the subtypes of HPV, load, virus integration, insertion sites, etc. However, the read length is short, the accuracy of this method may be affected, which may lead to the low total detection rate of integration sites [
9,
14]. Nanopore sequencing technology, as a new generation of sequencing technology in recent years, has the advantages of high throughput, ultra-long reading length, real-time sequencing, small and easy to carry instruments. It has obvious advantages for testing genome structure variation, virus insertion sites and insertion structure detection. It is widely used in many life sciences and clinical medical research.
Aminolevulinic acid photodynamic therapy (ALA-PDT) is a combination of medicine and machinery. After exogenous ALA is given, it is selectively absorbed by proliferating active cells. After a series of enzymatic reactions, a large number of photosensitive substances protoporphyrin IX is generated in mitochondria. After irradiation with a specific wavelength light source, reactive oxygen species such as singlet oxygen and oxygen free radicals are produced, which can selectively destroy the diseased tissue and achieve the purpose of treatment. In recent years, ALA-PDT has been used to treat cervical intraepithelial neoplasia (CIN). Studies have shown that the histological remission rate of ALA-PDT in the treatment of CIN I was 75%—86% in 3–6 months; after the treatment, the histological remission rate of CIN II reached 90% in 3–12 months [
15]. Our research also achieved good results in the treatment of cervical intraepithelial neoplasia and cervical HPV infection with ALA-PDT [
16]. In our study, nanopore sequencing technology was used to detect the HPV virus sequence of patients' samples before and after photodynamic therapy for cervical intraepithelial neoplasia. The proportion of HPV and the number of inserted human genome were counted and analyzed. Compare the HPV integration hotspots and related genes in samples at different stages before and after treatment of cervical lesions, and try to find the integration site spectrum of genes related to the progression and poor prognosis of cervical lesions. Then evaluate the feasibility of the detection methodology as an accurate companion diagnostic tool, and provide basic research data for prognosis evaluation and the adjustment of individual treatment plan.
Study design and ethics
Four patients with cervical intraepithelial neoplasia complicated with high-risk HPV infection who were admitted to the Department of Gynecology, Beijing You'an Hospital Affiliated to Capital Medical University from January 2021 to December 2021 were selected as the research objects. Two cases’ histopathology of cervical biopsy were CIN I while the other two were CIN II. The cases aged 30–52 years (Table
1). All the cases completed two stages of treatment [3 times of photodynamic therapy (3 PDT) and 6 times of photodynamic therapy (6 PDT)]. All patients were aware of this study and signed informed consent. This study was approved by the Ethics Committee of Beijing You'an Hospital jing you ji lun zi [2019]003.
Table 1
Patient enrollment information
B02 | 30 | Cervical exfoliated cell | Before treatment | 16 | negative | CIN I |
B02-1 | 30 | Cervical exfoliated cell | 3 PDT | negative | negative | N/A |
B02-2 | 30 | Cervical exfoliated cell | 6 PDT | negative | negative | cervicitis |
B15 | 33 | Cervical exfoliated cell | Before treatment | 52 | ASCUS | CIN II |
B15-1 | 33 | Cervical exfoliated cell | 3 PDT | negative | negative | N/A |
B15-2 | 33 | Cervical exfoliated cell | 6 PDT | negative | negative | CIN I |
B16 | 52 | Cervical exfoliated cell | Before treatment | 56、58、59 | LSIL | CIN I |
B16-1 | 52 | Cervical exfoliated cell | 3 PDT | 56、59 | negative | N/A |
B16-2 | 52 | Cervical exfoliated cell | 6 PDT | 56、59 | negative | cervicitis |
B19 | 30 | Cervical exfoliated cell | Before treatment | 52、58 | LSIL | CIN II |
B19-1 | 30 | Cervical exfoliated cell | 3 PDT | negative | negative | N/A |
B19-2 | 30 | Cervical exfoliated cell | 6 PDT | negative | negative | cervicitis |
The cervical exfoliated cell was collected with instructions by the disposable brush for cervix’s sampling kit (YIGUOREN biological technology, Hangzhou, China) and were put into 10 ml of preservation solution. Then 5 mL cervical exfoliated cells suspension was divided into a 1.5 mL centrifuge tube and centrifuged at 12,000–15000 rpm for 3 min. The supernatant was removed and the cell pellet was extracted and purified using TIANamp Micro DNA Kit (TIANGEN BIOTECH (BEIJING) DP316) according to the instructions. The extracted DNA was quantified using Qubit3.0, and the fragment length was inspected using Agarose gel electrophoresis. DNA samples with size > 5000 bp and more than 200 ng were further processed for library construction.
Library construction
150 ng total DNA were sheared with ultrasound according to the optimized interruption conditions (M220 Focused-Ultrasonicator was used): Peak power: 25, Duty factor: 10, Cycles/burst: 500, Avg power: 2.5. The sheared DNA was further end-repaired and A-tailed using the NEBNext® Ultra IIEnd-prep reaction buffer and enzyme mix, and 1 × VAHTS® DNA Clean Beads (N411-02; Nanjing Vazyme Biotech Co Ltd. China) were used to remove short fragments to ensure library size larger than 500 bp, then PCR Barcoding Kit (SQK-PBK004; Oxford Nanopore Technologies, UK) was used for library construction according to instruction. The generated library was quantified with the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific, MA, United States) using the Qubit 3.0 fluorometer (Invitrogen, CA, United States), the size distribution of library was checked using the Agilent 2100 TapeStation (Agilent Technologies, CA, United States). The typical size range of the library was from 0.6 to 9Kbp.
HPV sequences enrichment and nanopore sequencing
The probe set of HPVs was provided by BOKE bioscience (BOKE bioscience, Beijing, China). 500 ng libraries were pre-heated with Human cot-1(Boke bioscience) and blocker (oligo DNA by Synbio Technologies, Suzhou, China), then hybridized with HPV probes using 2 × Hybridization Buffer and Hybridization Enhancer. The hybridization mix was incubated at 95℃ for 10 min, then at 65℃ for 16 h. Dynabeads™ M-270 Streptavidin (Thermo Fisher Scientific) were used for pull down of target sequences and washing steps, then the eluted fragments containing the targeted sequences were re-amplified by 30cycles to generate sequencing library. The enriched library was quantified with the Qubit dsDNA HS Assay Kit using a Qubit 3.0 fluorometer (Thermo Fisher Scientific), and library quality was checked using Agilent 2100 TapeStation (Agilent Technologies, CA, United States). 50 ng enriched each library was pooled and attached with rapid sequencing adapters, then sequencing was performed using PromethION (FLO-PRO002) for 64 h runtime to ensure > 0.3 Gb sequence data was generated for each library.
Data analysis
The base calling was performed using the Guppy, then the clean data was processed using Nano pack that conducted length filtering on pass reads with Q-value ≥ 7 and sequence with read-length less than 500 bp were removed. The clean reads were aligned against human genome reference (hg 19) and HPV refseq databases using a in-house bioinformatic pipeline to obtain HPV sequences and viral integration sites [
17].
Functional enrichment analysis
The reads number that detected in 12 clinical samples in this study was made into the gene expression matrix, and it was divided into two groups and named BEFORE and AFTER group. Ontology gene sets dataset of MsigDB [
18,
19] was used for enrichment, the other parameters are enriched and analyzed according to the default parameter settings. The gene sets with
P value ≤ 0.05 were regarded as the significantly enriched gene sets, and the enrichment result of the top 20 were analyzed.
Discussion
Based on the analysis of HPV abundance, profiling changes of HPV integration sites and the gene set related to cervical cancer progression after enrichment, we developed a nanopore sequencing based assay as companion diagnostic tools to conduct individualized treatment. As HPV abundances increased, HPV integration sites were found increasing correspondingly, and more genes related to cervical cancer were also observed. The advantage of this assay over traditional qPCR tests, was not only better sensitivity (we can identify enriched viral reads when viral load below the LOD of qPCR) to avoid misdiagnosis, but also quantitatively detection of viral abundance and integration sites at once, moreover, with enriched reads over viral genome, we could more accurately identify viral genotype, sub-genotypes or even drug-resistant variations.
The HPV genome is divided into three regions: (i) the long control region (LCR) which contains binding sites of viral regulatory proteins (E1 and E2) and cellular transcription factors, (ii) the early region (E) that encodes for the E1, E2, E4, E5, E6 and E7 genes and (iii) the late region (L) that encodes for the L1 and L2 genes [
6,
20,
21]. Over the last few decades, the strong tumourigenic capacity of HPV16 DNA triggered the extensive sequence analysis of the viral genome, several nucleotide changes have been associated with immune escape, viral persistence and cervical cancer development [
22,
23]. HPV16 is a strong carcinogenic factor, because it is detected in more than 50% of cervical cancer cases. The tumourigenic hallmark of HPV16 DNA lies mainly on the function of E6 and E7 oncoproteins and numerous analyses have proved that mutations particularly within E6 increase the oncogenicity of viral DNA [
24‐
26]. The T350G (L83V) was a nucleotide variation of HPV16 DNA on the E6 gene, implicated in the development of severe dysplasia in various geographic populations. In vitro experiments demonstrated that the coexistence of nucleotide changes G145T (Q14H), C335T (H78Y) along with T350G (L83V) considerably augment the tumourigenic capacity of the viral oncoprotein [
27,
28]. To date, no nucleotide variations have been involved in the development of cervical disease concerning E4 gene. Conversely, the nucleotide change A4042G/T (I65V) in the E5 oncogene seems to be mainly identified in specimens diagnosed with advanced cervical dysplasia [
29‐
31]. The LCR is regarded as the most polymorphic region of viral DNA, with numerous novel nucleotide variations to be continually reported in diverse research studies.
HPV infects the host cells and usually exists in a free state, HPV latent in the basal cell layer, where its DNA moves into the nucleus of infected cells in an independent exogenous chromosomal state. Subsequently, HPV integrates into the host chromosome, leading to genomic instability and tumorigenesis of the host. Upon integration the circular episomal HPV DNA is converted into a linear structure and inserted into the host chromosome. The disruption of viral genome can take place within the early region of viral genome (E1 and E2 genes) [
32‐
34], or within the late region (L1 and L2 genes) [
35,
36]. The most frequent site of disruption within the E2 ORF is located in the portion that encompasses the hinge region of the E2 protein [
34,
35,
37]. In contrast, disruptions within E1 gene have been found to be more frequently located either at the 5′ end [
32,
38], or at the 3′ end of the E1 gene [
36].
HPV is often present in cervical lesion cells with a mixed infection state, for instance, 2 different genotypes of HPV were found in patient B19, with HPV58 in the integrated state and HPV52 in a free state. Integration of HPV into the host chromosome is a key factor in cervical carcinogenesis. Genes related to cervical cancer progression was found by enrichment analysis before treatment, including CTNNB1, CCR7, AKT1, SLIT2, CDK6, etc. Human SAIL4 (sal-like 4), the new discovered proto oncogenes encode transcription factors that acted at the level of stem cells, SAIL4 protein can recognize and bind to CTNNB1 promoter region and trans‐activating CTNNB1 for accelerated expression β- Catenin, which promoted cell proliferation and the formation of tumor cervical cancer cells [
39]. Invasion and metastasis of tumor cells through the bloodstream and lymphatic vessels were key steps in the progression of cervical cancer, CCR7 was implicated in mediating lymphocyte trafficking and spreading to lymph nodes, expression of CCR7, CXCR4, VEGF-C and VEGF-D may have a synergistic effect on the malignant development of cervical cancer and lymph node metastasis [
40]. Akt1 and p-Akt1 were key elements of PI3K/Akt signaling pathways that regulate cellular processes, including proliferation, differentiation, migration and survival, the HK2 (the member of hexokinases, Associated with malignant tumor growth and distant metastasis) and AKT1 (p-AKT1) may regulated in the same networks in cervical cancer cells and synergistically promoted malignant growth and distant metastasis during the development of cervical cancer [
41]. It had been shown that SLIT2 expression levels were negatively correlated with the incidence of cervical cancer in patients treated with postoperative chemotherapy (CT) or radiotherapy (RT), indicating that SLIT2 may be a potential predictive biomarker [
42]. Cyclin-dependent kinase6 (CDK6) was a crucial regulatory cancer-related gene within the cell cycle and tumorigenesis, the changes in the level of CDK6 expression may impact on the progression of cervical precancerous lesions in the female [
43]. HPV integration frequency increases with severity of cervical precancer, and viral integrations are present in the majority of cervical cancers. In this study, patients B02 and B16, both had a pathological grade of CIN 1, and had a lower pathological grade than patients B15 and B19 with CIN 2, patients B02 and B16 had lower viral abundance and significantly fewer integration sites than the latter.
The principle of photodynamic therapy is to selectively kill actively proliferating cells. The more severe of cervical lesion, the more actively proliferating of its cells, with a greater number of HPV insertion sites and a higher HPV abundance. After photodynamic therapy, the actively proliferating cervical cells are killed and the degree of cervical lesions is significantly reduced. The integrated HPV viruses were cleared, so the HPV abundance and the number of integration sites decreased significantly in patients B02 and B19. However, in patients B15 and B16, we found that although their cervical pathological grade decreased after photodynamic therapy, their HPV abundance and the number of integration sites did not decrease accordingly, considering that the persistent HPV infection lead to new HPV integration. Especially for patient B16, the HPV59 abundance and the number of integration sites increased significantly after 3 PDT, the possible reason for this being the ineffectiveness of PDT and predicting the persistent infection of HPV and a possible exacerbation of the cervical lesions. Subsequent follow-up results also confirmed the presence of persistent infection of HPV56 and HPV59, while the cervical lesions developed from cervicitis to LSIL. Its possible explanation was the production of new HPV integration. Nanopore technology can predict the direction of disease progression earlier than TCT and pathology. Regrettably, patient B16 refused any further HPV integration site testing and pathology testing. We can still identify some genes related to cervical cancer progression from the samples of the two patients after treatment, including ATF3, GPX3, IL10, IL6, RAD51AP1, MGMT, WWOX. Activated transcription factor 3 (ATF3) was a stress-induced transcription repressor, it had been shown that ATF3 overexpression enhanced paclitaxel apoptosis in cervical cancer HeLa cells in part by stimulating TAp73, thus, it can act as a tumor suppress factor [
44]. Glutathione peroxidase 3 (GPX3) was the member of the glutathione peroxidase family of selenoproteins, GPX3 promoter methylation was one of the main causes of downregulated GPX gene expression in cervical cancer and acted as a predictive biomarker for lymph node metastasis and prognosis in cervical cancer [
45]. In cervical cancer tissues, the expression levels of IL10 and Ki-67 increased accordingly when the HPV infection rate increased, so the detection of IL10, KI67 expression and HPV infection rate can provide reference for the diagnosis of cervical cancer [
46]. Thrombocythemia, high PLR (platelet to lymphocyte ratio) and IL-6 overexpression correlated with cervical cancer progression and metastasis, they may play an important role as molecular markers of survival and may be considered as potential predictors of cervical cancer prognosis [
47]. RAD51AP1 was the main BRD4 target gene participating in radiosensitivity, and high expression of BRD4 was found to be associated with poorer prognosis and radioresistance. JQ1 (a BRD4 inhibitor) sensitized cervical cancer to radiation therapy by suppressing RAD51AP1 transcription [
48]. Studies had shown that 5-Aza-dC (DNA methyltransferase inhibitor) and X-ray radiation can inhibit expression of p16 (cyclin-dependent kinase inhibitor 2A, CDKN2A) and MGMT in cervical cancer cells, and down regulating the expression of p16 and MGMT can promote the anti-proliferation and treatment of cervical cancer cells, this discovery may provide a new treatment and a new method for cervical cancer [
49]. It had been shown that WWOX is under-expressed in cervical cancer tissues and cell lines, and the expression of WWOX significantly decreases or disappears with the development of cervical cancer, proved that the reduced expression of WWOX is strongly associated with the development of cervical cancer, regulating the expression of WWOX may be an effective and novel method to treat cervical cancer [
50]. Therefore, these two patients are still at high risk of relapse, even the clinical remission were observed. The prognosis of cervical lesions cannot be determined by pathological diagnosis alone, but also by integrating information on HPV abundance and integration profiling changes, especially those HPV integrated genes/networks related to carcinogenesis.
In this study, we developed a nanopore-sequencing based assay that provides information on HPV infection, genotype and integration all at once for patients with cervical lesions along the course of photodynamic therapy. The technique advantages of viral sequence enrichment combined with nanopore long-read sequencing contributed to more accurate prognosis determination as well as the dynamic monitoring of treatment accompaniment for a better evaluation of photodynamic therapy. That makes it a promising companion diagnostic tools and favorable supplement to traditional pathological examinations.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.