Introduction

Poultry meat production is increasing globally day by day as it is easily manageable animal protein source for human consumption compared to others. However, viral diseases incurs heavy economic losses to the poultry industry. Among the different viruses infecting birds, astroviruses are small round viruses, characterized based on its morphology (Caul and Appleton 1982). Astrovirus was first observed in human during 1975 in faeces of the infant suffering from gastroenteritis (Madeley and Cosgrove 1975). They are broadly categorized into two genera, Mammoastrovirus infecting mammals and Aviastrovirus infecting avian species. Both of these genera belongs to the family Astroviridae. Astroviruses can infect variety of host including human ((Finkbeiner et al. 2008; Madeley and Cosgrove 1975), cattle (Bouzalas et al. 2014; Li et al. 2013; Schlottau et al. 2016), sheep (Jonassen et al. 2003; Reuter et al. 2012), cat (Hoshino et al. 1981; Lau et al. 2013), dog (Martella et al. 2012; Takano et al. 2015) and deer (Smits et al. 2010).

Mammoastrovirus are mostly associated with gastroenteritis in the host. However, Aviastrovirus can cause different diseases in the different host. The members of genus Aviastrovirus mainly infects turkey, chicken and duck. Turkey Astrovirus (TAstV) causes Poultry Enteritis Mortality Syndrome (PEMS) or Poultry Enteritis Syndrome (PES) (McNulty et al. 1980; Mor et al. 2011) while in duck, astroviruses (DAstV) are associated with hepatitis (Asplin 1965; Gough et al. 1984, 1985). In chickens, two astrovirus species avian nephritis virus (ANV) (Imada et al. 2000; Shirai et al. 1991) and Chicken Astrovirus (CAstV) (Baxendale and Mebatsion 2004) have been reported. Initially CAstV was accepted as enterovirus causing growth retardation (Schultz-Cherry et al. 2001; Todd et al. 2009). Later on it was also found to be associated with gout (Bulbule et al. 2013) and hatchability problem (Smyth et al. 2013) in broiler chickens. Recently CAstV has been linked with ‘white chicks’, a disease characterized by weakness and white plumage of hatched chicks (Smyth et al. 2013) and leading to increased mortality in chicks and embryo has also been reported in Poland (Sajewicz-Krukowska et al. 2016) and Brazil (Nunez et al. 2016).

Chicken astroviruses are non-enveloped, 25–35 nm in diameter and contain non-segmented positive sense ssRNA genome comprised of 6.8 to 7.9 kb in length (Matsui and Greenberg 2001; Mendez et al. 2007). Whole genome sequencing and other studies have revealed that the basic structure and molecular mechanism is almost similar for all the astroviruses sequenced till date (Koci et al. 2000). Initially there was no authentic diagnostic method available for astrovirus which relied mostly on electron microscopy (Madeley and Cosgrove 1975; McNulty et al. 1980) or immunoassay (Baxendale and Mebatsion 2004). However, advances in the molecular biology have led to the development of some easy techniques for detection of viruses. Astroviruses can also be diagnosed using RT-PCR (Pantin-Jackwood et al. 2008; Smyth et al. 2009; Todd et al. 2009). Lee et al. (2013) suggested that a recombinant capsid can be used for diagnosis and vaccination. Till now the genome of only three chicken astroviruses CAstV/4175 (directly submitted to NCBI), CAstV/GA2011 (Kang et al. 2012) and CAstV/Poland/G059/2014 (Sajewicz-Krukowska and Domanska-Blicharz 2016) have been reported. In this study, we sequenced and characterized whole genome of chicken astrovirus isolated from infected broiler chicks from western part of India.

Materials and methods

History and sample collection

A broiler breeder farm at Anand, Gujarat, India was facing the problem of visceral gout in the Cobb-400 commercial chicks from last three batches of the parents. The outbreaks were noticed only in the initial hatches of the parents aging 28 to 32 weeks of age. These parents were vaccinated with infectious bronchitis nephropathic vaccine strains. The fertility and hatchabilities were normal but the chicks started showing lameness with spiking mortality from 4th day onward. Mortality continued for 5–7 days and ranged from 5 to 25%. Hatches falling during winter season i.e. November to February had high mortality. Dead chicks showed increased amount of abdominal fluid, pale and greyish kidneys with dilated tubules filled with urates. Chalky white deposits of urates were found on serosal surfaces of pericardium, liver capsule, air sacs, joint capsules and mucosal surfaces of proventriculus, trachea etc. Affected chick sample was submitted to Hester Biosciences limited for diagnosis of infection during the period of February 2016. The spleen, kidney and lungs tissue samples from the freshly dead birds were collected and triturated in PBS to make a 10% solution, then passed through the 0.22 μm syringe filter and inoculated in 10 day embryonated SPF eggs through allantoic cavity route. After three blind passages, the embryos started dying 5 to 6 days post inoculation and showed haemorrhagic lesion on the body surface. The allantoic fluid from the dead embryo was collected and used for total RNA isolation for subsequent analysis by whole genome sequencing.

Library preparation and sequencing

Total RNA was extracted using TRIzol reagent (Invitrogen Carlsbad, CA, USA) and treated with RNase free DNase I (Qiagen, Hilden, Germany) to remove any DNA contamination. Total RNA thus obtained was subjected to RNAseq library preparation as per the Ion total RNAseq kit v2 (Life technologies, Carlsbad, CA, USA). The sequencing was carried on Ion torrent PGM using 316 chip (Life technologies, Carlsbad, CA, USA).

De novo assembly, gene prediction and annotation

The sequencing reads generated from the sequencer were subjected to mapping with Gallus gallus genome (WGS: INSDC: AADN00000000.4) to remove the host sequences. Remaining sequences were subjected to quality filtering Q > 25 using PRINSEQ v0.20.4 and good quality sequences were de novo assembled using GS de novo assembler. The assembled genome sequence was searched for BLASTN similarity against nr/nt database. The genome sequence was further analysed for prediction of putative open reading frames (ORF) by ORF finder tool (Stothard 2000) and manual curation for the analysis of ribosomal frameshift signal (RFS) as reported for other astroviruses (Koci et al. 2000; Sajewicz-Krukowska and Domanska-Blicharz 2016). The prediction of stem loop structure of RFS was performed by RNAfold web server (http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi). Non-coding RNA sequences were inferred using similarity search against Rfam database (Nawrocki et al. 2015).

Sequence comparison and phylogenetic analysis

A nearly complete genome sequences of 11 astroviruses (Table 1) were downloaded from the NCBI and predicted for presence of ORFs using ORFfinder tool and manual curation of RFS start site. Multiple sequence alignment was performed using Clustal Omega webserver (http://www.ebi.ac.uk/Tools/msa/clustalo/) to analyse the percent identity with other genomes at nucleotide and amino acids level. Phylogenetic analysis of genomes and predicted proteins were performed in MEGA6 (Tamura et al. 2013) after building alignment by clustalw algorithm and subsequent tree generation using neighbour joining method (Saitou and Nei 1987) with 1000 bootstraps replicates. Nucleotide sequences of ORF2 encoding capsid protein of reported Indian isolates were retrieved from NCBI nr database and analysed for nucleotide and amino acids similarity and phylogeny as described earlier.

Table 1 Sequence comparison of chicken astrovirus isolate CAstV/India/Anand/2016 with other full length avian astrovirus genomes and their predicted protein sequences

Linear antigen epitope prediction and comparative analysis with other chicken astroviruses

We used SVMTriP (Yao et al. 2012) which predict linear antigen epitope based on Support Vector Machine to integrate Tri-Peptide Similarity and Propensity. Capsid protein sequence was used for epitope prediction. We compared epitopes of three CAstV isolates from India [CAstV/INDIA/ANAND/2016, VRDC|CAstV|NZ|VHINP-4 (Accession No. AGB58310.1) South region (Accession No. AIC32814.1)], two from USA (CAstV/GA2011 and CAstV/4175), one from UK (accession No.AFK92952.1) and one from Poland (CAstV/Poland/G059/2014).

Results

Identification of causative agent of visceral gout in broiler chickens

Allantoic fluid collected from SPF embryo inoculated for three blind passages of field sample was subjected to total RNA isolation and next generation sequencing using ion torrent platform resulted in a total of 108,109 reads with average read length of 180 bases. After removing the host specific and low quality reads, a total of 14,121 reads were used for assembly. The genome was assembled into a consensus length of 7513 bp which was identified as Chicken astrovirus upon nucleotide blast analysis. The full length genome sequence was deposited to the GenBank under the accession number KY038163. There was an untranslated region (UTR) of 13 bp at 5′ end and of 298 bp at 3′ end with a poly-A tail stretch of 19 nucleotides. The genome was composed of A (30%, 2307 nt), T (29% 2034 nt), G (23%, 1769 nt) and C (18% 1403 nt) with a GC content of 42.2%.

Genome sequence analysis of identified chicken astrovirus isolate for the presence of coding and non-coding features

The genome encoded three ORFs (ORF1a, ORF1b and ORF2) with each ORF encoding for single protein coding gene and a partial overlap of ORF1b with ORF1a (Fig. 1a). Manual curation of the ribosomal frameshift signal (position nt 3415) revealed presence of ATG start codon preceding the slippery heptameric sequence (AAAAAAC) (Fig. 1b) followed by sequences forming the stem loop structure as predicted by RNAfold analysis (Fig. 1c). Among the analysed genomes, all four chicken astroviruses and duck astrovirus SL5 isolate were found to possess their own ATG start codon for ORF1b whereas it was found absent in other astroviruses (Fig. 1b). Analysis of 3′-UTR by Rfam analysis revealed presence of two non-coding RNAs similar to “s2m” RNA family (Accession number: RF00164) at positions 7192–7234 (e value: 1.5E−14) and 7287–7329 (e value: 8.1E−14).

Fig. 1
figure 1

Genome organization of chicken astrovirus isolate CAstV/India/Anand/2016 (a) and its sequence alignment of ribosomal frameshift signal (RFS) of ORF1b encoding RdRp (b) and predicted stem loop structure of RFS (c)

Similarity of identified chicken astrovirus isolate with known avian astrovirus isolates

Analysis of nucleotide similarity with other full length astroviruses genomes (Table 1) revealed highest identity of 86.94% with CAstV/GA2011 followed by 84.76% with CAstV/4175 and 74.48% with CAstV/Poland/G059/2014 among the chicken astroviruses. The genome showed 54.47–57.55%, 53.94–54.20% and 46.34–47.57% similarity with duck astroviruses, turkey astroviruses and avian nephritis viruses, respectively. The ORF2 coding sequence showed 98.64–99.32% identity with the reported ORF2 sequence of the Indian isolates (Table 2).

Table 2 Sequence comparison of capsid protein of chicken astrovirus isolate CAstV/India/Anand/2016 with other reported Indian astrovirus isolates

Analysis of amino acids sequence similarity of CAstV/INDIA/ANAND/2016 with other avian astroviruses suggested 92.98–96.31% similarity with ORF1a encoded serine protease, 87.26–97.69% with ORF1b encoded RdRp and 41.18–84.67% with ORF2 encoded capsid protein of chicken astroviruses (Table 1). Among the three proteins, RdRp showed highest similarity of 53.53–71.95%, followed by serine protease (30.01–58.17%) and capsid protein (28.62–38.33%) with other avian astroviruses. Among the Indian chicken astrovirus isolates, highly variable capsid protein showed 98.64–99.32% sequence identity at nucleotide level and 97.74–98.69% sequence identity at amino acids level with CAstV/INDIA/ANAND/2016 isolate (Table 2).

Phylogenetic analysis of identified chicken astrovirus isolate

Phylogenetic analysis of the astrovirus genomes suggested formation of separate cluster of chicken astroviruses and placed CAstV/INDIA/ANAND/2016 nearest to the CAstV/4175 isolate (Fig. 2). Similarly, phylogeny based on amino acids sequence of serine protease (Fig. 3a), RdRp (Fig. 3b) and capsid protein (Fig. 3c) showed close clustering among the chicken astroviruses except capsid protein of CAstV/Poland/G059/2014 isolate which was clustered with the DAstV/SL5 isolate. Among the Indian isolates, ORF2 nucleotide sequence of the CAstV/INDIA/ANAND/2016 placed nearest to the VRDC/CAstV/NZ/VHINP-4 isolate (Fig. 4a) whereas based on the amino acid sequence, it was placed nearest to the VRDC/CAstV/NZ/VHINP-7 isolate (Fig. 4b).

Fig. 2
figure 2

Phylogenetic relatedness of chicken astrovirus isolate CAstV/India/Anand/2016 based on whole genome sequence using neighbour joining method with 1000 bootstraps

Fig. 3
figure 3

Phylogenetic relatedness of chicken astrovirus isolate CAstV/India/Anand/2016 using non-structural poly protein (a), RdRp (b) and capsid protein (c) based on neighbour-joining method with 1000 bootstraps replicates

Fig. 4
figure 4

Phylogenetic relatedness of chicken astrovirus isolate CAstV/India/Anand/2016 ORF2 coding sequences (a) and ORF2 encoded capsid protein (b) with reported Indian isolates based on neighbour-joining method with 1000 bootstraps replicates

B-cell epitope analysis of capsid structural protein of identified chicken astrovirus isolate

A total of 9–10 epitopes were predicted using SVMTriP using the capsid protein sequence of the astroviruses. Epitope analysis revealed two unique epitopes in case of the CAstV/INDIA/ANAND/2016. Epitopes present at the positions 151–170 and 176–195 were common to all of the viruses analysed. Based on the epitopes predicted, CAstV/Poland/G059/2014 was found to be unique as it has not shared any epitopes with other viruses. Comparison of the three Indian CAstVs revealed that seven epitopes were common to all the three and a epitope 649–668 was differing by one amino acid substitution in the CAstV/INDIA/ANAND/2016 (Supplementary table 1).

Discussion

The poultry viral diseases have great economic impact on poultry industries worldwide as it leads to high mortality. Chicken astrovirus causes severe disease especially in young chickens (Bulbule et al. 2013; Li et al. 2013; Nunez et al. 2016; Schultz-Cherry et al. 2001; Smyth et al. 2013; Todd et al. 2009). Though culturing methods has been described for astroviruses (Baxendale and Mebatsion 2004; Nunez et al. 2015), the isolation of virus is somewhat difficult due to its poor growth in the culture (Smyth et al. 2010). Next generation sequencing technology is sophisticated as there is no need to isolate or culture the organism. Hence in the present study we directly isolated RNA from allantoic fluid of chicken embryo inoculated with clinical sample and sequenced on Ion torrent PGM platform.

The viral genome of CAstV/INDIA/ANAND/2016 was assembled into 7513 bp which is comparable with the size of other published astrovirus genomes (Chen et al. 2012; Finkbeiner et al. 2008; Sajewicz-Krukowska and Domanska-Blicharz 2016; Strain et al. 2008). The genome showed presence of three ORFs encoding for serine protease, RdRp and capsid protein as reported for other astroviruses. The RdRp of most astroviruses do not have its own start codon. Kang et al. (2012) reported that RdRp of chicken astrovirus GA2011 has its own start codon. We observed presence of ATG start codon for RdRp among all the reported chicken astroviruses and DAstV/SL5 isolate whereas other duck astrovirus and avian nephritis viruses were found to lack ATG start codon. The RNA family analysis by Rfam suggested presence of two motifs matching to corona virus stem loop II (s2m) motif in the 3′-UTR as reported for other astroviruses (Jonassen et al. 1998; Sajewicz-Krukowska and Domanska-Blicharz 2016). Although, exact function is still not uncovered, the presence of these “s2m” motifs is believed to influence gene expression through RNA-interference mechanism (Tengs et al. 2013)

Based on the nucleotide similarity, the virus was found to be closest to the chicken astrovirus isolate GA2011 (Kang et al. 2012) followed by CAstV/4175 (direct submission) and CAstV/Poland/G059/2014 (Sajewicz-Krukowska and Domanska-Blicharz 2016) among the chicken astroviruses. At amino acids level, serine protease and capsid protein of the CAstV/INDIA/ANAND/2016 virus showed highest identity with chicken astrovirus isolate GA2011 followed by CAstV/4175 and CAstV/Poland/G059/2014, whereas RdRp showed maximum identity with the chicken astrovirus isolate GA2011 followed by CAstV/Poland/G059/2014 and CAstV/4175 isolates. Capsid protein of the CAstV/INDIA/ANAND/2016 isolate showed only 41.18% identity with the CAstV/Poland/G059/2014 isolate. Sajewicz-Krukowska and Domanska-Blicharz (2016) suggested that this limited identity of capsid protein of the CAstV/Poland/G059/2014 isolate with other chicken astrovirus capsid protein but higher identity with the duck astroviruses may be due a recombination event and shared ancestors with the duck astroviruses. Among the avian astroviruses, chicken astroviruses were found to share higher identity with the duck astroviruses (Chen et al. 2012; Fu et al. 2009; Liu et al. 2014) as compared to the turkey astroviruses (Koci et al. 2000; Strain et al. 2008) and the avian nephritis virus isolates (Imada et al. 2000; Zhao et al. 2011a, b). We next analysed the sequence similarity of capsid structural protein of the CAstV/INDIA/ANAND/2016 with the capsid protein coding sequence and amino acids sequence of reported sequence of the Indian astrovirus isolates which revealed about 98–99% sequence identity at nucleotide level and about 97–98% sequence identity at amino acids level suggesting limited structural divergence and their common origin in the Indian isolates reported till date.

Phylogenetic analysis of the genome sequences as well as the protein sequences showed clustering of the CAstV/INDIA/ANAND/2016 nearest to that of CastV/4175 and CAstV/GA2011 and all four chicken astrovirus formed separate cluster except capsid protein of the CAstV/Poland/G059/2014 isolate which was clustered along with the duck astroviruses. The clustering of CAstV/Poland/G059/2014 with the duck astrovirus isolate SL5 suggest possible recombination between these isolates (Sajewicz-Krukowska and Domanska-Blicharz 2016). Based on nucleotide sequence of genomes and amino acids sequence of serine protease and RdRp, Chicken astroviruses were placed closed to the duck astroviruses compared to turkey astroviruses or avian nephritis viruses. However, based on capsid protein the turkey astroviruses were phylogenetically placed between different isolates of the duck astroviruses as well as near to the CAstV/Poland/G059/2014 isolate. These observations suggest that the capsid protein of turkey, duck and chicken astroviruses evolved through possible recombination between the astroviruses of different avian species and suggests that the turkey and duck may play an important role in epidemiology of avian astroviruses similar to that of influenza viruses (Alexander 2000). Among the Indian isolates, phylogenetic analysis of capsid protein showed placement of the CAstV/INDIA/ANAND/2016 between two north zone isolates, however very limited sequence divergence was seen among the reported Indian isolates suggesting their recent emergence and common origin.

Epitope analysis of the capsid protein sequence revealed two unique epitopes in our isolate whereas 7 epitopes were found to be shared among the Indian astrovirus isolates. Except CAstV/Poland/G059/2014 isolate which contained all unique epitopes, other isolates shared two common epitopes. Our analysis suggests that the vaccine design using the Indian astrovirus isolate may provide cross protection against prevailing isolates in India. Further, epitope mapping would be useful to design safe and effective vaccine against divergent astroviruses (Ahmad et al. 2016; Soria-Guerra et al. 2015).

In summary, whole genome analysis of Indian astrovirus isolate by next generation sequencing technology determined full length genome of the chicken astrovirus isolate for the first time in India. The present study provided genetic relatedness of the circulating Indian isolate with the other reported nearly complete genome sequences of avian astroviruses. The analysis of capsid protein sequence of reported chicken astroviruses from India revealed limited structural divergence suggesting their common ancestral origin and recent emergence. Considering high sequence identity of capsid structural protein among prevailing strains in the India, the CAstV/INDIA/ANAND/2016 isolate could serve as the potential source for its further development as a vaccine candidate. The identification of unique and shared epitopes among different astroviruses will be helpful in designing effective epitope based vaccine formulation.