The results of COG categorization of the predicted open reading frames (ORFs) are shown in Additional file
1: Figure S1. The ORFs were assigned to 22 COG classes: S (424 ORFs, function unknown), E (339 ORFs, amino acid transport and metabolism), K (293 ORFs, transcription), R (273 ORFs, general function prediction only), T (262 ORFs, signal transduction mechanisms), G (256 ORFs, carbohydrate transport and metabolism), P (253 ORFs, inorganic ion transport and metabolism), C (236 ORFs, energy production and conversion), M (229 ORFs, cell wall/membrane/envelope biogenesis), J (190 ORFs, translation, ribosomal structure and biogenesis), O (171 ORFs, post-translational modification, protein turnover, chaperones), L (163 ORFs, replication, recombination and repair), H (131 ORFs, coenzyme transport and metabolism), U (128 ORFs, intracellular trafficking, secretion, and vesicular transport), N (122 ORFs, cell motility), I (101 ORFs, lipid transport and metabolism), F (90 ORFs, nucleotide transport and metabolism), Q (73 ORFs, secondary metabolites biosynthesis, transport and catabolism), V (68 ORFs, defence mechanisms), D (37 ORFs, cell cycle control, cell division, chromosome partitioning), A (1 ORF, RNA processing and modification), and B (1 ORF, chromatin structure and dynamics).
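
For readers who want to work with these numbers directly, the per-category counts can be tabulated as in the minimal sketch below. The counts are transcribed from the text above. The variable names are illustrative and are not taken from the study's own analysis pipeline. The summed value is the number of category assignments, which may differ from the number of distinct ORFs if any ORF was assigned to more than one COG category; the text does not state whether that occurred.

```python
# COG category letter -> number of ORFs, transcribed from the text above.
cog_counts = {
    "S": 424, "E": 339, "K": 293, "R": 273, "T": 262, "G": 256,
    "P": 253, "C": 236, "M": 229, "J": 190, "O": 171, "L": 163,
    "H": 131, "U": 128, "N": 122, "I": 101, "F": 90, "Q": 73,
    "V": 68, "D": 37, "A": 1, "B": 1,
}

# Total number of category assignments (not necessarily unique ORFs;
# this distinction is an assumption flagged in the lead-in).
total_assignments = sum(cog_counts.values())

# S (function unknown) and R (general function prediction only) together
# give the assignments without a specific functional annotation.
poorly_characterized = cog_counts["S"] + cog_counts["R"]

# Rank categories from most to least populated.
ranked = sorted(cog_counts.items(), key=lambda kv: kv[1], reverse=True)

for letter, n in ranked:
    share = 100 * n / total_assignments
    print(f"{letter}\t{n}\t{share:.1f}%")

print(f"Total assignments: {total_assignments}")
print(f"S + R assignments: {poorly_characterized}")
```

Summing the listed counts this way gives 3841 assignments across the 22 classes, of which 697 fall in the S and R categories.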