Additional gene prediction analysis and functional annotation were performed within the Integrated Microbial Genomes – Expert Review (IMG-ER) platform [35].

Genome properties

The genome consists of a 4,511,574 bp long chromosome with a 35.5% G+C content and a 4,916 bp plasmid with a 40% G+C content (Table 3 and Figure 3). Of the 3,857 genes predicted, 3,808 were protein-coding genes and 49 were RNA genes; 51 pseudogenes were also identified. The majority of the protein-coding genes (62.2%) were assigned a putative function, while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COG functional categories is presented in Table 4.

Table 3 Genome Statistics

Figure 3 Graphical circular map of the chromosome (plasmid map not shown).
From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC …

Table 4 Number of genes associated with the general COG functional categories

Acknowledgements

We would like to gratefully acknowledge the help of Maren Schröder for growing M. tractuosa cultures and Susanne Schneider for DNA extraction and quality analysis (both at DSMZ). This work was performed under the auspices of the US Department of Energy Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under contract No. DE-AC02-05CH11231, Lawrence Livermore National Laboratory under contract No. DE-AC52-07NA27344, and Los Alamos National Laboratory under contract No. DE-AC02-06NA25396, UT-Battelle and Oak Ridge National Laboratory under contract DE-AC05-00OR22725, as well as German Research Foundation (DFG) INST 599/1-2.

The single genomic 16S rRNA sequence of strain O7/1T was compared using NCBI BLAST under default settings (e.g., considering only the high-scoring segment pairs (HSPs) from the best 250 hits) with the most recent release of the Greengenes database [10], and the relative frequencies of taxa and keywords (reduced to their stem [11]), weighted by BLAST scores, were determined. The five most frequent genera were Sulfolobus (27.8%), Aeropyrum (11.3%), Desulfurococcus (11.3%), Ignicoccus (6.5%) and Vulcanisaeta (6.2%) (100 hits in total). Regarding the five hits to sequences from other members of the genus, the average identity within HSPs was 96.7%, whereas the average coverage by HSPs was 97.4%. Among all other species, the one yielding the highest score was Desulfurococcus mobilis, which corresponded to an identity of 100.0% and an HSP coverage of 100.0%.
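
The score-weighted relative frequencies described above can be illustrated with a minimal sketch. It assumes that each BLAST hit has already been reduced to a (genus, score) pair and that the weight of a genus is simply the sum of the scores of its hits divided by the sum over all hits; the exact weighting scheme used for the reported values is not specified here, so this is an assumption. The function name `weighted_genus_frequencies` and the example hits are hypothetical and are not taken from the actual analysis.

```python
from collections import defaultdict


def weighted_genus_frequencies(hits):
    """Return score-weighted relative frequencies (in percent) per genus.

    `hits` is an iterable of (genus, blast_score) pairs. This is an assumed
    input format; parsing real BLAST output is outside this sketch.
    """
    totals = defaultdict(float)
    for genus, score in hits:
        # Each hit contributes its score, not a count of 1.
        totals[genus] += score

    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}

    # Sort genera by descending summed score and convert to percentages.
    ranked = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    return {genus: 100.0 * score / grand_total for genus, score in ranked}


# Hypothetical hits, for illustration only (not data from the study).
example_hits = [
    ("GenusA", 300.0),
    ("GenusA", 200.0),
    ("GenusB", 300.0),
    ("GenusC", 200.0),
]

frequencies = weighted_genus_frequencies(example_hits)
for genus, percent in frequencies.items():
    print(f"{genus}: {percent:.1f}%")
```

With this weighting, a genus represented by a few high-scoring hits can outrank one represented by many low-scoring hits, which is the practical difference from a plain hit count.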