Follow us: Twitter Linkedin

BG7 annotation of the isolate GOS1 of the E. coli O104:H4 German Outbreak sequenced with 454

We've annotated with BG7 the isolate GOS1 of the E. coli O104:H4 German Outbreak. The isolate was sequenced with 454 technology. The sequences and the assembly were kindly provided by the Gotingen Genomic Laboratory

 

We've used the following set of proteins as reference proteins:

 

- The representative Uniprot proteins corresponding to all Uniref90 clusters for all Escherichia coli proteins
- All Uniprot proteins from organisms that have in their name the terms “EHEC” or “EAEC”
- All Uniprot proteins from bacteria that have in any Uniprot field the term “toxin”
- All Uniprot proteins from bacteria that have in any Uniprot field “hemolysin”
- All the proteins from Salmonella typhi, Yersinia pestis and Shigella dysenteria

And these are the results we obtained:

 

The files with the annotation can be downloaded here:

http://s3-eu-west-1.amazonaws.com/era7bioinformatics-public-data/Era7-G2L-GOS1-Aannotation.zip

 

Annotation
G2L GOS1
Organism
E. coli
isolate
GOS1
Technology
454
Details
-
Date of the annotation
06/16/2011
BG7 Version
0.9
Number of contigs
229
Number of RNAs
126
Number of genes
5125
Number of 'perfect genes'
4551
Number of genes with frameshifts
220
Number of genes with 'putative' substitutions*
388