Follow us: Twitter Linkedin

BG7 annotation of the isolate GOS2 of the E. coli O104:H4 German Outbreak sequenced with 454

We've annotated with BG7 the isolate GOS2 of the E. coli O104:H4 German Outbreak. The isolate was sequenced with 454

 

We've used the following set of proteins as reference proteins:

 

- The representative Uniprot proteins corresponding to all Uniref90 clusters for all Escherichia coli proteins
- All Uniprot proteins from organisms that have in their name the terms “EHEC” or “EAEC”
- All Uniprot proteins from bacteria that have in any Uniprot field the term “toxin”
- All Uniprot proteins from bacteria that have in any Uniprot field “hemolysin”
- All the proteins from Salmonella typhi, Yersinia pestis and Shigella dysenteria

And these are the results we obtained:

 

The files with the annotation can be downloaded here:

http://s3-eu-west-1.amazonaws.com/era7bioinformatics-public-data/Era7-G2L-GOS2-Annnotation.zip

 

Annotation
G2L GOS2
Organism
E. coli
isolate
GOS2
Technology
454
Details
-
Date of the annotation
06/02/2011
BG7 Version
0.9
Number of contigs
263
Number of RNAs
121
Number of genes
5138
Number of 'perfect genes'
4535
Number of genes with frameshifts
216
Number of genes with 'putative' substitutions*
429