COMMUNITY
BG7 annotation of the isolate GOS1 of the E. coli O104:H4 German Outbreak sequenced with 454
We've annotated with BG7 the isolate GOS1 of the E. coli O104:H4 German Outbreak. The isolate was sequenced with 454 technology. The sequences and the assembly were kindly provided by the Gotingen Genomic Laboratory
We've used the following set of proteins as reference proteins:
- The representative Uniprot proteins corresponding to all Uniref90 clusters for all Escherichia coli proteins
- All Uniprot proteins from organisms that have in their name the terms “EHEC” or “EAEC”
- All Uniprot proteins from bacteria that have in any Uniprot field the term “toxin”
- All Uniprot proteins from bacteria that have in any Uniprot field “hemolysin”
- All the proteins from Salmonella typhi, Yersinia pestis and Shigella dysenteria
And these are the results we obtained:
The files with the annotation can be downloaded here:
http://s3-eu-west-1.amazonaws.com/era7bioinformatics-public-data/Era7-G2L-GOS1-Aannotation.zip
Annotation | G2L GOS1 |
|||
Organism | E. coli |
|||
isolate | GOS1 |
|||
Technology | 454 |
|||
Details | - |
|||
Date of the annotation | 06/16/2011 |
|||
BG7 Version | 0.9 |
|||
Number of contigs | 229 |
|||
Number of RNAs | 126 |
|||
Number of genes | 5125 |
|||
Number of 'perfect genes' | 4551 |
|||
Number of genes with frameshifts | 220 |
|||
Number of genes with 'putative' substitutions* | 388 |