NGS read assembly

NGS data assembly presents a bioinformatics challenge, mainly because of the following three key characteristics:


short fragments: read length is at most 70 bp for the Illumina and SOLiD platforms, and 350 bp for the new 454 Titanium, far shorter than traditional Sanger sequencing reads.

big data: sequencing a microbial genome on a 454 GS-FLX at 20x coverage will yield about 200 Mbp.

new technologies: the data must be handled in a technology-specific manner, because error models, read lengths, data sizes, and so on differ between platforms.
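
The figures above are related by simple arithmetic: average coverage is the total number of sequenced bases divided by the genome size. The following is a minimal sketch of that relationship, not part of any assembler. The function names are hypothetical, and the 10 Mbp genome size is an assumption chosen only so that 20x coverage reproduces the 200 Mbp figure quoted above; reads are treated as having a single fixed length, which real data does not.

```python
def total_bases(genome_size_bp: int, coverage: int) -> int:
    """Total sequenced bases needed to reach the given average coverage."""
    return genome_size_bp * coverage


def reads_needed(genome_size_bp: int, coverage: int, read_length_bp: int) -> int:
    """Minimum number of fixed-length reads (rounded up) for that coverage."""
    bases = total_bases(genome_size_bp, coverage)
    return -(-bases // read_length_bp)  # integer ceiling division


if __name__ == "__main__":
    GENOME_SIZE_BP = 10_000_000  # assumed genome size (10 Mbp), for illustration
    COVERAGE = 20                # 20x, as in the text

    print("total bases:", total_bases(GENOME_SIZE_BP, COVERAGE))
    # Read lengths taken from the text: 70 bp and 350 bp.
    for read_length in (70, 350):
        n = reads_needed(GENOME_SIZE_BP, COVERAGE, read_length)
        print(f"{read_length} bp reads needed: {n}")
```

Under these assumptions, shortening the reads from 350 bp to 70 bp multiplies the number of reads an assembler must handle by five for the same coverage, which is one way the short-fragment and big-data points reinforce each other.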

