SNP markers retrieval for a non-model species: A practical approach

A. Shahin, T. Van Gurp, S. A. Peters, R. G. F. Visser, J. M. Van Tuyl, P. Arens

Onderzoeksoutput: Bijdrage aan wetenschappelijk tijdschrift/periodieke uitgaveArtikelWetenschappelijkpeer review

164 Downloads (Pure)

Samenvatting


Background

SNP (Single Nucleotide Polymorphism) markers are rapidly becoming the markers of choice for applications in breeding because of next generation sequencing technology developments. For SNP development by NGS technologies, correct assembly of the huge amounts of sequence data generated is essential. Little is known about assembler's performance, especially when dealing with highly heterogeneous species that show a high genome complexity and what the possible consequences are of differences in assemblies on SNP retrieval. This study tested two assemblers (CAP3 and CLC) on 454 data from four lily genotypes and compared results with respect to SNP retrieval.


Results

CAP3 assembly resulted in higher numbers of contigs, lower numbers of reads per contig, and shorter average read lengths compared to CLC. Blast comparisons showed that CAP3 contigs were highly redundant. Contrastingly, CLC in rare cases combined paralogs in one contig. Redundant and chimeric contigs may lead to erroneous SNPs. Filtering for redundancy can be done by blasting selected SNP markers to the contigs and discarding all the SNP markers that show more than one blast hit. Results on chimeric contigs showed that only four out of 2,421 SNP markers were selected from chimeric contigs.


Conclusion

In practice, CLC performs better in assembling highly heterogeneous genome sequences compared to CAP3, and consequently SNP retrieval is more efficient. Additionally a simple flow scheme is suggested for SNP marker retrieval that can be valid for all non-model species.
Originele taal-2Engels
Artikelnummer79
TijdschriftBMC Research Notes
Volume5
DOI's
StatusGepubliceerd - 2012

Vingerafdruk

Duik in de onderzoeksthema's van 'SNP markers retrieval for a non-model species: A practical approach'. Samen vormen ze een unieke vingerafdruk.

Citeer dit