KE Simmon, VP Hodson, A. Croft, MA Fisher. Poster presented during the 110th General Meeting of the American Society for Microbiology, San Diego, CA, May 2010.
Citation: "Uncurated databases often contain redundant sequences for a species, erroneous entries, and entries lacking species designation. These types of entries can hinder bacterial identification with 16S rRNA gene sequencing (16S) by increasing analysis time. To minimize the impact of such entries, a novel commercial database was developed that contains one sequence per validated species (SmartGene, Raleigh, NC). Each entry was identified by a statistical approach that chose the "Centroid" sequence that best represented the species. In this study we utilized this Centroid database (CEN) to determine the accuracy and analysis time on a set of clinical isolates previously identified using the larger SmartGene Eubacteria database (EUB)... Overall, the CEN is a useful tool that can significantly reduce the interpretation time for bacterial identification in clinical laboratories."