GADU - Genome Analysis and Database Update Pipeline
|Title||GADU - Genome Analysis and Database Update Pipeline|
|Year of Publication||2003|
|Authors||Rodriguez, A, Sulakhe, D, Marland, E, Nefedova, V, Yu, GX, Maltsev, N|
Realizing the enormous scientific potential of exponentially growing biological information requires the development of high-throughput automated computational environments that integrate large amounts of genomic and experimental data, and powerful tools for knowledge discovery and data mining. To assist high-throughput analysis of the genomes, we have developed the Genome Analysis and Databases Update system. GADU efficiently automates major steps of genome analysis: data acquisition and data analysis by a variety of tools and algorithms, as well as data storage and annotation. We are developing a TeraGrid technology-based backend for large-scale computations using GADU. GADU can function in either an automated or interactive mode via a Web-based user interface. Programs monitor every operation in GADU and report the status of the process. This architecture ensures GADU\'s robust performance and allows simultaneous processing of a large number of sequenced genomes regardless of their size.