Argonne National Laboratory

GADU - Genome Analysis and Database Update Pipeline

TitleGADU - Genome Analysis and Database Update Pipeline
Publication TypeReport
Year of Publication2003
AuthorsRodriguez, A, Sulakhe, D, Marland, E, Nefedova, V, Yu, GX, Maltsev, N
Date Published02/2003
Other NumbersANL/MCS-P1029-0203

Realizing the enormous scientific potential of exponentially growing biological information requires the development of high-throughput automated computational environments that integrate large amounts of genomic and experimental data, and powerful tools for knowledge discovery and data mining. To assist high-throughput analysis of the genomes, we have developed the Genome Analysis and Databases Update system. GADU efficiently automates major steps of genome analysis: data acquisition and data analysis by a variety of tools and algorithms, as well as data storage and annotation. We are developing a TeraGrid technology-based backend for large-scale computations using GADU. GADU can function in either an automated or interactive mode via a Web-based user interface. Programs monitor every operation in GADU and report the status of the process. This architecture ensures GADU\'s robust performance and allows simultaneous processing of a large number of sequenced genomes regardless of their size.