|
Projects related to the Priority Program Final Report of the Priority Program
Home
|
SHORT DESCRIPTIONINFORMATICS METHODS FOR THE ANALYSIS AND INTERPRETATION OF LARGE GENOMIC DATASETS
The extensive
activities at sequencing the genomes of
whole organisms revolutionize molecular
biology and biotechnology. Already more
than 10 microorganisms have completely
been sequenced (Status of March 1997).
Seven genomes are openly accessible
including the eucaryote yeast with about
12 million base pairs. The sequencing of
the human genome is planned to be
completed at the latest until the year
2005. The result of the sequencings is a
wealth of data, which cannot be dealt
with any more by conventional methods of
data analysis and modelling. Yeast,
whose sequencing has just been
completed, has about 6000 genes. Already
the task of getting an overview over
this amount of data requires new methods
of data analysis. It is not sufficient
any more to concentrate on the
examination of sequence patterns,
structures and functions of single
genes, RNA-molecules or proteins.
Rather, new procedures are needed in
order to search through and assess
selectively large genomic data records.
We call such procedures "screening
methods". Here, the analysis of
evolutionary, structural and functional
similarities play an important role. The
methods for this analysis originate from
computer science and mathematics. For
this reason, in its recommendations on
progress in biotechnology, the
Technology Council of the Chancellor of
the Federal Republic of Germany
attributes a high significance to
Computational Biology and recommends, to
push it forward with much emphasis. We
react to this recommendation with the
present suggestion for a priority
program of the DFG. The following problem areas pertain to this overall goal:
In all these fields the focus of the priority program is on methods that afford a sufficiently high data throughput for subjecting large genomic datasets to complex analysis, as well. Projects to be funded in the Priority Program have to have both significant Computer Science and/or mathematics content and direct relevance to important molecular biological questions. To this end, it is important to secure the methodical side through qualified computer science and/or mathematics and simultaneously secure that the goals of the project are relevant for genome research. Scientists from the fields of computational biology, Computer science and mathematics can apply. Interdisciplinary cooperation is especially desirable. The evaluation should be carried out by specialists that have interdisciplinary competence in both biology and computer science/mathematics. The results of the priority program should be made accessible directly to the biological research community. Therefore, the validation of tools on biological data is mandatory and the software should be usable and accessible. Routine colloquiums shall present the work of the priority program and foster the direct exchange with the bio-sciences. An internet website of the priority program is an integral part of the program.
|