Enter your search keyword(s):

Click to search our directories-AllWebHunt, Encyclopedic, TopChoice, Or Google, Alexa, About & Yahoo:

 


Bioinformatics
Home / Top / Science / Biology / Bioinformatics See also:
Related articles

Edit | Discuss Article

Bioinformatics

Bioinformatics or computational biology is the use of mathematical and informational techniques, including statistics, to solve biological problems, usually by creating or using computer programs, mathematical models or both. One of the main areas of bioinformatics is the data mining and analysis of the data gathered by the various genome projects. Other areas are sequence alignment, protein structure prediction, systems biology, protein-protein interactions and virtual evolution. As a summary, the various genome projects produce many long lists of letters and one of the roles of bioinformatics is to attempt to determine the words, grammar, sentences and ultimately, meaning (functional significance) of those letters.

Table of contents
1 Sequence analysis
2 Bioinformatics tools
3 Bioinformatics and structural biology
4 Modeling biological systems
5 Other applications
6 See also
7 Bibliography
8 External links

Sequence analysis

Main articles: Sequence alignment, Sequence database

Since the Epstein-Barr virus was sequenced in 1984, the DNA sequence of more and more organisms is stored in electronic databases. These data are analyzed to determine genes that code for proteins, as well as regulatory sequences. A comparison of genes within a species or between different species can show similarities between protein functions, or relations between species (the use of molecular systematics to construct phylogenetic trees). With the growing amount of data, it becomes impossible to analyze DNA sequences manually. Today, computer programs are used to find similar sequences in the genome of dozens of organisms, within billions of nucleotides. These programs can compensate for mutations (exchanged, deleted or inserted bases) in the DNA sequence, in order to identify sequences that are related, but not identical. A variant of this sequence alignment is used in the sequencing process itself. The so-called shotgun sequencing (that was used, for example, by Celera Genomics to sequence the human genome) does not give a sequential list of nucleotides, but instead the sequences of thousands of small DNA fragments (each about 600 nucleotides long). The ends of these fragments overlap and, aligned in the right way, make up the complete genome. Shotgun sequencing yields sequence data quickly, but the task to re-align the fragments can be quite complicated for larger genomes. In the case of the Human Genome Project, it took several months on a supercomputer array to align them correctly. Shotgun sequencing is generally preferred for smaller genomes, such as bacteria, and often used at least partially on organisms with much larger genomes.

Another aspect of bioinformatics in sequence analysis is the automatic search for genes and regulatory sequences within a genome. Not all of the nucleotides within a genome are genes. Within the genome of higher organisms, large parts of the DNA do not serve any obvious purpose. This so-called junk DNA may, however, contain unrecognized functional elements. Bioinformatics helps to bridge the gap between genome and proteome projects, for example in the use of DNA sequence for protein identification.

Bioinformatics tools

Computer scripting languages such as Perl and Python are often used to interface with biological databases and parse output from bioinformatics programs. Communities of bioinformatics programmers have set up free/open source projects such as EMBOSS, BioPerl, BioPython, BioRuby, and BioJava which develop and distribute shared programming tools and objects (as program modules) that make bioinformatics easier.

Bioinformatics and structural biology

Main article: Protein structure prediction

Protein structure prediction is another important application of bioinformatics. The amino acid sequence of a protein, the so-called primary structure, can be easily determined from the sequence on the gene that codes for it. But, the protein can only function correctly if it is folded in a very special and individual way (if it has the correct secondary, tertiary and quaternary structure). The prediction of this folding just by looking at the amino acid sequence is quite difficult. Several methods for computer predictions of protein folding are currently (as of 2004) under development.

One of the key principles in bioinformatics is homology. In the genomic branch of bioinformatics, homology is used to predict the function of a gene. If gene A is homologous to gene B of which the function is known, it is likely to have a similar function. In the structural branch of bioinformatics homology is used to determine which parts of the protein are important in structure formation and interaction with other proteins. In a technique called homology modelling, this information is used to predict the structure of a protein once the structure of a homologous protein is known. This currently remains the only way to predict protein structures reliably.

One case example of this is the similar protein homology between hemoglobin in humans and the hemoglobin in legumes (leghemoglobin). Both serve the same purpose of transporting oxygen in both organisms. Though both of these proteins have completely different amino acid sequences, their protein structures are virtually identical, which reflects their near identical purposes.

Modeling biological systems

Systems biology involves the use of computer simulations of cellular subsystems (such as the networks of metabolites and enzymes which comprise metabolism, signal transduction pathways and gene regulatory networks) to both analyze and visualize the complex connections of these cellular processes. Artificial life or virtual evolution attempts to understand evolutionary processes via the computer simulation of simple (artificial) life forms.

Other applications

Morphometrics is used to analyze pictures of embryos to track and to predict the fate of cell clusters during morphogenesis.

See also

Related fields

Bibliography

  • R. Durbin, S. Eddy, A. Krogh and G. Mitchison, Biological sequence analysis. Cambridge University Press, 1998.

External links

Topics within genomics
Genome project | Glycomics | Human Genome Project | Proteomics | Structural genomics
Bioinformatics | Systems biology
  1. redirect

 


Source | Copyright
Webmasters: Add your website here:

Readers: Edit | Discuss Listings

The Ensembl Project
Ensembl is a joint project between EMBL-EBI and the Sanger Centre to develop a software system which produces and maintains automatic annotation on eukaryotic genomes.
http://www.ensembl.org/

The Open Lab
A community focused on the freedom of information as it pertains to the biosciences.
http://bioinformatics.org/

The International Society for Computational Biology
The International Society for Computational Biology is dedicated to advancing the scientific understanding of living systems through computation; the emphasis is on the role of computing and informatics in advancing molecular biology.
http://www.iscb.org

The Bioinformatics Resource
The site of CCP11 (Collaborative Computational Project 11) the goal of which is to "to foster the broad bioinformatics community and the UK research community in particular". Comprehensive list of links, including information on courses, conferences and workshops that they run.
http://www.hgmp.mrc.ac.uk/CCP11/

European Molecular Biology Network
EMBnet is the only organisation world-wide bringing bioinformatics professionals to work together to serve the expanding fields of genetics and molecular biology.
http://www.embnet.org/

Bioinformatics and Biological Computing
Comprehensive bioinformatics site, with access to multiple database searching and sequence analysis tools - from the Weizmann Institute of Science.
http://bioinfo.weizmann.ac.il/

Biodatabase Mining
Whitepaper on database mining in the Human Genome Initiative.
http://biodatabases.com/whitepaper.html

Society for Bioinformatics in the Nordic countries
SocBiN is a non-profit organisation for people working with and interested in bioinformatics. One task of the society is to arrange annual conferences on Bioinformatics, of which the first took place April 1999 in Lund.
http://www.socbin.org/

DNA Structural Atlas
Easy-to-use summary of genomic information currently available for all organisms-from the Technical Univ. of Denmark.
http://www.cbs.dtu.dk/services/GenomeAtlas/

USGS Center for Biological Informatics
Facilitates access to and application of biological information.
http://biology.usgs.gov/cbi/

Iranian Bioinformatics Research Center
Research on Bioinformatics topics include: Genomics, Drug discovery, intelligent agent applications, sequencing, pattern matching, complex algorithms, distributed Databases, pharmaceogenomics, ...
http://www.Bio-IT.org

Bioweircom.org -- Site for Bioinformatics Developers
tools and info for bioinformatics newbies and gurus alike!
http://bioweircom.org

Visualisation for Bioinformatics
DNA microarray Visualisation Resources, Papers, Articles, Posters and Talks.
http://industry.ebi.ac.uk/~alan/VisSupp/

The Swiss Institute of Bioinformatics Homepage (SIB)
SIB operates the ExPASy proteomics server and the Swiss node of EMBnet. Teaching activities include a series of post-graduate courses given at the Universities of Geneva and Lausanne, as well as at the EPFL, and a Masters Degree in bioinformatics. Major research areas include the development of integrated databases and software resources in the field of proteomics.
http://www.isb-sib.ch/



Help build the largest human-edited directory on the web.
 Submit a Site - Open Directory Project (modified) - Become an Editor

Modified contents copyright 2008. All rights reserved.