Analysing and storing genetic information
Analysing and storing genetic information involves several sophisticated techniques that enable scientists to study, manage, and leverage vast amounts of genetic data for various applications in research, medicine, and other fields.
Key Technologies for Analysing and Storing Genetic Information:
DNA Microarrays (Gene Chips) DNA microarrays, also known as gene or DNA chips, are powerful tools used to identify genes present in an organism's genome and to determine which genes are expressed within cells.
Structure and Principle:
They are small pieces of glass or plastic (often 2 cm²) printed with thousands of tiny spots, each containing a known DNA sequence (probe). These probes are short, single strands of DNA with specific base sequences.
Analysis of Genomic DNA:
DNA from different species or individuals is digested into fragments (using restriction enzymes), then denatured into single strands.
These single-stranded DNA fragments are labeled with fluorescent tags (e.g., green for one sample, red for another), mixed, and then hybridized (bound) to complementary probes on the microarray. Unbound DNA is washed away.
The microarray is inspected under UV light or scanned by a laser scanner to detect fluorescence. The colors and patterns indicate which genes are present in each sample.
Analysis of Gene Expression:
To identify active genes, mRNA is extracted from cells (e.g., cancerous versus healthy cells).
This mRNA is then converted into complementary DNA (cDNA) using reverse transcriptase. The cDNA is labeled with fluorescent tags.
The labeled cDNA is hybridized to the microarray probes. The intensity of the fluorescent light emitted by each spot indicates the level of activity or expression of that particular gene.
Applications of Microarrays:
Analyzing genomes and comparing genes between different species or individuals.
Detecting mRNA in gene expression studies to understand which genes are active at a specific time.
Investigating gene mutations (e.g., in breast cancer) and identifying active genes in cells during development.
Bioinformatics and Genetic Databases Bioinformatics is a new discipline that combines biological data with computer technology and statistics. Its role is to facilitate the storage, processing, analysis, and sharing of vast amounts of biological information.
Data Stored:
Gene sequences and complete genomes (e.g., from the Human Genome Project).
Amino acid sequences (primary structures) and three-dimensional structures of proteins.
The quantity of data held in these databases is immense and growing exponentially.
Examples of Databases and Tools:
Ensembl holds eukaryotic genomes, including human, zebrafish, and mouse.
UniProt provides information on primary protein sequences and functions.
Protein Data Bank details amino acid sequences and protein structures.
The BLAST algorithm is a search tool used by researchers to find similarities between biological sequences they are studying and those already saved in databases.
Applications of Bioinformatics and Databases:
Drug Development: Identifying genes and proteins involved in diseases, aiding in the creation of new drugs that target specific pathways.
Personalized Medicine: Using genetic information to predict how individuals will respond to different drugs, allowing for customized drug prescriptions.
Evolutionary Studies: Comparing genome or protein sequences to establish evolutionary relationships and clarify classification systems by identifying common ancestry.
Research: Understanding how gene changes affect animal health (e.g., mice, zebrafish) to predict effects on human health, and studying complex genetic systems and gene expression.
Disease Diagnosis and Genetic Screening: Identifying specific disease-causing mutations and health risks. Databases like CFTR2 provide health professionals with up-to-date information on genetic variants.
Vaccine Production: Information on parasite genomes (e.g., Plasmodium) provides valuable data for developing vaccines.
Industrial Protection: Bioinformatics can be used to provide bacteria in industrial settings with protection against viruses.
DNA Sequencing DNA sequencing is the process of determining the complete genome sequence of an organism. It is a foundational technology that generates the massive amounts of data stored in genetic databases. Sequencing methods are continually being improved and are now largely automated. DNA sequencing is also crucial for genetic fingerprinting (DNA profiling), where unique patterns of DNA fragments are analyzed.
These technologies work in conjunction; DNA sequencing generates the raw data, microarrays offer high-throughput analysis of gene presence and activity, and bioinformatics provides the computational framework for the storage, retrieval, and interpretation of this vast biological information.
Last updated