Sequence database

From WikiMD's medical encyclopedia

Sequence database refers to a specialized type of database designed to store biological sequence information, such as DNA, RNA, and protein sequences. These databases are essential tools in the field of bioinformatics, facilitating a wide range of activities including gene identification, phylogenetics, and the study of evolutionary biology. Sequence databases can be broadly categorized into primary, secondary, and composite databases, each serving different purposes and containing various types of sequence data and annotations.

Primary Sequence Databases

Primary sequence databases are repositories that store raw sequence data generated from sequencing projects. They provide minimal annotation, such as the source of the sequence (the organism), the name of the submitter, and the date of submission. The most prominent examples include:

  • GenBank: A comprehensive public database of nucleotide sequences and supporting bibliographic and biological annotation. It is maintained by the National Center for Biotechnology Information (NCBI).
  • EMBL Bank: Managed by the European Molecular Biology Laboratory, this database collects nucleotide sequences from all over the world and offers free access to the data.
  • DDBJ: The DNA Data Bank of Japan, which collects DNA sequences from researchers and provides free access.

Secondary Sequence Databases

Secondary sequence databases contain curated sequences that have been processed and annotated with additional information, such as the function of the protein, domain structure, and evolutionary relationships. These databases often derive their content from primary databases but add value through curation and annotation. Examples include:

  • UniProt: A comprehensive resource for protein sequence and annotation data, providing detailed information about the biological function of proteins.
  • Pfam: A database of protein families, each represented by multiple sequence alignments and hidden Markov models.

Composite Sequence Databases

Composite sequence databases integrate data from both primary and secondary databases to provide a more comprehensive view of sequence information. They may also include specialized tools for data analysis. An example is the NCBI Entrez system, which allows users to search and retrieve information from multiple databases simultaneously.

Applications and Importance

Sequence databases are indispensable in the field of bioinformatics and molecular biology. They support a wide range of scientific research, including:

  • Identifying new genes and determining their function.
  • Studying genetic variation and its implications for health and disease.
  • Tracing the evolutionary history of organisms and genes.
  • Developing new drugs and vaccines.

Challenges and Future Directions

As sequencing technologies continue to evolve, sequence databases face challenges in data volume, quality control, and annotation. Efforts are ongoing to improve the accuracy and completeness of sequence annotations, integrate data from diverse sources, and develop more powerful tools for data analysis and visualization.

See Also

Stub icon
   This article is a biographical stub. You can help WikiMD by expanding it!



Sequence database gallery

Navigation: Wellness - Encyclopedia - Health topics - Disease Index‏‎ - Drugs - World Directory - Gray's Anatomy - Keto diet - Recipes

Transform your life with W8MD's budget GLP-1 injections from $125.

W8mdlogo.png
W8MD weight loss doctors team

W8MD offers a medical weight loss program to lose weight in Philadelphia. Our physician-supervised medical weight loss provides:

NYC weight loss doctor appointments

Start your NYC weight loss journey today at our NYC medical weight loss and Philadelphia medical weight loss clinics.

Linkedin_Shiny_Icon Facebook_Shiny_Icon YouTube_icon_(2011-2013) Google plus


Advertise on WikiMD

WikiMD's Wellness Encyclopedia

Let Food Be Thy Medicine
Medicine Thy Food - Hippocrates

Medical Disclaimer: WikiMD is not a substitute for professional medical advice. The information on WikiMD is provided as an information resource only, may be incorrect, outdated or misleading, and is not to be used or relied on for any diagnostic or treatment purposes. Please consult your health care provider before making any healthcare decisions or for guidance about a specific medical condition. WikiMD expressly disclaims responsibility, and shall have no liability, for any damages, loss, injury, or liability whatsoever suffered as a result of your reliance on the information contained in this site. By visiting this site you agree to the foregoing terms and conditions, which may from time to time be changed or supplemented by WikiMD. If you do not agree to the foregoing terms and conditions, you should not enter or use this site. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates, categories Wikipedia, licensed under CC BY SA or similar.

Contributors: Prab R. Tumpati, MD