UniGene: Difference between revisions

From WikiMD's Wellness Encyclopedia

CSV import
CSV import
 
Line 1: Line 1:
== UniGene ==
{{DISPLAYTITLE:UniGene}}


[[File:Unigene_banner.jpg|thumb|right|UniGene logo]]
[[File:Unigene_banner.jpg|thumb|right|UniGene logo]]


'''UniGene''' is a system for automatically partitioning [[GenBank]] sequences into a non-redundant set of gene-oriented clusters. Each UniGene cluster contains sequences that represent a unique gene, as well as related information such as the tissue types in which the gene has been expressed and map location.
'''UniGene''' is a database of the [[National Center for Biotechnology Information|NCBI]] that provides a non-redundant set of [[gene]]-oriented clusters of [[expressed sequence tag|EST]] sequences. It is a valuable resource for researchers in the field of [[genomics]] and [[bioinformatics]].


== History ==
==Overview==
UniGene was developed to organize [[expressed sequence tag|EST]] sequences into clusters that represent a unique [[gene]] or a set of closely related genes. Each UniGene cluster contains sequences that are believed to come from the same transcription locus, along with related information such as tissue types in which the gene is expressed and [[chromosome|chromosomal]] location.


UniGene was developed by the [[National Center for Biotechnology Information]] (NCBI) as a tool to organize and simplify the study of gene expression. It was first introduced in the late 1990s and has since become a valuable resource for researchers in the field of [[genomics]].
==Functionality==
UniGene serves several important functions in the field of [[genomics]]:


== Functionality ==
* '''Gene Discovery''': By clustering ESTs, UniGene helps in identifying new genes and understanding gene expression patterns across different tissues.
* '''Expression Analysis''': Researchers can use UniGene to study the expression of genes in various tissues and developmental stages.
* '''Sequence Alignment''': UniGene provides alignments of ESTs to known [[genome|genomic]] sequences, aiding in the annotation of [[genome|genomes]].


UniGene provides a comprehensive view of the transcriptome by clustering [[mRNA]] sequences from [[GenBank]], [[dbEST]], and [[RefSeq]] into unique gene clusters. Each cluster is assigned a unique identifier and includes sequences from different sources that are believed to represent the same gene. This allows researchers to study gene expression patterns across different tissues and developmental stages.
==Data Organization==
UniGene organizes data into clusters based on sequence similarity. Each cluster is assigned a unique identifier and includes:


== Applications ==
* A list of [[expressed sequence tag|EST]] sequences
* Information about the tissues from which the sequences were derived
* Links to related [[gene]] information in other databases


UniGene is widely used in [[bioinformatics]] and [[genomics]] research. It helps in identifying novel genes, studying gene expression, and understanding the genetic basis of diseases. Researchers use UniGene to find [[expressed sequence tags]] (ESTs) that are associated with specific genes, which can be useful in [[gene discovery]] and [[functional genomics]].
==Applications==
UniGene is widely used in [[bioinformatics]] and [[genomics]] research for:


== Related pages ==
* '''Gene Annotation''': Assisting in the annotation of [[genome|genomic]] sequences by providing evidence of transcription.
* '''Comparative Genomics''': Facilitating the comparison of gene expression patterns across different species.
* '''Functional Genomics''': Supporting studies on gene function and regulation by providing expression data.


* [[GenBank]]
==Limitations==
While UniGene is a powerful tool, it has some limitations:
 
* '''Redundancy''': Some clusters may contain sequences from closely related genes, leading to potential redundancy.
* '''Incomplete Coverage''': Not all genes are represented in UniGene, especially those with low expression levels.
 
==Related pages==
* [[Expressed sequence tag]]
* [[Expressed sequence tag]]
* [[Genomics]]
* [[Bioinformatics]]
* [[National Center for Biotechnology Information]]
* [[National Center for Biotechnology Information]]
* [[Gene expression]]
== References ==
* Schuler, G. D. (1997). "Pieces of the puzzle: Expressed sequence tags and the catalog of human genes." Science, 274(5287), 540-546.
* Wheeler, D. L., et al. (2000). "Database resources of the National Center for Biotechnology Information." Nucleic Acids Research, 28(1), 10-14.


[[Category:Bioinformatics databases]]
[[Category:Bioinformatics databases]]
[[Category:Genomics]]
[[Category:Genomics]]

Latest revision as of 03:48, 13 February 2025


UniGene logo

UniGene is a database of the NCBI that provides a non-redundant set of gene-oriented clusters of EST sequences. It is a valuable resource for researchers in the field of genomics and bioinformatics.

Overview[edit]

UniGene was developed to organize EST sequences into clusters that represent a unique gene or a set of closely related genes. Each UniGene cluster contains sequences that are believed to come from the same transcription locus, along with related information such as tissue types in which the gene is expressed and chromosomal location.

Functionality[edit]

UniGene serves several important functions in the field of genomics:

  • Gene Discovery: By clustering ESTs, UniGene helps in identifying new genes and understanding gene expression patterns across different tissues.
  • Expression Analysis: Researchers can use UniGene to study the expression of genes in various tissues and developmental stages.
  • Sequence Alignment: UniGene provides alignments of ESTs to known genomic sequences, aiding in the annotation of genomes.

Data Organization[edit]

UniGene organizes data into clusters based on sequence similarity. Each cluster is assigned a unique identifier and includes:

  • A list of EST sequences
  • Information about the tissues from which the sequences were derived
  • Links to related gene information in other databases

Applications[edit]

UniGene is widely used in bioinformatics and genomics research for:

  • Gene Annotation: Assisting in the annotation of genomic sequences by providing evidence of transcription.
  • Comparative Genomics: Facilitating the comparison of gene expression patterns across different species.
  • Functional Genomics: Supporting studies on gene function and regulation by providing expression data.

Limitations[edit]

While UniGene is a powerful tool, it has some limitations:

  • Redundancy: Some clusters may contain sequences from closely related genes, leading to potential redundancy.
  • Incomplete Coverage: Not all genes are represented in UniGene, especially those with low expression levels.

Related pages[edit]