UniGene: Difference between revisions

From WikiMD's Wellness Encyclopedia

CSV import
Tags: mobile edit mobile web edit
 
CSV import
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
'''UniGene''' is a database that is maintained by the [[National Center for Biotechnology Information]] (NCBI). It provides an organized view of the [[transcriptome]], and for each known gene, it presents the sequences that represent the gene's transcriptional landscape.
{{DISPLAYTITLE:UniGene}}


== Overview ==
[[File:Unigene_banner.jpg|thumb|right|UniGene logo]]


The UniGene database is a system for partitioning [[transcript sequences]] into a non-redundant set of gene-oriented clusters. Each UniGene cluster contains sequences that represent a unique gene, as well as related information such as the [[tissue types]] in which the gene has been expressed and map location.
'''UniGene''' is a database of the [[National Center for Biotechnology Information|NCBI]] that provides a non-redundant set of [[gene]]-oriented clusters of [[expressed sequence tag|EST]] sequences. It is a valuable resource for researchers in the field of [[genomics]] and [[bioinformatics]].


== Construction ==
==Overview==
UniGene was developed to organize [[expressed sequence tag|EST]] sequences into clusters that represent a unique [[gene]] or a set of closely related genes. Each UniGene cluster contains sequences that are believed to come from the same transcription locus, along with related information such as tissue types in which the gene is expressed and [[chromosome|chromosomal]] location.


The UniGene database is constructed by partitioning GenBank sequences into a non-redundant set of gene-oriented clusters. Each UniGene cluster contains sequences that represent a unique gene. The process of creating a UniGene cluster involves several steps:
==Functionality==
UniGene serves several important functions in the field of [[genomics]]:


# Sequences are partitioned into clusters based on their overlap.
* '''Gene Discovery''': By clustering ESTs, UniGene helps in identifying new genes and understanding gene expression patterns across different tissues.
# Each cluster is then analyzed to identify the most likely representative sequence.
* '''Expression Analysis''': Researchers can use UniGene to study the expression of genes in various tissues and developmental stages.
# The representative sequence is then used to search the rest of the database to find other sequences that may belong to the same gene.
* '''Sequence Alignment''': UniGene provides alignments of ESTs to known [[genome|genomic]] sequences, aiding in the annotation of [[genome|genomes]].


== Applications ==
==Data Organization==
UniGene organizes data into clusters based on sequence similarity. Each cluster is assigned a unique identifier and includes:


UniGene is a valuable resource for many areas of [[genomic research]]. These include:
* A list of [[expressed sequence tag|EST]] sequences
* Information about the tissues from which the sequences were derived
* Links to related [[gene]] information in other databases


# [[Gene discovery]] and [[annotation]]
==Applications==
# [[Gene expression]] studies
UniGene is widely used in [[bioinformatics]] and [[genomics]] research for:
# [[Comparative genomics]]
# [[Molecular evolution]] studies
# [[Disease research]]


== See also ==
* '''Gene Annotation''': Assisting in the annotation of [[genome|genomic]] sequences by providing evidence of transcription.
* '''Comparative Genomics''': Facilitating the comparison of gene expression patterns across different species.
* '''Functional Genomics''': Supporting studies on gene function and regulation by providing expression data.


* [[National Center for Biotechnology Information]]
==Limitations==
* [[GenBank]]
While UniGene is a powerful tool, it has some limitations:
* [[Transcriptome]]
* [[Gene expression]]


== References ==
* '''Redundancy''': Some clusters may contain sequences from closely related genes, leading to potential redundancy.
* '''Incomplete Coverage''': Not all genes are represented in UniGene, especially those with low expression levels.


{{reflist}}
==Related pages==
* [[Expressed sequence tag]]
* [[Genomics]]
* [[Bioinformatics]]
* [[National Center for Biotechnology Information]]


[[Category:Bioinformatics databases]]
[[Category:Genomics]]
[[Category:Genomics]]
[[Category:Bioinformatics databases]]
[[Category:National Institutes of Health]]
[[Category:Biological databases]]
{{biology-stub}}
{{medicine-stub}}

Latest revision as of 03:48, 13 February 2025


UniGene logo

UniGene is a database of the NCBI that provides a non-redundant set of gene-oriented clusters of EST sequences. It is a valuable resource for researchers in the field of genomics and bioinformatics.

Overview[edit]

UniGene was developed to organize EST sequences into clusters that represent a unique gene or a set of closely related genes. Each UniGene cluster contains sequences that are believed to come from the same transcription locus, along with related information such as tissue types in which the gene is expressed and chromosomal location.

Functionality[edit]

UniGene serves several important functions in the field of genomics:

  • Gene Discovery: By clustering ESTs, UniGene helps in identifying new genes and understanding gene expression patterns across different tissues.
  • Expression Analysis: Researchers can use UniGene to study the expression of genes in various tissues and developmental stages.
  • Sequence Alignment: UniGene provides alignments of ESTs to known genomic sequences, aiding in the annotation of genomes.

Data Organization[edit]

UniGene organizes data into clusters based on sequence similarity. Each cluster is assigned a unique identifier and includes:

  • A list of EST sequences
  • Information about the tissues from which the sequences were derived
  • Links to related gene information in other databases

Applications[edit]

UniGene is widely used in bioinformatics and genomics research for:

  • Gene Annotation: Assisting in the annotation of genomic sequences by providing evidence of transcription.
  • Comparative Genomics: Facilitating the comparison of gene expression patterns across different species.
  • Functional Genomics: Supporting studies on gene function and regulation by providing expression data.

Limitations[edit]

While UniGene is a powerful tool, it has some limitations:

  • Redundancy: Some clusters may contain sequences from closely related genes, leading to potential redundancy.
  • Incomplete Coverage: Not all genes are represented in UniGene, especially those with low expression levels.

Related pages[edit]