Unigenes: Difference between revisions
CSV import Tags: mobile edit mobile web edit |
CSV import |
||
| Line 1: | Line 1: | ||
'''Unigenes''' are a | == Unigenes == | ||
[[File:Unigenes_humifusa_342167844.jpg|thumb|Unigenes humifusa]] | |||
'''Unigenes''' are a concept in [[genomics]] and [[bioinformatics]] that refer to a unique sequence of [[DNA]] that represents a single gene. The term is often used in the context of [[gene expression]] studies and [[genome annotation]]. | |||
== Overview == | == Overview == | ||
Unigenes are | Unigenes are constructed by clustering [[expressed sequence tags]] (ESTs) and other [[cDNA]] sequences to form a non-redundant set of gene-oriented clusters. Each unigene cluster represents a unique gene locus, which can be used to study gene expression patterns across different tissues and conditions. | ||
== Applications == | == Applications == | ||
Unigenes are used in | Unigenes are widely used in [[functional genomics]] to identify and annotate genes in newly sequenced genomes. They provide a valuable resource for [[comparative genomics]] and can help in identifying [[orthologs]] and [[paralogs]] across different species. Additionally, unigenes are instrumental in the development of [[microarrays]] and other [[high-throughput]] technologies for studying gene expression. | ||
== | == Construction == | ||
The construction of unigenes involves several steps: | |||
1. **Collection of ESTs**: Large-scale sequencing projects generate ESTs from various tissues and developmental stages. | |||
2. **Clustering**: ESTs are clustered based on sequence similarity to form groups that represent a single gene. | |||
3. **Assembly**: Within each cluster, sequences are assembled to create a consensus sequence, which is the unigene. | |||
4. **Annotation**: The resulting unigenes are annotated with functional information, such as [[gene ontology]] terms and [[pathway]] associations. | |||
== Challenges == | == Challenges == | ||
One of the main challenges in unigene construction is the presence of [[alternative splicing]], which can result in multiple transcripts from a single gene. This can complicate the clustering and assembly process. Additionally, sequencing errors and [[polymorphisms]] can affect the accuracy of unigene representation. | |||
== Related pages == | |||
* [[Gene expression]] | |||
* [[Expressed sequence tag]] | |||
* [[Genome annotation]] | |||
* [[Functional genomics]] | |||
[[Category:Genomics]] | [[Category:Genomics]] | ||
[[Category:Bioinformatics]] | [[Category:Bioinformatics]] | ||
Revision as of 15:45, 9 February 2025
Unigenes

Unigenes are a concept in genomics and bioinformatics that refer to a unique sequence of DNA that represents a single gene. The term is often used in the context of gene expression studies and genome annotation.
Overview
Unigenes are constructed by clustering expressed sequence tags (ESTs) and other cDNA sequences to form a non-redundant set of gene-oriented clusters. Each unigene cluster represents a unique gene locus, which can be used to study gene expression patterns across different tissues and conditions.
Applications
Unigenes are widely used in functional genomics to identify and annotate genes in newly sequenced genomes. They provide a valuable resource for comparative genomics and can help in identifying orthologs and paralogs across different species. Additionally, unigenes are instrumental in the development of microarrays and other high-throughput technologies for studying gene expression.
Construction
The construction of unigenes involves several steps:
1. **Collection of ESTs**: Large-scale sequencing projects generate ESTs from various tissues and developmental stages. 2. **Clustering**: ESTs are clustered based on sequence similarity to form groups that represent a single gene. 3. **Assembly**: Within each cluster, sequences are assembled to create a consensus sequence, which is the unigene. 4. **Annotation**: The resulting unigenes are annotated with functional information, such as gene ontology terms and pathway associations.
Challenges
One of the main challenges in unigene construction is the presence of alternative splicing, which can result in multiple transcripts from a single gene. This can complicate the clustering and assembly process. Additionally, sequencing errors and polymorphisms can affect the accuracy of unigene representation.