UniProt: Difference between revisions

From WikiMD's Wellness Encyclopedia

CSV import
CSV import
 
Line 1: Line 1:
== UniProt ==
{{DISPLAYTITLE:UniProt}}


[[File:Uniprot-logo.img.svg|thumb|Logo of UniProt]]
== Overview ==
 
[[File:Uniprot-logo.img.svg|thumb|right|The UniProt logo]]
'''UniProt''' is a comprehensive, high-quality, and freely accessible resource of protein sequence and functional information. It is a central hub for the collection of functional information on proteins, with accurate, consistent, and rich annotation. The UniProt databases are maintained by the [[UniProt Consortium]], which includes the [[European Bioinformatics Institute]] (EBI), the [[Swiss Institute of Bioinformatics]] (SIB), and the [[Protein Information Resource]] (PIR).
'''UniProt''' (Universal Protein Resource) is a comprehensive, high-quality, and freely accessible database of protein sequence and functional information. It is a central hub for the collection of functional information on proteins, with accurate, consistent, and rich annotation. UniProt is a collaboration between the [[European Bioinformatics Institute]] (EBI), the [[Swiss Institute of Bioinformatics]] (SIB), and the [[Protein Information Resource]] (PIR).


== History ==
== History ==
The UniProt project was initiated in December 2002 as a merger of three existing protein databases: Swiss-Prot, TrEMBL, and PIR-PSD. The goal was to create a single, comprehensive resource that would provide a more complete and accurate picture of protein sequences and their functions. The first release of UniProt was in 2003.
UniProt was created in 2002 by the merger of three major protein sequence databases: Swiss-Prot, TrEMBL, and PIR-PSD. The goal was to provide a single, centralized resource for protein sequence and functional information.


== Components ==
== Components ==
UniProt consists of several components:
UniProt consists of several components:


* '''[[UniProtKB]] (UniProt Knowledgebase)''': This is the central database of protein sequences and functional information. It is divided into two sections:
* '''[[UniProtKB]]''' (UniProt Knowledgebase): The central database of protein sequences and functional information, which is divided into two sections:
   * '''[[Swiss-Prot]]''': A manually annotated and reviewed section of UniProtKB.
   * '''[[Swiss-Prot]]''': A manually annotated and reviewed section.
   * '''[[TrEMBL]]''': A computer-annotated section of UniProtKB, which is not reviewed.
   * '''[[TrEMBL]]''': A section that contains computationally analyzed records that await full manual annotation.


* '''[[UniParc]] (UniProt Archive)''': A comprehensive and non-redundant database that contains most of the publicly available protein sequences in the world.
* '''[[UniParc]]''' (UniProt Archive): A comprehensive and non-redundant database that contains most of the publicly available protein sequences in the world.


* '''[[UniRef]] (UniProt Reference Clusters)''': Provides clustered sets of sequences from UniProtKB and selected UniParc records to obtain complete coverage of sequence space at several resolutions.
* '''[[UniRef]]''' (UniProt Reference Clusters): Provides clustered sets of sequences from UniProtKB and selected UniParc records to obtain complete coverage of sequence space at several resolutions.


== Features ==
== Features ==
UniProt provides a wide range of features, including:
UniProt provides a wide range of features, including:


* Detailed [[annotation]] of protein sequences, including information on function, domain structure, post-translational modifications, variants, and more.
* Detailed [[protein sequence]] information.
* Cross-references to other databases, providing a network of biological information.
* Functional annotations such as [[protein function]], [[enzyme]] activity, and [[biological process]] involvement.
* Tools for sequence analysis and retrieval.
* Information on [[protein structure]], [[post-translational modification]]s, and [[protein-protein interaction]]s.
* Regular updates to ensure the most current data is available.
* Cross-references to other databases, including [[genomic]] and [[proteomic]] resources.


== Access and Use ==
== Access and Tools ==
UniProt is freely accessible to the public and can be accessed through its website. Users can search for proteins by name, function, or sequence, and download data in various formats. The database is widely used by researchers in the fields of [[bioinformatics]], [[molecular biology]], and [[genomics]].
UniProt is accessible through its website, which provides a user-friendly interface for searching and retrieving data. It also offers various tools for sequence analysis, including:


== Related pages ==
* '''BLAST''': For sequence similarity searching.
* [[Protein Data Bank]]
* '''Align''': For multiple sequence alignment.
* [[GenBank]]
* '''Retrieve/ID mapping''': For converting between different database identifiers.
* [[Ensembl]]
 
== Applications ==
UniProt is widely used in [[bioinformatics]], [[molecular biology]], and [[biomedical research]]. It supports a variety of applications, including:


== References ==
* [[Drug discovery]] and [[development]].
{{Reflist}}
* [[Genomics]] and [[proteomics]] research.
* [[Functional genomics]] studies.


== External links ==
== Related pages ==
* [https://www.uniprot.org/ Official UniProt website]
* [[Protein structure]]
* [[Bioinformatics]]
* [[Genomics]]
* [[Proteomics]]


[[Category:Bioinformatics databases]]
[[Category:Protein structure]]
[[Category:Biological databases]]
[[Category:Biological databases]]
[[Category:Bioinformatics]]
[[Category:Protein structure]]

Latest revision as of 03:41, 13 February 2025


Overview

UniProt (Universal Protein Resource) is a comprehensive, high-quality, and freely accessible database of protein sequence and functional information. It is a central hub for the collection of functional information on proteins, with accurate, consistent, and rich annotation. UniProt is a collaboration between the European Bioinformatics Institute (EBI), the Swiss Institute of Bioinformatics (SIB), and the Protein Information Resource (PIR).

History

UniProt was created in 2002 by the merger of three major protein sequence databases: Swiss-Prot, TrEMBL, and PIR-PSD. The goal was to provide a single, centralized resource for protein sequence and functional information.

Components

UniProt consists of several components:

  • UniProtKB (UniProt Knowledgebase): The central database of protein sequences and functional information, which is divided into two sections:
      • Swiss-Prot: A manually annotated and reviewed section.
      • TrEMBL: A section that contains computationally analyzed records that await full manual annotation.
  • UniParc (UniProt Archive): A comprehensive and non-redundant database that contains most of the publicly available protein sequences in the world.
  • UniRef (UniProt Reference Clusters): Provides clustered sets of sequences from UniProtKB and selected UniParc records to obtain complete coverage of sequence space at several resolutions.
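
The split of UniProtKB into a reviewed and an unreviewed section is visible in downloaded data. As a minimal sketch, the Python snippet below assumes the FASTA header layout `>db|ACCESSION|ENTRY_NAME description`, where `db` is `sp` for Swiss-Prot and `tr` for TrEMBL. That layout is not described in this article, and the names `HeaderInfo` and `parse_header` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple, Optional

# Assumed header layout (not stated in the article):
#   >db|ACCESSION|ENTRY_NAME description
# with db == "sp" for Swiss-Prot (reviewed) and "tr" for TrEMBL (unreviewed).
HEADER_RE = re.compile(r"^>(sp|tr)\|([^|]+)\|(\S+)\s*(.*)$")


class HeaderInfo(NamedTuple):
    section: str       # "Swiss-Prot" or "TrEMBL"
    accession: str     # e.g. "P12345"
    entry_name: str    # e.g. "AATM_RABIT"
    description: str   # free text after the entry name
    reviewed: bool     # True only for Swiss-Prot records


def parse_header(line: str) -> Optional[HeaderInfo]:
    """Parse one FASTA header line; return None if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    db, accession, entry_name, description = match.groups()
    section = "Swiss-Prot" if db == "sp" else "TrEMBL"
    return HeaderInfo(section, accession, entry_name, description, db == "sp")
```

Calling `parse_header(">sp|P12345|AATM_RABIT Aspartate aminotransferase, mitochondrial")` yields a record whose `section` is `"Swiss-Prot"`. Headers in other layouts (for example UniRef cluster headers) do not match the pattern and return `None`.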

Features

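UniProt provides a wide range of features, including:

  • Detailed protein sequence information.
  • Functional annotations such as protein function, enzyme activity, and biological process involvement.
  • Information on protein structure, post-translational modifications, and protein-protein interactions.
  • Cross-references to other databases, including genomic and proteomic resources.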

Access and Tools

UniProt is accessible through its website, which provides a user-friendly interface for searching and retrieving data. It also offers various tools for sequence analysis, including:

  • BLAST: For sequence similarity searching.
  • Align: For multiple sequence alignment.
  • Retrieve/ID mapping: For converting between different database identifiers.
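
Besides the website, data can also be retrieved programmatically. The Python sketch below builds request URLs for a single entry and for a keyword search. The base address `https://rest.uniprot.org/uniprotkb`, the `query`, `format`, and `size` parameters, and the `reviewed:true` query term are assumptions not taken from this article and should be checked against the current UniProt documentation. The helper names `entry_url`, `search_url`, and `fetch_text` are hypothetical.

```python
from urllib.parse import quote, urlencode
from urllib.request import urlopen

# Assumed REST base address; not given in the article.
BASE_URL = "https://rest.uniprot.org/uniprotkb"


def entry_url(accession: str, fmt: str = "fasta") -> str:
    """Build the URL for one entry, e.g. accession 'P12345' in FASTA format."""
    return f"{BASE_URL}/{quote(accession)}.{fmt}"


def search_url(query: str, fmt: str = "fasta", size: int = 10) -> str:
    """Build a search URL; the parameter names are assumptions."""
    params = urlencode({"query": query, "format": fmt, "size": size})
    return f"{BASE_URL}/search?{params}"


def fetch_text(url: str, timeout: float = 30.0) -> str:
    """Download a URL and return the body as text (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

For example, `fetch_text(entry_url("P12345"))` would download one record if the assumed endpoint is reachable. The URL builders are kept separate from the download step so they can be checked without network access.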

Applications

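UniProt is widely used in bioinformatics, molecular biology, and biomedical research. It supports a variety of applications, including:

  • Drug discovery and development.
  • Genomics and proteomics research.
  • Functional genomics studies.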

Related pages

  • Protein structure
  • Bioinformatics
  • Genomics
  • Proteomics