Contig

From WikiMD's WELLNESSPEDIA

Contig[edit]

A contig is a set of overlapping DNA segments that together represent a consensus region of DNA. In genomics, contigs are used to assemble the sequence of a genome from smaller fragments. The term "contig" is derived from "contiguous," indicating that the DNA segments are continuous and connected.

File:PET contig scaffold.png
Diagram illustrating the relationship between contigs, scaffolds, and paired-end tags.

Formation of Contigs[edit]

Contigs are formed during the process of DNA sequencing and genome assembly. In this process, short sequences of DNA, known as reads, are generated from a DNA library. These reads are then aligned and merged based on overlapping regions to form longer contiguous sequences, or contigs.

The assembly of contigs is a critical step in shotgun sequencing, where the genome is broken into random fragments, sequenced, and then reassembled. The goal is to reconstruct the original sequence of the genome as accurately as possible.

Contigs and Scaffolds[edit]

While contigs are continuous sequences, they do not necessarily represent the entire genome. Gaps may exist between contigs due to repetitive sequences or regions that are difficult to sequence. To bridge these gaps, contigs are further organized into scaffolds.

Scaffolds are formed by linking contigs using additional information, such as paired-end reads or mate-pair reads, which provide spatial information about the relative positions of contigs. This process helps to order and orient contigs, creating a more complete representation of the genome.

Applications of Contigs[edit]

Contigs are fundamental in various applications of genomics, including:

  • Genome Assembly: Contigs are the building blocks of genome assembly, allowing researchers to reconstruct the sequence of an organism's genome.
  • Comparative Genomics: By comparing contigs from different organisms, scientists can identify homologous sequences and study evolutionary relationships.
  • Gene Annotation: Contigs provide the sequence context necessary for identifying and annotating genes and other functional elements within a genome.

Related pages[edit]

Medical Disclaimer: WikiMD is for informational purposes only and is not a substitute for professional medical advice. Content may be inaccurate or outdated and should not be used for diagnosis or treatment. Always consult your healthcare provider for medical decisions. Verify information with trusted sources such as CDC.gov and NIH.gov. By using this site, you agree that WikiMD is not liable for any outcomes related to its content. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates, categories Wikipedia, licensed under CC BY SA or similar.