Gene Comparisons

Next

ninja icon

Application:

•  Comparison of the number of genes in humans with other species

    
The number of genes present in an organism will differ between species and is not a valid indicator of biological complexity

The number of genes in a genome is usually predicted by identifying sequences common to genes

  • These identifying regions may include expressed sequence tags (ESTs) or sequences that are homologous to known genes
  • The presence of pseudogenes and transposons make accurate counts of unique gene numbers very difficult


As scientists may use different approaches to predicting gene numbers, final estimations can vary significantly 

  • For instance, the number of genes in rice (Oryza sativa) is estimated as being between 32,000 – 50,000
  • The number of genes in humans is estimated as being between 19,000 – 25,000


Gene Comparisons Between Different Species

gene number


ninja icon

Skill:

•  Use of a database to determine differences in the base sequence of a gene in two species

    
Gene sequences from different species can be identified and then compared using two online resources:

  • GenBank – a genetic database that serves as an annotated collection of DNA sequences
  • Clustal Omega – an alignment program that compares multiple sequences of DNA








Identifying Gene Sequences

GenBank can be used to identify the DNA sequence for a gene in a number of different species

To identify a specific gene sequence:

  • Change the search parameter from nucleotide to gene and type in the name of the gene of interest
  • Choose the species of interest and click on the link (under ‘Name / Gene ID’)
  • Scroll to the ‘Genomic regions, transcripts and products’ section and click on the ‘FASTA’ link


Below are examples of different genes that may be searched for:

  • HBB  –  Haemoglobin beta gene
  • COX1  –  Cytochrome oxidase 1 gene
  • F8  –  Coagulation factor VIII gene
  • IGF1R  –  Insulin growth factor 1 receptor gene


Comparing Gene Sequences

Clustal Omega aligns multiple gene sequences to allow for the determination of differences in the base sequence

To construct a multiple alignment:

  • Change the input sequence type to DNA and paste the relevant FASTA sequences into the provided space
  • Before each sequence designate a species name preceded by a forward arrow (e.g. '>Human’ or ‘>Chimpanzee’)
  • Alternatively, sequences can be saved as a document in plain text format (.txt) and then uploaded
  • When all sequences have been included, click ‘Submit’ (under step 3)


Clustal Omega possesses several useful features that are absent in alternative tools like BLASTN:

  • Multiple (more than two) sequences can be compared at once
  • Sequence consensus is colour coded in a Jalview applet found under ‘Result Summary’ (requires Java)
  • Branched phylograms can be generated to show evolutionary relationships (under ‘Phylogenetic Tree’)


Below is a plain text file that can be uploaded to compare gene sequence fragments from different species:

  • CHRNE  –  Cholinergic receptor epsilon (nicotinic) for various species


Sequence Alignment of DNA from Two Species

sequence alignment