In biological research, there are several occasions in which genetic comparisons are needed. For instance, there are times when there is not much information on a specific organism or enzyme that is of interest to the researcher. Therefore it is useful to be able to find genetically similar organisms or enzymes that are better researched. The genetics can help justify the use of these organisms as a base for setting up an experiment or background for an experiment. Additionally, genetics can be used to determine the evolutionary relationships between organisms. The more genetically similar organisms or enzymes are to each other then the more recently they branched from a common ancestor. It is therefore useful to become familiar with this type of software.
National Center for Biotechnology Information (NCBI) is a database where, among other things, genes can be easily searched for and found. Basic Local Alignment Search Tool (BLAST) is a tool on NCBI that allows the comparison of a query sequence with all other sequences in the database.
The subject for this project to familiarize people with the database and tools is the enzyme Glutathione S-Transferase (GST) in Mus musculus (house mouse). The mouse was chosen because its genome has been extensively researched and sequenced. GSTs are enzymes that have been found to be a part of the cell’s protection from multiple dangerous chemicals that are produced both in the environment and within the organism.
GSTs are split into three different superfamilies. The largest of these superfamilies is the cytosolic and this superfamily has thirteen recognized classes: alpha, delta, beta, epsilon, zeta, theta, mu, nu, pi, sigma, tau, phi and omega. You can recognize each superfamily by the letter that directly follows Gst in the name section (i.e. the alpha superfamily is Gsta) and then each superfamily has different categories under them, which are distinguished by numbers.
Conserved Domains are sections of the genome that are similar in several different species. This suggests that they perform an important function for the organism and therefore it has been preserved through evolution.
This project should not take more than 20 minutes.
Step 1: Go to the National Center for Biotechnology Information
Then change "All Databases" to "Gene" in the box next to the search bar.
Step 2: Search the Database
Step 3: Pick a Specific Gene
The gene used for the duration of the instructions and is therefore in all of the pictures is Gsta1. Just click on the name of the gene (in purple in the picture but it will be blue on the website) that you choose to get to the next step.
Step 4: Get the Sequence
Copy the accession number (this should start with NP_ and is followed by a string of numbers). When you copy this, make sure the NP_ is included.
Step 5: Go to BLAST
Step 6: Enter Query
Make sure that the database is on "non-redundant protein sequences (nr)" and the Algorithm is "blast (protein-protein BLAST)." You don't have to worry about any of the other sections or boxes.
Click the BLAST button at the bottom of the page.
Step 7: Results Page
Once on the results page scroll down to the "descriptions" section.
Step 8: Select Sequences to Compare
Click the boxes next to the sequences that you want to compare side-by-side. Again, since the purpose is to familiarize yourself with the software, It does not matter which ones you choose.
Then click on the "Graphics" link on the top of the descriptions box.
Step 9: Find Conserved Domains
You can zoom in and out to see the actual sequence and move the view up and down the protein. This is helpful in finding the conserved domains across the different species.
Step 10: Comparing Two Sequences
This lines up the query (accession number you typed in) and the subject (gene you clicked on in "Descriptions"), with the similar proteins listed in the middle line. If there is not a protein in common, then there will be a blank space.
You are now familiar with the basic functions of these websites if you ever need to use genetic comparisons in research.