An approach to what can be accomplished in Silico
Introduction Sequence Structure Function
iHOP means information Hyperlinked Over Proteins. The iHop server is a very efficient tool allowing finding all
the papers in which a particular gene name is cited. Let's try it:
1/ Go to the Web site
2/ Type a gene name (e.g. vps36) or an accession number (e.g. UniProt
or NCBI Gene, etc.) into the search field in the menu bar.
3/ In many cases your search term will correspond to various genes or
proteins. Click on the "paper" icon on the right-hand side to select
the corresponding gene. When you mouse goes over it, it should be
written "interaction information for this gene"
4/ The "interaction information" contains all sentences found in the
literature for the main gene of a page (gene A) in red, and other genes
(gene B) in Blue. Gene symbols within sentences are hyperlinked.
Therefore, clicking on a gene symbol (e.g. gene B) will bring you to
the page of gene B.
5/ A nice option is the Enhanced PubMed/Google query. Each gene page
contains at the top a link to execute it. These queries allow you to
quickly query PubMed or Google for a specific gene including all its
synonyms and orthographic variations.
6/ You can also tune your iHop navigation, just use the Filter and
Options link in the menu bar to change overall appearance parameters.
STRING is a web server containing 7 types of functional information for 736429 proteins in 179 species at the moment.
Let's see if it can give any useful information about our gene
1/ Go to the STRING web site ,
2/ Enter your gene name; alternatively if no gene is found you can paste a protein sequence
3/ Look at the result table:
the functionnally related protein found are displayed and the evidence type used
to infer the relation is specified.
Here, you can:
, is very interesting:
it displays the network of proteins present in the result box and allows you to grow the network step by step.
Up to now, we used "NCBI protein", if you select "NCBI gene" ,
you will see that you actually have access to plenty of extra-information such as interactions.
1/ So, go there ,
2/ Type in .. do I need to tell you ;) ?
3/ Select the good gene,
4/ Look at all this information!
If your gene of interest is from Yeast, Worm or Fly, you should have a look at the organism database. These databases summarize all of the features of a particular gene or protein. Therefore, they are good to get a quick overview of what is known in contrast to literature mining approaches (e.g. iHOP), which allow getting more detailed information.
It might also be worth checking EnSembl as it gives good access to gene structure and genome organization for a wide range of organisms.