[Genome] mouse-human orthologs from genome alignments
Rachel Harte
hartera at soe.ucsc.edu
Wed Mar 29 10:46:09 PST 2006
Hello Avril,
The mmBlastTab does indeed have best Blastp matches between the proteins
for the human Known Genes set and the proteins for the mouse Known Genes
set. The problem with the Blastp set is that it include orthologs and
paralogs. Another problem is that proteins that are mostly just a single
domain could be matched to other proteins only because they share the
domain and have high levels of identity in this region. In this way,
paralogs could be matched instead of orthologs using this method.
The nets are the best alignments to use for finding orthologs between
species. The mouse net for the human genome has the best mouse match for
each region of the human genome. The reverse is true for the human net for
the mouse genome. It sounds like the method that you outlined below would
work fine for finding the mouse-human orthologs using the net files.
You might find it helpful to use out pslMap program:
usage:
pslMap [options] inPsl mapFile outPsl
This program can take a psl file e.g. the refSeqAli table for human, and
using the mouse chains for human as the mapFile, you can map the positions of
the human mRNAs to the mouse genome. This program is available in our
source code:
http://hgwdev.cse.ucsc.edu/FAQ/FAQlicense#license3
pslMap is in the kent/src/hg/pslMap directory.
I hope that this helps you. Please let us know if you have further
questions.
Rachel
On Tue, 28 Mar 2006, Avril Coghlan wrote:
> Hello,
>
> I am wondering whether you have a list of human-mouse orthologs
> that are inferred from the human-mouse genome alignment?
>
> As far as I can see, the human-mouse orthologs given in the Table
> Browser (mmBlastTab table) are derived from best blast matches
> (and not from the human-mouse genome alignment), is that right?
>
> I would like to infer human-mouse orthologs from the genome alignment,
> and am not sure if this information is already stored in the UCSC
> genome browser, or download files.
>
> So far I've figured out a way to do this is:
> (i) find the coordinates of human and mouse genes from the Table Browser
> (ii) for each human gene X, use the hg17.mm7.net file to find
> the mouse region M1 that aligns best to the human region H1
> containing
> gene X
> (iii) use the hg17.mm7.all.chain file to find which bases in the mouse
> region M1 are aligned to human gene X in human region H1
> (iv) see which mouse gene Y is found in the mouse bases that align
> to human gene X
> Does this sound right? Is there an easier way to infer human-mouse
> orthologs from the genome alignment? I understand from the paper
> Kent et al 2003 'Evolution's cauldron' that you have inferred
> human-mouse
> orthologs from the genome alignment before, am I doing it the right
> way?
>
> I'm new to the UCSC genome browser (which I'm finding very nice)
> so I'm sorry in advance if this is a silly question.
>
> regards
> Avril Coghlan
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>
--
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
More information about the Genome
mailing list