[Genome] Need some help

Ann Zweig ann at soe.ucsc.edu
Fri Mar 3 11:55:30 PST 2006


Scott-

	Now I understand.  You are correct in remembering that the on line BLAT 
program only allows 25 sequences at a time.  What will be helpful to you 
is your own BLAT executable.  Read about getting a copy of the BLAT 
source here: http://genome.ucsc.edu/FAQ/FAQblat.html#blat3.    For 
non-commercial users like yourself no license is required.  Once you
have that installed on your machine, you can type 'blat' at the 
command-line to see the usage statement for the tool.

	The tool will provide you with an output file containing the best hits 
for each sequence.  After you get the output, you may want to filter out 
on the best few hits (or best single hit) for each sequence.  You can do 
this using the pslCDnaFilter tool.  This tool requires alignments that 
have the repMatch psl field set even when doing alignments without 
masking.  The command line blat option -repeats=lower enables counting 
repeat matches.

	Read about how to download the pslCDnaFilter tool from our source code 
here: http://genome.ucsc.edu/FAQ/FAQlicense#license3.  You will find the 
tool here: /cluster/bin/i386/pslCDnaFilter.

	You will find another filtering tool (called pslReps) in the same 
location in the source tree that will provide you only the single best 
match (or within X% of best).  As with the BLAT tool, simply type the 
name of the tool at the command line with no parameters to see the usage 
statement.


Hope this is helpful.

Regards,

----------
Ann Zweig
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu

Hebbring, Scott J. wrote:
> You are on the right track.  I want to use the short sequence fragments from "chrN_random.fa.gz" (about 2000 sequences) and blat these across the genome.  If I recall, I can only blat 25 sequences at one time.  Is there a faster way?  I would like to do this under the assumption that one of these fragments might not be aligning because it falls on one of the breakpoints of my deletion/duplication.
> 
> Thanks for the help
> Scott Hebbring 
> 
> -----Original Message-----
> From: Ann Zweig [mailto:ann at soe.ucsc.edu] 
> Sent: Wednesday, March 01, 2006 4:57 PM
> To: Hebbring, Scott J.
> Cc: 'genome at soe.ucsc.edu'
> Subject: Re: [Genome] Need some help
> 
> Hello Scott-
> 
> 	I am a little unclear on exactly what you are asking, but let me take a 
> stab at it.
> 
> 	I think that you would like to BLAT the fragments of gene that you get 
> when you break it at these breakpoints to see where they align on the 
> genome.  Do you already have these sequence fragments?  If so, then you 
> would want to BLAT and then do some sort of filtering to get the best 
> hits.  From those results, you could make a custom track and or do Table 
> Browser queries.
> 
> 	If I am on the right track, let me know and I'll point you towards some 
> tools that may be helpful.  If not, please re-explain your problem.  Thanks.
> 
> 
> Regards,
> 
> ----------
> Ann Zweig
> UCSC Genome Bioinformatics Group
> http://genome.ucsc.edu
> 
> 
> 
> Hebbring, Scott J. wrote:
> 
>>For background, I have a gene that is polymorphicly duplicated and deleted.  I wanted to try and use chr?_random.fa.gz sequence from your site to help map the break points.  Is there a good way to download these fragments as a custom track or to input these sequences in the Table browser?  In a sense, I want to do a BLAT of 2,000 small sequences.
>>Thanks
>>Scott Hebbring
>>
>>_______________________________________________
>>Genome maillist  -  Genome at soe.ucsc.edu
>>http://www.soe.ucsc.edu/mailman/listinfo/genome




More information about the Genome mailing list