|
There are a number of approaches. Within Bioperl take a look at Bio::Tools::SeqPattern. Or, take a look at the TFBS package, at http://forkhead.cgb.ki.se/TFBS (Transcription Factor Binding Site). This Bioperl-compliant package specializes in pattern searching of nucleotide sequence using matrices. It's also conceivable that the combination of Bioperl and Perl's regular expressions could do the trick. You might also consider the CPAN module String::Approx (this module addresses the percent match query), but experienced users question whether its distance estimates are correct, the Unix agrep command is thought to be faster and more accurate. Finally, you could use EMBOSS, as discussed in the previous question (or you could use Pise to run EMBOSS applications). The relevant programs would be fuzzpro or fuzznuc.
|
How do I do motif searches with Bioperl? Can I do "find all sequences that are 75% identical" to a given motif?
Related Questions
- Bioperl is a toolkit of perl modules useful in building bioinformatics solutions in perl. It is built in an ...
- We have a universal version number for a release. This is set in the module Bio::Root::Version which is ...
- We suggest the following. Post your idea to the appropriate mailing list. If it is a really new idea consider ...
- Please cite it as: Stajich JE, Block D, Boulez K, Brenner SE, Chervitz SA, Dagdigian C, Fuellen G, Gilbert ...
- We are trying to build a standard starting point for analysis application which will probably look like Bio:: ...