Please cite as: CSH Protocols; 2007; doi:10.1101/pdb.top16

This Protocol
Right arrow Full Text
Right arrow Update/discuss this protocolDiscussion icon
Right arrow Alert me when this protocol is cited
Right arrow Alert me when comments are published
Right arrow Alert me if a correction is posted
Services
Right arrow Similar protocols in this database
Right arrow Alert me to new releases of protocols
Right arrow Save to Personal Folders
Right arrow Download to citation manager
Right arrow Printer-friendly versionPrinter-friendly version
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mount, D. W.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Mount, D. W.
Related Collections
Right arrow Proteins and Proteomics, general
Right arrow Proteomics
Right arrow Genome Analysis
Right arrow Bioinformatics/Genomics, general
Right arrow Alignment of Sequences
Right arrow Alignment of Sequences, general
Right arrow Alignment of Pairs of Sequences
Right arrow Multiple Sequence Alignment
Right arrow Sequence Database Searching
Right arrow Computational Biology
Right arrow Genetic Variation
Right arrowRelated Articles
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Legend icon

topic_introductionTopic Introduction

Using a FASTA Sequence Database Similarity Search

David W. Mount

Adapted from "Sequence Database Searching for Similar Sequences," Chapter 6, in Bioinformatics: Sequence and Genome Analysis, 2nd edition, by David W. Mount. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, USA, 2004.


INTRODUCTION

FASTA is a program for rapid alignment of pairs of protein and DNA sequences. Rather than comparing individual residues in the two sequences, FASTA searches for matching sequence patterns or words, called k-tuples. These patterns comprise k consecutive matches of letters in both sequences. The program then attempts to build a local alignment based on these word matches. Due to the ability of the algorithm to find matching sequences in a sequence database with high speed, FASTA is useful for routine database searches of this type. Comparable methods are the BLAST program, which is faster than FASTA, is of comparable sensitivity for protein queries, and also does DNA searches, and programs that use the Smith-Waterman dynamic programming algorithm for protein and DNA searches, which are slower but more sensitive when full-length protein sequences are used as queries.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?

Related Articles

Recommended Steps for a FASTA Search
David W. Mount
CSH Protocols 2007: 40. [Abstract] [Full Text]

Steps Used by the BLAST Algorithm
David W. Mount
CSH Protocols 2007: 41. [Abstract] [Full Text]

Strategies for Sequence Similarity Database Searches
David W. Mount
CSH Protocols 2007: 15. [Abstract] [Full Text]

Using the Basic Local Alignment Search Tool (BLAST)
David W. Mount
CSH Protocols 2007: 17. [Abstract] [Full Text]



This article has been cited by other articles:


Home page
CSHL ProtocolsHome page
D. W. Mount
Studies of Varying Alignment Algorithm, Amino Acid Scoring Matrix, and Gap Penalties
CSH Protocols, June 1, 2008; 2008(7): pdb.ip60 - pdb.ip60.
[Abstract] [Full Text]


Home page
CSHL ProtocolsHome page
D. W. Mount
Using PAM Matrices in Sequence Alignments
CSH Protocols, June 1, 2008; 2008(7): pdb.top38 - pdb.top38.
[Abstract] [Full Text]


Home page
CSHL ProtocolsHome page
D. W. Mount
Using Gaps and Gap Penalties to Optimize Pairwise Sequence Alignments
CSH Protocols, June 1, 2008; 2008(7): pdb.top40 - pdb.top40.
[Abstract] [Full Text]