Sequence Analysis in a Nutshell: A Guide to Common Tools and Databases

Paperback (1ST)

$29.95 

Overview

Gene sequence data is the most abundant type of data available, and if you're interested in analyzing it, you'll find a wealth of computational methods and tools to help you. In fact, finding the data is not the challenge at all; rather, it is dealing with the plethora of flat file formats used to process the sequence entries and trying to remember what their specific field codes mean. If you survive by surrounding yourself with well-thumbed hard copies of readme files or remembering exactly where to look for the details when you need them, then Sequence Analysis in a Nutshell: A Guide to Common Tools and Databases is for you.
This book is a handy resource, as well as an invaluable reference, for anyone who needs to know about the practical aspects and mechanics of sequence analysis. It pulls together all of the vital information about the most commonly used databases, analytical tools, and tables used in sequence analysis.
The book is partitioned into three fundamental areas to help you maximize your use of the content. The first section, "Databases," contains examples of flat files from key databases (GenBank, EMBL, SWISS-PROT), the definitions of the codes or fields used in each database, and the sequence feature types/terms and qualifiers for the nucleotide and protein databases. The second section, "Tools," provides the command-line syntax for popular applications such as Readseq, MEME/MAST, BLAST, ClustalW, and the EMBOSS suite of analytical tools. The third section, "Appendixes," concentrates on information essential to understanding the individual components that make up a biological sequence. The tables in this section include nucleotide and protein codes, genetic codes, and other relevant information.
Written in O'Reilly's enormously popular, straightforward "Nutshell" format, this book draws together essential information for bioinformaticians in industry and academia, as well as for students. If sequence analysis is part of your daily life, you'll want this easy-to-use book on your desk.

Product Details

ISBN-13: 9780596004941
Publisher: O'Reilly Media, Incorporated
Publication date: 01/28/2003
Series: In a Nutshell (O'Reilly)
Edition description: 1ST
Pages: 302
Product dimensions: 6.00(w) x 9.00(h) x 0.69(d)

About the Author

Scott Markel is a Principal Software Architect at LION bioscience Inc., where he is responsible for providing architectural direction in the development of software for the life sciences, including the use and development of standards. He is a co-chair of the Life Sciences Research Domain Task Force of the Object Management Group, and also chairs the LSR's Architecture and Roadmap Working Group. Prior to working at LION, Scott worked at NetGenics, Johnson & Johnson Pharmaceutical Research & Development, and Sarnoff Corporation. He has a Ph.D. in mathematics from the University of Wisconsin-Madison. When Scott's not working or writing, he enjoys spending time with his wife and kids, reading European history books, and just enjoying life in sunny San Diego.

Darryl León is a Principal Scientific Architect at LION bioscience Inc., where he is responsible for providing scientific direction in the development of software for the life sciences. Prior to working at LION, Darryl worked at NetGenics, DoubleTwist, and Genset. He has taught at California Polytechnic State University, San Luis Obispo, and currently teaches a bioinformatics class at U.C. Santa Cruz Extension and U.C. San Diego Extension. He is also a member of the Bioinformatics Advisory Committee at U.C. San Diego Extension. Darryl has a Ph.D. in biochemistry from the University of California, San Diego, and did his postdoctoral research at the University of California, Santa Cruz.

Table of Contents

  • Preface
  • Data Formats
    • Chapter 1: FASTA Format
    • Chapter 2: GenBank/EMBL/DDBJ
    • Chapter 3: SWISS-PROT
    • Chapter 4: Pfam
    • Chapter 5: PROSITE
  • Tools
    • Chapter 6: Readseq
    • Chapter 7: BLAST
    • Chapter 8: BLAT
    • Chapter 9: ClustalW
    • Chapter 10: HMMER
    • Chapter 11: MEME/MAST
    • Chapter 12: EMBOSS
  • Appendixes
    • Nucleotide and Amino Acid Tables
    • Genetic Codes
    • Resources
    • Future Plans
  • Colophon