Bioinformatics with Python Cookbook

Tiago Antao

Product Description

Learn how to use modern Python bioinformatics libraries and applications to do cutting-edge research in computational biology

About This Book

  • Discover and learn the most important Python libraries and applications for performing complex bioinformatics analyses
  • Focus on the most modern tools for research in next-generation sequencing, genomics, population genetics, phylogenomics, and proteomics
  • Use real-world examples to learn how to implement high-impact research methods

Who This Book Is For

If you have intermediate-level knowledge of Python and are familiar with the main research and vocabulary of your bioinformatics topic of interest, this book will help you develop your knowledge further.

What You Will Learn

  • Gain a deep understanding of Python's fundamental bioinformatics libraries and be exposed to the most important data science tools in Python
  • Process genome-wide data with Biopython
  • Analyze and perform quality control on next-generation sequencing datasets using libraries such as PyVCF or PySAM
  • Use DendroPy and Biopython for phylogenetic analysis
  • Perform population genetics analysis on large datasets
  • Simulate complex demographies and genomic features with simuPOP

In Detail

If you are either a computational biologist or a Python programmer, you will probably relate to the expression "explosive growth, exciting times". Python is arguably the main programming language for big data, and the deluge of data in biology, mostly from genomics and proteomics, makes bioinformatics one of the most exciting fields in data science.

Using the hands-on recipes in this book, you'll be able to do practical research and analysis in computational biology with Python. We cover modern, next-generation sequencing libraries and explore real-world examples of how to handle real data. The main focus of the book is the practical application of bioinformatics, but we also cover modern programming techniques and frameworks to deal with the ever-increasing deluge of bioinformatics data.
