Pandas for Everyone: Python Data Analysis (Addison-Wesley Data & Analytics Series)

Daniel Y. Chen

Product Description

This tutorial teaches everything you need to get started with Python programming for the fast-growing field of data analysis. Daniel Chen tightly links each new concept with easy-to-apply, relevant examples from modern data analysis.

Unlike other beginner's books, this guide helps today's newcomers learn both Python and its popular Pandas data science toolset in the context of tasks they'll really want to perform. Following the proven Software Carpentry approach to teaching programming, Chen introduces each concept with a simple motivating example, slowly offering deeper insights and expanding your ability to handle concrete tasks.

Each chapter is illuminated with a concept map: an intuitive visual index of what you'll learn, and an easy way to refer back to what you've already learned. An extensive set of easy-to-read appendices helps you fill knowledge gaps wherever they may exist. Coverage includes:

  • Setting up your Python and Pandas environment
  • Getting started with Pandas dataframes
  • Using dataframes to calculate and perform basic statistical tasks
  • Plotting in Matplotlib
  • Cleaning data, reshaping dataframes, handling missing values, working with dates, and more
  • Building basic data analytics models
  • Applying machine learning techniques: both supervised and unsupervised
  • Creating reproducible documents using literate programming techniques
