Hands-On Data Science with Anaconda: Utilize the right mix of tools to create high-performance data science applications

Dr. Yuxing Yan, James Yan

  • 出版商: Packt Publishing
  • 出版日期: 2018-05-31
  • 售價: $1,500
  • 貴賓價: 9.5$1,425
  • 語言: 英文
  • 頁數: 364
  • 裝訂: Paperback
  • ISBN: 1788831195
  • ISBN-13: 9781788831192
  • 相關分類: Data Science
  • 相關翻譯: Anaconda 數據科學實戰 (簡中版)
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

Develop, deploy, and streamline your data science projects with the most popular end-to-end platform, Anaconda

Key Features

  • Use Anaconda to find solutions for clustering, classification, and linear regression
  • Analyze your data efficiently with the most powerful data science stack
  • Use the Anaconda cloud to store, share, and discover projects and libraries

Book Description

Anaconda is an open source platform that brings together the best tools for data science professionals with more than 100 popular packages supporting Python, Scala, and R languages. Hands-On Data Science with Anaconda gets you started with Anaconda and demonstrates how you can use it to perform data science operations in the real world.

The book begins with setting up the environment for Anaconda platform in order to make it accessible for tools and frameworks such as Jupyter, pandas, matplotlib, Python, R, Julia, and more. You'll walk through package manager Conda, through which you can automatically manage all packages including cross-language dependencies, and work across Linux, macOS, and Windows. You'll explore all the essentials of data science and linear algebra to perform data science tasks using packages such as SciPy, contrastive, scikit-learn, Rattle, and Rmixmod.

Once you're accustomed to all this, you'll start with operations in data science such as cleaning, sorting, and data classification. You'll move on to learning how to perform tasks such as clustering, regression, prediction, and building machine learning models and optimizing them. In addition to this, you'll learn how to visualize data using the packages available for Julia, Python, and R.

What you will learn

  • Perform cleaning, sorting, classification, clustering, regression, and dataset modeling using Anaconda
  • Use the package manager conda and discover, install, and use functionally efficient and scalable packages
  • Get comfortable with heterogeneous data exploration using multiple languages within a project
  • Perform distributed computing and use Anaconda Accelerate to optimize computational powers
  • Discover and share packages, notebooks, and environments, and use shared project drives on Anaconda Cloud
  • Tackle advanced data prediction problems

Who This Book Is For

Hands-On Data Science with Anaconda is for you if you are a developer who is looking for the best tools in the market to perform data science. It's also ideal for data analysts and data science professionals who want to improve the efficiency of their data science applications by using the best libraries in multiple languages. Basic programming knowledge with R or Python and introductory knowledge of linear algebra is expected.

Table of Contents

  1. Ecosystem of Anaconda
  2. Anaconda Installation
  3. Data basics
  4. Data visualization
  5. Statistics modeling in Anaconda
  6. Managing packages
  7. Optimization in Anaconda
  8. Unsupervised Learning in Anaconda
  9. Supervised Learning in Anaconda
  10. Predictive Data Analytics: Modelling and Validation
  11. Anaconda Cloud
  12. Distributed computing, parallel computing and HPCC

商品描述(中文翻譯)

使用最受歡迎的端到端平台Anaconda,開發、部署和優化您的數據科學項目。

主要特點:
- 使用Anaconda解決聚類、分類和線性回歸問題
- 使用最強大的數據科學堆棧高效分析數據
- 使用Anaconda雲端存儲、分享和發現項目和庫

書籍描述:
Anaconda是一個開源平台,匯集了支持Python、Scala和R語言的100多個熱門包,為數據科學專業人員提供最佳工具。《Hands-On Data Science with Anaconda》將帶您入門Anaconda,並演示如何在現實世界中使用它進行數據科學操作。

本書首先介紹了為Anaconda平台設置環境的步驟,以便使其能夠與Jupyter、pandas、matplotlib、Python、R、Julia等工具和框架配合使用。您將通過包管理器Conda來管理所有包,包括跨語言依賴,並在Linux、macOS和Windows上工作。您將探索數據科學和線性代數的所有基礎知識,並使用SciPy、contrastive、scikit-learn、Rattle和Rmixmod等包執行數據科學任務。

熟悉了這些內容後,您將開始進行數據科學操作,如清理、排序和數據分類。您將學習如何執行聚類、回歸、預測以及構建機器學習模型並對其進行優化的任務。此外,您還將學習如何使用Julia、Python和R提供的包來可視化數據。

您將學到:
- 使用Anaconda執行清理、排序、分類、聚類、回歸和數據集建模
- 使用包管理器Conda,發現、安裝和使用功能高效且可擴展的包
- 在項目中使用多種語言進行異構數據探索
- 執行分佈式計算,使用Anaconda Accelerate優化計算能力
- 在Anaconda Cloud上發現和分享包、筆記本和環境,使用共享項目驅動器
- 解決高級數據預測問題

本書適合開發人員尋找市場上最佳工具進行數據科學的讀者。同時,對於想要通過使用多種語言中的最佳庫提高數據科學應用效率的數據分析師和數據科學專業人員也非常適用。預期讀者具備R或Python的基本編程知識和線性代數的入門知識。

目錄:
1. Anaconda生態系統
2. 安裝Anaconda
3. 數據基礎知識
4. 數據可視化
5. Anaconda中的統計建模
6. 包管理
7. Anaconda中的優化
8. Anaconda中的無監督學習
9. Anaconda中的監督學習
10. 預測數據分析:建模和驗證
11. Anaconda Cloud
12. 分佈式計算、並行計算和HPCC