Data Analysis with Python and Pyspark (Paperback)

Rioux, Jonathan

買這商品的人也買了...

相關主題

商品描述

Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale.

When it comes to data analytics, it pays to think big. PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task. Data Analysis with Python and PySpark is your guide to delivering successful Python-driven data projects.

Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale. This clear and hands-on guide shows you how to enlarge your processing capabilities across multiple machines with data from any source, ranging from Hadoop-based clusters to Excel worksheets. You'll learn how to break down big analysis tasks into manageable chunks and how to choose and use the best PySpark data abstraction for your unique needs.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

商品描述(中文翻譯)

《使用Python和PySpark進行數據分析》是一本精心設計的教程,幫助您使用PySpark在任何規模上交付數據驅動的應用程序。在數據分析方面,思考大規模是值得的。PySpark將強大的Spark大數據處理引擎與Python編程語言相結合,提供了一個可擴展到幾乎任何任務的數據分析平台。《使用Python和PySpark進行數據分析》是您成功交付Python驅動的數據項目的指南。這本清晰而實用的指南將向您展示如何通過來自任何來源的數據(從基於Hadoop的集群到Excel工作表)擴大您的處理能力。您將學習如何將大型分析任務分解為可管理的片段,以及如何選擇和使用最適合您獨特需求的PySpark數據抽象。購買印刷版書籍將包含Manning Publications提供的PDF、Kindle和ePub格式的免費電子書。

作者簡介

As a data scientist for an engineering consultancy Jonathan Rioux uses PySpark daily. He teaches the software to data scientists, engineers, and data-savvy business analysts.

作者簡介(中文翻譯)

作為一家工程顧問公司的數據科學家,Jonathan Rioux每天都使用PySpark。他教授這個軟體給數據科學家、工程師和數據敏感的業務分析師。