An Introduction to Data Analysis in R: Hands-On Coding, Data Mining, Visualization and Statistics from Scratch

Zamora Saiz, Alfonso; Quesada González, Carlos; Hurtado Gil, Lluís; Mondéjar Ruiz, Diego

Product Description

This textbook offers an easy-to-follow, practical guide to modern data analysis using the programming language R. The chapters cover the fundamentals of programming in R; data collection and preprocessing, including web scraping; data visualization; and statistical methods, including multivariate analysis. Each section ends with exercises. The text requires only basic statistics skills, as it strikes a balance between statistical and mathematical understanding and implementation in R, with a special emphasis on reproducible examples and real-world applications. This textbook is primarily intended for undergraduate students of mathematics, statistics, physics, economics, finance and business who are pursuing a career in data analytics. It will be equally valuable for master's students of data science and industry professionals who want to conduct data analyses.

About the Authors

Alfonso Zamora Saiz is a professor at the School of Computer Science Engineering, Technical University of Madrid, Spain. Holding a PhD in algebraic geometry from the Complutense University of Madrid (2013), he has been a visiting PhD student at Cambridge University and Columbia University, a postdoc at the IST in Lisbon, a lecturer at California State University Channel Islands and a professor at the CEU San Pablo University in Madrid. He has also worked as a quantitative analyst in industry. His research interests include algebra, geometry and topology in pure mathematics, as well as data analytics applications and mathematics education.

Carlos Quesada González holds a PhD in applied mathematics from the Complutense University of Madrid, Spain. He is a professor and Vice-dean of the School of Business at the CEU San Pablo University in Madrid, where he has helped to establish the Business Intelligence degree. He also teaches master's courses on big data for finance and collaborates as a statistical analyst with Grant-Thornton.

Lluís Hurtado Gil is a professional data scientist at eDreams ODIGEO and holds a PhD in astrophysics from Valencia University (2016). He was a professor of statistics and econometrics for three years at CEU San Pablo University, where he also served as secretary of the Statistics and Applied Mathematics Department. He has published works on econometrics for undergraduate students and research papers on statistical applications in modern astrophysics with R code. Currently, he continues to collaborate with the International J-PAS Survey, investigating the physics of the accelerating universe. Professionally, he has specialized in the application of stochastic processes to digital marketing.

Diego Mondéjar Ruiz is a professor at the Department of Applied Mathematics and Statistics, CEU San Pablo University in Madrid, Spain. He obtained his PhD in mathematics from the Complutense University of Madrid in 2015 with a thesis on topological data analysis and computational topology. In addition to having been a visiting PhD student at Stanford University and the University of Pennsylvania, he has taught mathematics, statistics and programming courses at several universities.
