97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts

Macey, Tobias

商品描述

Take advantage of today's sky-high demand for data engineers. With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Contributors from notable companies including Twitter, Google, Stitch Fix, Microsoft, Capital One, and LinkedIn share their experiences and lessons learned for overcoming a variety of specific and often nagging challenges.

Edited by Tobias Macey, host of the popular Data Engineering Podcast, this book presents 97 concise and useful tips for cleaning, prepping, wrangling, storing, processing, and ingesting data. Data engineers, data architects, data team managers, data scientists, machine learning engineers, and software engineers will greatly benefit from the wisdom and experience of their peers.

Topics include:

  • Building pipelines
  • Stream processing
  • Data privacy and security
  • Data governance and lineage
  • Data storage and architecture
  • The ecosystem of modern tools
  • Data team makeup and culture
  • Career advice

商品描述(中文翻譯)

充分利用當今對於數據工程師的高需求。這本深入的書籍將教授現任和有志之士強大的實用技巧,以應對大型和小型數據的管理。Twitter、Google、Stitch Fix、Microsoft、Capital One和LinkedIn等知名公司的貢獻者分享了他們在克服各種具體且常常令人困擾的挑戰中所獲得的經驗和教訓。

由熱門的《Data Engineering Podcast》主持人Tobias Macey編輯,本書提供了97條簡潔而實用的技巧,用於清理、準備、整理、存儲、處理和輸入數據。數據工程師、數據架構師、數據團隊經理、數據科學家、機器學習工程師和軟件工程師將從同行的智慧和經驗中獲益匪淺。

主題包括:
- 構建數據管道
- 流式處理
- 數據隱私和安全
- 數據治理和譜系
- 數據存儲和架構
- 現代工具生態系統
- 數據團隊組成和文化
- 職業建議

作者簡介

Tobias Macey hosts the Data Engineering Podcast and Podcast.__init__ where he discusses the tools, topics, and people that comprise the data engineering and Python communities respectively. His experience across the domains of infrastructure, software, cloud, and data engineering allows him to ask informed questions and bring useful context to the discussions. The ongoing focus of his career is to help educate people, through designing and building platforms that power online learning, consulting with companies and investors to understand the possibilities of emerging technologies, and leading teams of engineers to help them grow professionally.

作者簡介(中文翻譯)

Tobias Macey主持《Data Engineering Podcast》和《Podcast.__init__》,在這兩個節目中,他分別討論了數據工程和Python社區的工具、主題和人物。他在基礎設施、軟件、雲端和數據工程領域的經驗使他能夠提出有見地的問題並為討論帶來有用的背景。他職業生涯的持續關注點是通過設計和構建支持在線學習的平台來幫助教育人們,與公司和投資者協商以了解新興技術的可能性,並領導工程團隊幫助他們在專業上成長。