Deep Learning with C++: Design and deploy neural networks using CUDA for high-performance AI in C++
暫譯: 使用 C++ 的深度學習:利用 CUDA 設計和部署高效能 AI 的神經網絡

Chen, Bill, Gupta, Vikash

  • 出版商: Packt Publishing
  • 出版日期: 2026-04-30
  • 售價: $1,840
  • 貴賓價: 9.5$1,748
  • 語言: 英文
  • 頁數: 610
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1835880029
  • ISBN-13: 9781835880029
  • 相關分類: DeepLearningCUDA
  • 海外代購書籍(需單獨結帳)

商品描述

Build and deploy high-performance deep learning models using C++ for real-time applications where speed and efficiency matter.

Free with your book: DRM-free PDF version + access to Packt's next-gen Reader*

Key Features:

Build deep learning models in C++ with PyTorch C++ API and CUDA

Implement CNNs, RNNs, LSTMs, GANs, and Transformers in C++ for real-world applications

Optimize and deploy machine learning models to production with scalable C++ pipelines

Book Description:

Deep learning systems often struggle to meet performance demands in real-time and production environments. This book shows you how to build high-performance deep learning systems in C++, enabling efficient and scalable artificial intelligence (AI) in resource-constrained environments where performance matters.

You'll start by setting up a complete C++ deep learning environment and implementing core neural networks from scratch. As you progress, you'll build advanced architectures, including Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Long Short-Term Memory Networks (LSTMs), Generative Adversarial Networks (GANs), and Transformers, using C++, CUDA, and PyTorch's C++ API. The book then focuses on model quantization and compression. It will guide you through the model deployment process in production with robust monitoring and explainability. You'll also explore distributed training and techniques for real-time inference in performance-critical domains.

By the end of this book, you'll be able to design, optimize, and deploy deep learning systems in C++ that are production-ready, scalable, and efficient across multiple industries.

*Email sign-up and proof of purchase required

What You Will Learn:

Set up and use CUDA and PyTorch's C++ API for deep learning

Implement CNNs, RNNs, LSTMs, GANs, Transformers, and LLMs in C++

Leverage CUDA for high-performance model training

Perform model compression using quantization, pruning, and distillation

Deploy and monitor models in production using C++ tools

Apply explainability techniques such as LIME, SHAP, and Grad-CAM

Who this book is for:

This book is for ML engineers, deep learning practitioners, and data scientists with a C++ background who want to build or learn about high-performance deep learning models. It also serves developers transitioning from Python-based frameworks looking for real-time deployment solutions in industries like finance, autonomous systems, and healthcare.

Table of Contents

Introduction to Deep Learning with C++ and Environment Setup

Data Preparation and Preprocessing in C++

CUDA for GPU Acceleration in Deep Learning with C++

Building a Basic Neural Network in C++

Multilayer Perceptrons in C++

Convolutional Neural Networks in C++

Recurrent Neural Networks and Long Short-Term Memory Networks in C++

Generative Networks, Autoencoders, and Large Language Models in C++

Transformers and Large Language Model Fine-tuning in C++

Deploying and Optimizing Models for Inference

Debugging and Retraining Deployed Models

Monitoring Deployed Models

Explainability and Transparency in Deep Learning Models

商品描述(中文翻譯)

建構和部署高效能的深度學習模型,使用 C++ 針對速度和效率至關重要的即時應用程式。

隨書附贈:無 DRM 的 PDF 版本 + 獲得 Packt 的下一代閱讀器*

主要特點:
使用 PyTorch C++ API 和 CUDA 在 C++ 中建構深度學習模型
在 C++ 中實現 CNN、RNN、LSTM、GAN 和 Transformers 以應用於實際場景
優化並將機器學習模型部署到生產環境,使用可擴展的 C++ 管道

書籍描述:
深度學習系統在即時和生產環境中常常難以滿足性能需求。本書將教您如何在 C++ 中建構高效能的深度學習系統,使其能在資源受限的環境中實現高效且可擴展的人工智慧 (AI),在性能至關重要的情況下發揮作用。

您將從設置完整的 C++ 深度學習環境開始,並從零開始實現核心神經網絡。隨著進展,您將使用 C++、CUDA 和 PyTorch 的 C++ API 建構先進的架構,包括卷積神經網絡 (CNN)、循環神經網絡 (RNN)、長短期記憶網絡 (LSTM)、生成對抗網絡 (GAN) 和 Transformers。本書接著專注於模型量化和壓縮,將指導您通過生產中的模型部署過程,並提供穩健的監控和可解釋性。您還將探索分散式訓練和在性能關鍵領域進行即時推斷的技術。

在本書結束時,您將能夠設計、優化和部署生產就緒、可擴展且高效的 C++ 深度學習系統,適用於多個行業。

*需要電子郵件註冊和購買證明

您將學到的內容:
設置和使用 CUDA 及 PyTorch 的 C++ API 進行深度學習
在 C++ 中實現 CNN、RNN、LSTM、GAN、Transformers 和 LLM
利用 CUDA 進行高效能模型訓練
使用量化、剪枝和蒸餾進行模型壓縮
使用 C++ 工具在生產中部署和監控模型
應用可解釋性技術,如 LIME、SHAP 和 Grad-CAM

本書適合對象:
本書適合具有 C++ 背景的機器學習工程師、深度學習實踐者和數據科學家,想要建構或了解高效能深度學習模型的人士。它也適合從基於 Python 的框架轉型的開發者,尋求在金融、自主系統和醫療等行業中的即時部署解決方案。

目錄
C++ 深度學習簡介及環境設置
C++ 中的數據準備和預處理
C++ 中的 CUDA 用於深度學習的 GPU 加速
在 C++ 中建構基本神經網絡
C++ 中的多層感知器
C++ 中的卷積神經網絡
C++ 中的循環神經網絡和長短期記憶網絡
C++ 中的生成網絡、自編碼器和大型語言模型
C++ 中的 Transformers 和大型語言模型微調
部署和優化推斷模型
調試和重新訓練已部署的模型
監控已部署的模型
深度學習模型中的可解釋性和透明性