Learning Microsoft Cognitive Services

Leif Larsen

買這商品的人也買了...

商品描述

Key Features

  • Explore the capabilities of all 21 APIs released as part of the Cognitive Services platform
  • Build intelligent apps that combine the power of computer vision, speech recognition, and language processing
  • Give your apps human-like cognitive intelligence with this hands-on guide

Book Description

Microsoft has revamped its Project Oxford to launch the all new Cognitive Services platform―a set of 21 APIs to add speech, vision, language, and knowledge capabilities to apps.

This book will introduce you to all 21 APIs released as part of Cognitive Services platform and show you how to leverage their capabilities. More importantly, you'll see how the power of these APIs can be combined to build real-world apps that have cognitive capabilities. The book is split into three sections: computer vision, speech recognition and language processing, and knowledge and search.

You will be taken through the vision APIs at first as this is very visual, and not too complex. The next part revolves around speech and language, which are somewhat connected. The last part is about adding real-world intelligence to apps by connecting them to Knowledge and Search APIs.

By the end of this book, you will be in a position to understand what Microsoft Cognitive Service can offer and how to use the different APIs.

What you will learn

  • Identify a person through visual inspection and audio
  • Reduce user effort by utilizing AI-like capabilities
  • Understand how to analyze images and text in different manners
  • Find out how to analyze images using Vision APIs
  • Add video analysis to applications using Vision APIs

商品描述(中文翻譯)

《主要特點》
- 探索作為認知服務平台一部分的21個API的功能
- 構建結合計算機視覺、語音識別和語言處理能力的智能應用程式
- 通過這本實踐指南,賦予您的應用程式類似人類的認知智能

《書籍描述》
微軟對其Project Oxford進行了改進,推出了全新的認知服務平台 - 一套包含21個API的服務,可為應用程式添加語音、視覺、語言和知識功能。

本書將向您介紹作為認知服務平台一部分的21個API,並展示如何利用它們的功能。更重要的是,您將看到如何結合這些API的能力來構建具有認知能力的實際應用程式。本書分為三個部分:計算機視覺、語音識別和語言處理,以及知識和搜索。

首先,您將了解視覺API,因為這是非常直觀且不太複雜的部分。接下來的部分涉及語音和語言,這兩者有一定的聯繫。最後一部分是通過將應用程式連接到知識和搜索API來為應用程式添加現實世界的智能。

通過閱讀本書,您將了解微軟認知服務能夠提供什麼,以及如何使用不同的API。

《您將學到什麼》
- 通過視覺檢查和音頻識別來識別人物
- 利用類似人工智能的能力減少用戶的努力
- 了解如何以不同方式分析圖像和文本
- 了解如何使用視覺API分析圖像
- 使用視覺API將視頻分析添加到應用程式中