HTTP Programming Recipes for Java Bots (Paperback)

Jeff Heaton

  • 出版商: Heaton Research
  • 出版日期: 2007-04-17
  • 售價: $1,910
  • 貴賓價: 9.5$1,815
  • 語言: 英文
  • 頁數: 680
  • 裝訂: Paperback
  • ISBN: 0977320669
  • ISBN-13: 9780977320660
  • 相關分類: HTTPJava 程式語言
  • 無法訂購

買這商品的人也買了...

商品描述

The Hypertext Transfer Protocol (HTTP) allows information to be exchanged between a web server and a web browser. Java allows you to program HTTP directly. HTTP programming allows you to create programs that access the web much like a human user would. These programs, which are called bots, can collect information or automate common web programming tasks. This book presents a collection of very reusable recipes for Java bot programming. This book covers many topics related to Java HTTP programming. Both secure and insecure HTTP communications are covered, as well as HTTP authentication. Learn to interact with HTTP forms and support both HTTP POST and HTTP GET requests. Collect data from a wide array of HTML constructs, such as tables, and lists. Learn about advanced topics that complicate the life of a bot, such as AJAX and Javascript. Also learn about the ethical use of bots, and when bots should not be used. This book also introduces the Heaton Research Spider. The Heaton Research Spider is an open source spider framework. Using the Heaton Research Spider you can create spiders that will crawl a web site, much like a real spider crawls the web. The Heaton Research Spider is available in both Java and Microsoft Dot Net form.

商品描述(中文翻譯)

超文本傳輸協定(HTTP)允許網頁伺服器和網頁瀏覽器之間進行資訊交換。Java允許您直接編寫HTTP程式。HTTP程式設計允許您創建像人類使用者一樣訪問網頁的程式。這些程式被稱為機器人(bots),可以收集資訊或自動執行常見的網頁程式任務。本書提供了一系列非常可重複使用的Java機器人程式設計配方。

本書涵蓋了許多與Java HTTP程式設計相關的主題。包括安全和非安全的HTTP通訊,以及HTTP驗證。學習與HTTP表單互動,並支援HTTP POST和HTTP GET請求。從各種HTML結構(如表格和列表)中收集資料。了解複雜化機器人生活的高級主題,如AJAX和JavaScript。還學習有關機器人的道德使用,以及應該避免使用機器人的情況。

本書還介紹了Heaton Research Spider。Heaton Research Spider是一個開源的爬蟲框架。使用Heaton Research Spider,您可以創建爬行網站的爬蟲,就像真實的蜘蛛爬行網頁一樣。Heaton Research Spider提供Java和Microsoft Dot Net版本。