UGN UGN books

Webbots, Spiders, and Screen Scrapers


A Guide to Developing Internet Agents with PHP/CURL

Guide to writing webbots, spiders, and screen scrapersThe Internet is bigger and better than what a mere browser allows. "Webbots, Spiders, and Screen Scrapers: A Guide to Developing Internet Agents with PHP/CURL" is for programmers and businesspeople who want to take full advantage of the vast resources available on the Web. As author Michael Schrenk demonstrates, there's no reason to let browsers limit the online experience -- especially when it's so easy to automate online tasks to suit individual needs.

This new book begins by outlining the deficiencies of browsers, then explains how these deficiencies can be exploited in the design and deployment of task-specific webbots--customized programs that aggregate different sources, filter content for relevant data, and automate online transactions.

Inside "Webbots, Spiders, and Screen Scrapers," readers learn how to write fault-tolerant webbots and spiders that:
-download entire websites and parse data from web pages
-manage cookies and decode encrypted files
-automate form submissions and send and receive email
-send SMS alerts to cell phones
-unlock password-protected websites
-automatically bid in online auctions
-exchange data with FTP and NNTP servers

Sample projects reinforce these new skills so readers can create simple Web applications to track online prices, create anonymous browsing environments, archive online data, and more. In addition, the author's website (www.schrenk.com) provides readers with sample scripts and code libraries, as well as a place to test their own webbots.

"It can be difficult to learn how to design, develop, and deploy webbots," said No Starch Press founder Bill Pollock. "Mike Schrenk has been living and breathing this stuff for many years and is the perfect teacher to share his accumulated wisdom."

As "Webbots, Spiders, and Screen Scrapers" illustrates, some tasks are just too tedious—or too important—to leave to humans. With guidance from author Schrenk on how to automate online life, readers won't let a browser limit the way they use the Internet again.

Michael Schrenk develops webbots and spiders for clients across North America. He has written for Computerworld and Web Techniques magazines and has taught college courses on Web usability and Internet marketing. He's also an occasional speaker at DEFCON.

Webbots, Spiders, and Screen Scrapers: A Guide to Developing Internet Agents with PHP/CURL
by Michael Schrenk
Apr 07, 328pp, US$39.95, Available in bookstores everywhere.
Download: Sample chapter

ABOUT NO STARCH PRESS: Founded in 1994, No Starch Press (nostarch.com) is one of the few remaining independent computer book publishers. We publish the finest in geek entertainment--unique books on technology, with a focus on open source, security, hacking, programming, and alternative operating systems. Our titles have personality, our authors are passionate, and our books tackle topics that people care about. No Starch Press titles have been selected for the prestigious Communication Arts Design Annual and STEP inside 100, and have won the Ippy Award from Independent Publisher magazine. Visit nostarch.com for more information and our complete catalog. (And most No Starch Press books use RepKover, a lay-flat binding that won't snap shut.)

UGN Site Navigation:

Return to: the top of this page, or the INDEX for this department
Exit to: The User Group Network front page
Contact: The Editor, Webmaster or Membership Director
* Discuss Photoshop
* Discuss Desktop Publishing
* Critique your Web Site

CREDITS:
Reviewed by Fred Showker for the User Group Network News Service. (C) 2006, all rights reserved. Affiliate groups may freely republish this piece so long as they include the tag line: "From the User Group Network News Service at http://www.user-groups.net/ " ... Event dates are subject to change. Some products, programs, or promotions are not available outside the U.S. Prices are estimated retail prices and are listed in U.S. dollars. Product specifications are subject to change. Apple, the Apple logo, Mac, Mac OS, Macintosh, Power Mac, Velocity Engine, FireWire, AirPort, Safari, Sherlock, QuickTime, iLife, iTunes, iChat, iPhoto, iMovie, iDVD, iCal and Apple Store are either registered trademarks or trademarks of Apple. Other company and product names may be trademarks of their respective owners. Mention of third-party products is for informational purposes only and constitutes neither a recommendation nor an endorsement.

 

The User Group Network is a member of:, the MUG News, and is sponsored in part by: The Design & Publishing Center, The News Serve Network, and the Designers' Bookshelf. The User Group Network is the first, and the original user group network for computer users everywhere including, Apple, Mac-Pro, User Group Organization to support Macintosh, IBM PC, Microsoft, Compaq, Amiga, BE/OS, Linux, UNIX, and other leading computer platforms. Hosting services are provided by The Graphic Design Network to serve the computing community. For information about the UGNetwork, to get involved or have your own groups' home page located at user-groups.net, please contact us. Copyright 1994 through present. This site is maintained in the community interest by The Graphic Design Network c/o Showker Graphic Arts & Design, a Corporation of the Commonwealth of Virginia, Commonwealth of Virginia, 22801, Harrisonburg, VA, in the Shenandoah Valley of Virginia, established in 1972.

Valid HTML 4.01!