Webbots, Spiders, and Screen Scrapers, 2nd Edition: A Guide to Developing Internet Agents with PHP/CURLby Michael Schrenk
This second edition of Webbots, Spiders, and Screen Scrapers includes tricks for dealing with sites that are resistant to crawling and scraping, writing stealthy webbots that mimic human search behavior, and using regular expressions to harvest specific data. As you discover the possibilities of web scraping, you'll see how webbots can save you precious time/i>… See more details below
- LendMe LendMe™ Learn More
This second edition of Webbots, Spiders, and Screen Scrapers includes tricks for dealing with sites that are resistant to crawling and scraping, writing stealthy webbots that mimic human search behavior, and using regular expressions to harvest specific data. As you discover the possibilities of web scraping, you'll see how webbots can save you precious time and give you much greater control over the data available on the Web.
- No Starch Press San Francisco, CA
- Publication date:
- Sold by:
- Barnes & Noble
- NOOK Book
- Sales rank:
- File size:
- 10 MB
and post it to your social network
Most Helpful Customer Reviews
See all customer reviews >
Definitely the missing link in how to automate internet activity. A most have book in your Tech Library.
This is a review of Michael's 2nd Edition of the same book (I received an early release edition from the publisher, I did not have an opportunity to read the 1st edition): I thoroughly enjoy this book. I found myself glued to this topic, I have heard about it many times before just never investigated it. This is "good stuff" and I missed out by not starting earlier. The author, Michael Schrenk knows his stuff and is passionate about his craft and it shows in the way he writes. All throughout his book his excitement about how incredible this technology is, and his use of these tools in creative ways is contagious. I like to read books by authors who are so enthusiastic about their subject matter, as oppose to just droning out facts and knowledge. Reading this book was exciting and addicting. Following along, tinkering with his examples was just play fun. His excitement and ingenious way of looking at things just rubs off, even before I got to the real-world examples the ideas just started flowing. It's like I just discovered the next BIG THING, but I'm not going to shared that here. He does a great job of explaining everything in step by step details and then compliments them with photos and diagrams to aide with comprehension. His code examples are simple and it was easy to see what was going on. His code examples are written in an imperative, or procedural style as oppose to an object oriented style, which in my opinion, is better suited when teaching new or difficult concepts. Also, it's just easier to follow along by a wider range of people with varying programming backgrounds. He also provides his own supplemental library (via the book website), to simplify using cURL itself. Using his library, I was able to quickly get things up and running and see how everything works, and that is a good thing when learning something new. It sets you on a possible spin and leaves you with nothing but good stuff to say about the subject you just learned. In the end, would I recommend this book to others? Absolutely. It is just like learning the command line, once you start and see the benefits, you never look back.