First of all, welcome to this new version of Netwoof. This documentation will be improved over the time, and you are welcome to send us reports on it.
API drivers and specifications available here
Netwoof can be used in many ways : web automation, data crawling/scraping, monitoring and so more.
- Functional test : Scheduled test on your own application, even on intranet, with our proxy system. Netwoof can log in, subscribe and so on...
- Crawl qualified data: You can retrieve any unordered data from different sources and transform it in single object to integrate it your application or simply make statistical studies.
- Massive data crawling: Netwoof includes a generic crawler that let you index tones of documents (even PDF, Word, Powerpoint...). This crawler manages boilerplate issues and document evolution at the same time. It means that you will only crawl pertinent documents without ads, menu, …
- File download : as well, you ask can Netwoof to retrieve anykind of data (flv, jpg, gif, docx, xml, etc...). It can store it on a dedicated Amazon S3 bucket and let you get it through our API.