This is NOT a final list of coming features. The list may change at any time.
High priority
- Completing the Visual Web Ripper manual (COMPLETED)
- Updating and adding new training videos (COMPLETED)
- Updating the Visual Web Ripper website (COMPLETED)
- Feature to allow Visual Web Ripper to continue data extraction where it was last stopped (COMPLETED)
- Feature to restart the web browser during data extraction. This handles situations where a website leaks memory in Internet Explorer, causing Visual Web Ripper to run out of memory and crash. (COMPLETED)
- Use SQL Server Compact to store extracted data instead of custom in-memory data structures. This should eliminate the need to break extracted data into parts, and also eliminate the need to manually control the memory cache. (IN PROGRESS)
Medium priority
- Redesign of the Visual Web Ripper user interface (IN PROGRESS).
- Ability to continue data extraction from a given point (COMPLETED).
- "Test connection" button on all database connection screens.
- Update debug messages to show a warning only if a content element and all of its alternative contents are not found.
- Ability to set a Page Transformation script for all web pages in the entire project.
- Allow reordering of content and template elements by using drag-and-drop (COMPLETED).
- A scheduler overview screen that shows all data extraction projects.
- Ability to turn off Content Transformation in edit mode.
- API callback functionality for CAPTCHA fields, allowing a third-party application to respond to a CAPTCHA (examples: http://manymacros.com/, http://www.beatcaptchas.com/imacroscode.html) (COMPLETED)
- Project save message on exiting after changes have been made.
- Support for "Enter" key event.
- Extracting content from PDF files.
- Application-wide database and email configuration.
- Add an option to cancel a web browser request based on content type (for example, to prevent the web browser from starting a file download).
- Ability to convert multiple existing data files into a new output format.
- Ability to set the log folder path.
- Feature to split data extraction into multiple processes to increase performance in WebBrowser mode.
- Ability to upload files when submitting forms in WebCrawler mode.
- Adding ODBC destination data source.
- Ability to set multiple form fields with a single FormField element.
- Ability to extract a screenshot of a web page.
Low priority
- Handling situations where HTML element events are caught at the document level.
- Adding more shortcut keys in the Visual Web Ripper user interface.
- Adding more options for proxy servers, such as dynamically changing the number of links loaded by each proxy to emulate normal user behaviour.
- Access to request/response headers in WebCrawler mode.
- Updating the "View HTML" screen with an option to show HTML as formatted by Internet Explorer.
- The "View data" screen should link to the web pages from which data was extracted. This will aid in debugging missing data.
- Global Content Transformation script that is run for all content elements in the entire project.