Current to-do list
Sequentum Support
#1 Posted : Thursday, September 02, 2010 10:14:54 PM

Groups: Administrators
Joined: 4/10/2010
Posts: 1,239
Location: Sydney, Australia
This is NOT a final list of upcoming features. The list may change at any time.

High priority

- Completing the Visual Web Ripper manual (COMPLETED)
- Updating and adding new training videos (COMPLETED)
- Updating the Visual Web Ripper website (COMPLETED)
- Feature to allow Visual Web Ripper to continue data extraction where it was last stopped (COMPLETED)
- Feature to restart the web browser during data extraction. This is to handle situations where a website leaks memory in Internet Explorer causing Visual Web Ripper to run out of memory and crash. (COMPLETED)
- Use SQL Server Compact to store extracted data instead of custom in-memory data structures. This should eliminate the need to break extracted data into parts, and also eliminate the need to manually control the memory cache. (IN PROGRESS)

Medium priority

- Redesign of the Visual Web Ripper user interface (IN PROGRESS).
- Ability to continue data extraction from a given point (COMPLETED).
- "Test connection" button on all database connection screens.
- Update debug messages to only show a warning if a content element is not found and all alternative contents are also not found.
- Ability to set a Page Transformation script for all web pages in the entire project.
- Allow reordering of content and template elements by using drag-and-drop (COMPLETED).
- A scheduler overview screen that shows all data extraction projects.
- Ability to turn off Content Transformation in edit mode.
- API callback functionality for CAPTCHA fields, allowing a 3rd party application to respond to CAPTCHA (Examples: http://manymacros.com/, http://www.beatcaptchas.com/imacroscode.html) (COMPLETED)
- Project save message on exiting after changes have been made.
- Support for "Enter" key event.
- Extracting content from PDF files.
- Application wide database and email configuration.
- Add an option to cancel a web browser request based on content type (for example, to prevent the web browser from starting a file download).
- Ability to convert multiple existing data files into a new output format.
- Ability to set the log folder path.
- Feature to split data extraction into multiple processes to increase performance in WebBrowser mode.
- Ability to upload files when submitting forms in WebCrawler mode.
- Adding ODBC destination data source.
- Ability to set multiple form fields with a single FormField element.
- Ability to extract a screenshot of a web page.

Low priority

- Handling situations where HTML element events are caught at the document level.
- Adding more short-cut keys in the Visual Web Ripper user interface.
- Adding more options for proxy servers, such as dynamically changing the number of links loaded by each proxy to emulate normal user behaviour.
- Access to request/response headers in WebCrawler mode.
- Updating the "View HTML" screen with an option to show HTML as formatted by Internet Explorer.
- The "View data" screen should have links to the web pages where data was extracted. This will aid in debugging missing data.
- Global Content Transformation script that is run for all content elements in the entire project.
Moffice
#2 Posted : Monday, September 06, 2010 1:45:04 AM
Groups: Registered
Joined: 8/24/2010
Posts: 18
Great ... looking forward to it, :)
Give a buzz when something is done, so we don't miss the features among the other updates.

Thanks...

hellraiserc7
#3 Posted : Monday, November 08, 2010 8:01:57 AM
Groups: Registered
Joined: 6/17/2010
Posts: 58
Location: Canada
I don't see extraction of text from PDF files. I guess this goes into the high priority list?
jagdish
#4 Posted : Monday, November 08, 2010 8:50:40 AM
Groups: Registered
Joined: 9/15/2010
Posts: 4
Location: Chennai
May I have the status of these updates?

Thanks
Jagdish
renti
#5 Posted : Monday, December 20, 2010 6:17:25 PM
Groups: Registered
Joined: 12/13/2010
Posts: 8
Location: Spain

Hello,

Could you please implement a new feature to save extracted data automatically every x minutes? It is maddening when the software has been running for hours, then crashes, and you have to run the project from the very beginning again!

Is there a temp file anywhere that we can use when the software crashes?

regards,
tony.
Sequentum Support
#6 Posted : Monday, December 20, 2010 7:25:07 PM

Groups: Administrators
Joined: 4/10/2010
Posts: 1,239
Location: Sydney, Australia
It is better to try and work out why the software crashes. Have a look at these topics in the manual.

http://manual.visualwebr...ault.aspx?manual_id=674

http://manual.visualwebr...ault.aspx?manual_id=675

You can use the "Data row cache" option to save data to disk while extracting. This is also explained in the topic above.
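
For readers wondering what saving rows to disk during extraction buys you, here is a minimal sketch of the general idea. It is not Visual Web Ripper's implementation or API: it uses Python's built-in sqlite3 module as a stand-in for an embedded database, and the names RowStore, flush_every, url and title are all hypothetical. Rows are committed to disk in small batches, so a crash should lose at most the last uncommitted batch, and a restarted run can skip pages it has already stored.

```python
import sqlite3


class RowStore:
    """Hypothetical disk-backed store for extracted rows (illustration only)."""

    def __init__(self, path, flush_every=100):
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS rows (url TEXT PRIMARY KEY, title TEXT)"
        )
        self.conn.commit()
        self.flush_every = flush_every  # rows to buffer before committing
        self.pending = 0

    def add(self, url, title):
        # INSERT OR REPLACE keeps a re-run from duplicating a page's row.
        self.conn.execute(
            "INSERT OR REPLACE INTO rows (url, title) VALUES (?, ?)", (url, title)
        )
        self.pending += 1
        if self.pending >= self.flush_every:
            self.flush()

    def flush(self):
        # Commit makes the buffered rows durable on disk.
        self.conn.commit()
        self.pending = 0

    def seen(self, url):
        # Lets a restarted run skip pages that were already extracted.
        cur = self.conn.execute("SELECT 1 FROM rows WHERE url = ?", (url,))
        return cur.fetchone() is not None

    def count(self):
        return self.conn.execute("SELECT COUNT(*) FROM rows").fetchone()[0]

    def close(self):
        self.flush()
        self.conn.close()
```

A smaller flush_every value means less data is at risk if the process dies, at the cost of more frequent disk writes.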