Visual Web Ripper Logo Visual Web Ripper Logo

Highlighted features

Project Summary


Link area page navigation

requested: 3/5/2010 version: 2.33.12

Demonstrates the use of a link area page navigation template.

Target URL: http://www.tradeindia.com/Seller/Home-Supplies/Antique-Furniture/

Download demo project and sample data extract Tradeindia2.zip

Request

This url has a lot of listings and around 5 pages.
Want to go inside each listing and get the details.
You can go inside each listing by clicking veiw details. ( I will make the elements for the details.
 
How to do the first part that is select all the listings in all 5 pages and then go inside each and collect data.
 
Please suggest

Solution

This project should be easy to create, except for these two issues:

1.

The company list has premium listings at the top, and these listings are different from standard listings. A selection template has been used to handle the detail links of premium listings.

2.

The navigation page links are difficult to select because of incorrect HTML syntax on the website. A manually created selection path has been used to select the page links. A basic understanding of XPATH syntax is required in order to understand and manually edit the selection path.

Visit this website for more information about XPATH syntax:

http://www.w3schools.com/XPath/xpath_syntax.asp

Download demo project and sample data extract Tradeindia2.zip

Discuss this project

3/10/2010

Hi
The issue is the details inside each listing we want that we can use the above project to get urls then feed here for details but issue is in details all text in one block see link http://www.tradeindia.com/Seller-308712-302308-1578-CATALOGS/Antique-Furniture/AJV-Exports-.html the data down is what we need
 
AJV Exports
MANSION OF JUSTICE KSP, BEHIND PUBLIC PARK,
Jodhpur - 342006, Rajasthan, India
Phone:91-291-2544867
Fax:91-291-2544867

Key Personnel
Mr. Krishan Kumar Singh (Proprietor)
Mobile:+919928329293
 
We want seperaet fields
1 Address field: asMANSION OF JUSTICE KSP, BEHIND PUBLIC PARK,
2 City field: Jodhpur
3 State field :Rajastan
4 Country field: INdia
 5 Phone field:91-291-2544867
6 Fax field :91-291-2544867
Mobile:+919928329293
8 Contact person field  :Mr. Krishan Kumar Singh
 
Thanks
 

3/10/2010

Sequentum Support

This is fairly simple to do if you know regex syntax. Just use content transformation to separate the fields. I've done a few of them for you in the attached project.
 
Visit this website for more information about regex syntax:
 

3/10/2010

U mean u have done in tradeindia2 project , let me see that and the link provided and get back thanks

3/10/2010

Sequentum Support

Sorry, forgot to attached the project. Here it is.

Attachment: Tradeindia3.zip

3/10/2010

can u please explain atleast can u give an example here for say
We want seperaet fields
1 Address field: asMANSION OF JUSTICE KSP, BEHIND PUBLIC PARK,
2 City field: Jodhpur
3 State field :Rajastan
4 Country field: INdia
U can list it here in text dont need project
 
Please

3/10/2010

ok thanks a lot let me see

3/10/2010

I see only the full part selected for eg,
 
Phone i see phone field and in capture see number  and type html where is the regex  sytax
 
Please advice

3/10/2010

Sequentum Support

Click the "Content Transformation" button to see the regex.

3/10/2010

Hey one last help can u let me know the Regex code for the field key personnel to get result name of person 
 
Regards
rajiv

3/11/2010

Hey one last help can u let me know the Regex code for the field key personnel to get result name of person since its on a diff. line
 
Regards
rajiv

3/11/2010

Sequentum Support

The regex is:
 
Key Personnel</B><BR>(.*?)<BR> 

3/29/2010

Hi,
 
I'm an existing customer.  I can't figure out how to select "businessType" content using Filter function in this demo project? I read your artical about how to use text filter, and I still don't know.  Please help.
 
Jenny M

3/30/2010

Sequentum Support

Simply click on the business type value element "Exporter / Manufacturer / Distributor / Supplier / Trading Company", and then right-click on "Business Type" and select Add Filter from the context menu and then Must Have Text "Business Type".

4/2/2010

Got it!  Thank you very much!

  Required Field - required field
Comment Required Field
Attachement
Loading...
Add
  • Very user friendly visual project designer.
  • Extract complete data structures, such as product catalogues.
  • Repeatedly submit forms for all possible input values.
  • Extract data from highly dynamic web sites including AJAX web sites.
  • Web data extraction scheduler with email notifications and logging.
  • Custom post-processing and comprehensive API.
  • Only $299 including 1 year maintenance.

© 2009-2010 Sequentum  |  Terms & Conditions  |  Privacy Statement  |  Login