![values in excel to octoparse loop values in excel to octoparse loop](https://i.ytimg.com/vi/7Chuqz5M65U/mqdefault.jpg)
![values in excel to octoparse loop values in excel to octoparse loop](https://demo.dokumen.tips/img/380x512/reader018/reader/2020011510/58ecd9f61a28abbe5a8b4581/r-1.jpg)
Paste a list of URLs into the "URL list" box and click "Save". Create a loop for a list of URLs.ĭrag a "Loop Item" into the Workflow Designer and then choose "URL list" in the "Loop mode". Scraping product details from Ĭlick "Quick Start" ➜ Choose "New Task (Advanced Mode)" ➜Complete basic information ➜ Click "Next". Click the "Export" button to export the results to Excel file, databases or other formats and save the file to your computer. Copy the list of URLs for Task 2. The data extracted will be shown in "Data Extracted" pane. Octoparse will automatically extract all the URLs. Then click "Next" ➜ Click "Next" ➜ Click "Local Extraction" to run the task on your computer. Go to the webpage ➜ Cycle Pages box ➜ Loop Item box ➜ Extract Data➜ Click to Paginate. Now we need to check the workflow by clicking actions from the beginning of the workflow. Drag the second “Loop Item” box before the “Click to paginate” action of the “Cycle Pages” box in the Workflow Designer so that we can grab all the elements of sections from multiple pages. ➜ Click the item name➜ Select "Extract link(href attribute of A tag) of this item". ➜Then click "Finish Creating List" ➜ Click "loop" to process the list for extracting the elements in each page.Įxtract the link of the first item. Now we get all the links with a similar layout. ➜ Click "Continue to edit the list".Ĭlick the second highlighted link ➜ Click "Add current item to the list" again. Then the first highlighted link has been added to the list. Click "Create a list of items" (sections with similar layout). Move your cursor over the section with similar layout, where you would extract the URLs.Ĭlick the first highlighted link ➜ Create a list of sections with a similar layout. You can click "Expand the selection area" button until "Loop click in the element" appears. You can right-click the "Next" pagination link to prevent triggering the link.ģ. If you want to extract information from every page of search result, you need to add a page navigation action.Ģ. ➜ Choose "Loop click in the element" to turn the page.ġ. ➜ Click the "Go" icon to open the webpage. Enter the target URL in the built-in browser. Scraping the URLs needed for Task 2.Ĭlick "Quick Start" ➜ Choose "New Task (Advanced Mode)" ➜Complete basic information. (Download my extraction tasks of this tutorial HERE just in case you need it: Task 1, Task 2.) Or you can follow the steps below to make the scraping tasks to scrape data from.
VALUES IN EXCEL TO OCTOPARSE LOOP DOWNLOAD
You can directly download the two tasks ( The OTD. The data fields include auction item name, item condition, ended time, the price for the item, the number of items sold, HKD(including shipping), shipping price, shipping details, item location, seller id, seller's representative, product details, item image URL and product detail page URL. In this tutorial, we will scrape all the product details page from with Octoparse. Task 1 is used to scrape the URLs of product details and Task 2 is used to scrape all the product details from. To scrape product details from as fast as possible, you can make two scraping tasks - Task 1 and Task 2. Then our cloud servers will collect the data shortly and provide you with a structured data-set. To speed up the extraction, you can use our Cloud Extraction to split the scraping task into many sub-tasks. Octoparse enables you to scrape data from.