Thứ Ba, 23 tháng 4, 2019

How to use Web Content Extractor(WCE) as Email Scraper?

Đăng Bởi: Admin - tháng 4 23, 2019 - 0 Bình Luận

http://webdata-scraping.com/use-web-content-extractor-as-email-scraper/


How to use Web Content Extractor(WCE) as Email Scraper?

Email-ScrapingWeb Content Extractor is a great web scraping software developed by Newprosoft Team. The software has easy to use project wizard to create a scraping configuration and scrape data from websites.
One day I came to see the Visual Email Extractor which is also product of Newprosoft and similar to Web Content Extractor but it’s primary use is to scrape email addresses by crawling websites you feed to the scraper. I had noticed that with the little modification in Web Content Extractor project configuration you can use it same as Visual Email Extractor to extract email addresses.
In this post I will show you what configuration makes the Web Content Extractor to extract email addresses. I still recommend Visual Email Extractor as it has lot more features then extracting email using WCE.
Here are the configuration that makes WCE to Extract Emails.
Step 1 : Open Web Content Extractor and Create New Project and Click on Next.
Web Content Extractor
Step 2:  Under Crawling Rules -> Advanced Rules Tab do the following settings
Crawling Level 1 Settings
 Crawling Level 2  Settings

Email Scraping
Web Content Extractor Settings
Step 3 After doing above settings now click on Next  -> in Extraction Pattern window -> Click on Define ->  in Web Page Address (URL) give any URL where email is given.  and click on  + sign right of Date Fields to define scraping pattern.
Now inside HTML Structure selects HTML check box or Body check box which means for each page it will take whole page content to parse data.
Now last settings to extract emails from page using regular expression based email extraction function.  Open Predefined Script window and select ‘Extract_Email_Addresses‘ and click on OK. and if you have used page that contains email then in Script Result’ you will be able to see the harvested email.
Email Extractor
Email Extraction Script Settings
Hope this will help you to use your Web Content Extractor as a Email Scraper.. Share your view in comment.



Phản Hồi Độc Giả

Một số lưu ý khi bình luận

Mọi bình luận sai nội quy sẽ bị xóa mà không cần báo trước (xem nội quy)

Bấm Thông báo cho tôi bên dưới khung bình luận để nhận thông báo khi admin trả lời

Để bình luận một đoạn code, hãy mã hóa code trước nhé