How to Configure Main Web Scraper and E-Mail Extractor Options

In this guide, we will show you how to configure the CBT web scraper and e-mail extractor settings so that you can scrape and extract data from search engines, your own website list, business directories and social media sites.

Project Name - give your project a name that describes your niche or industry. The project folder and results file will be saved using your project name, so make sure that it correctly identifies your project. This will help you to keep track of your files.

Keywords - here, you should enter the keywords that you would like to use to find websites to scrape. If you are scraping your own website list, you will not need to enter keywords. However, if you are scraping search engines such as Google, Google Maps, business directories such as Yellow Pages and Yelp, or social media sites, you will need to enter your keywords. Spend some time planning your keywords because they will have a direct impact on the overall quality of your results. Be mindful that the keywords that you use for search engines will be different from the keywords that you ought to use for Google Maps and business directories like Yellow Pages and Yelp. This is because Google Maps and business directories are built for local searches, whilst the search engines can be scraped for virtually anything and everything.

"Crawl and Scrape E-Mails from Search Engines" - check this option if you are planning on scraping and extracting data from search engines such as Google and Bing, business directories like Yellow Pages, Google Maps and social media sites.

"Scrape E-Mail from your Website List" - check this option if you are planning on scraping and extracting data from your own website list or re-scraping your results CSV file.

"Use Proxies" - check this option if you are going to use proxies. We recommend that you use proxies if you plan to scrape websites using multiple threads. Make sure to configure your proxies inside the proxy settings area. Do not check this option if you intend to use a VPN, because a VPN works at the level of your local settings rather than through the proxy settings.

"Invisible Mode: Hire the searching forms" - this setting does exactly what it says on the tin: it hides the browsers in which the scraping takes place. You can enable and disable this option at any time. Disabling it is useful when you want to see how everything is working.

"Fast Mode" - check this option if you intend to use multiple threads. Please ensure that your computer's specification can handle the number of threads. Start off with a low number of threads and work your way up.

"Use an Integrated Web Browser Instead of HTTP request" - use this option if you intend to use a VPN like Nord VPN or HMA VPN PRO with timed out IP change. Bear in mind that a VPN will change your IP address at your local level and can interfere with your Facebook account. Likewise, VPN software has been known to cause XEvil and XRumer to disconnect.

"Real time view" - check this option if you want to see the results in real time. Disable this option to reduce the processing power used.

"Delay between Requests in milliseconds" - keep this at 1000 (one second). This setting helps to ensure that you are not scraping too fast, and it tries to emulate human behaviour.
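To make the idea of a delay between requests more concrete, here is a minimal Python sketch of how a fixed pause can be placed between consecutive items. It is only an illustration and not the software's own code: the `throttled` function and its parameters are hypothetical names chosen for this example.

```python
import time

DELAY_MS = 1000  # the value recommended in this guide


def throttled(items, delay_ms=DELAY_MS, sleep=time.sleep):
    """Yield each item, pausing delay_ms milliseconds between consecutive items.

    The sleep argument is an assumption added for this sketch so that the
    pause can be replaced with a stub when experimenting.
    """
    for index, item in enumerate(items):
        if index > 0:
            # Wait before every item except the first one.
            sleep(delay_ms / 1000.0)
        yield item
```

With the default of 1000, looping over `throttled(urls)` would handle roughly one URL per second, plus however long each request itself takes.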

"Delete results without email tel" - if you are specifically scraping data with emails and telephone numbers, you can choose to delete any results that contain neither of the two.
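If you ever need to apply the same kind of filter to an exported file yourself, the logic can be sketched in a few lines of Python. This is an assumption-based illustration, not the software's implementation: the column names `email` and `tel` are hypothetical and may differ from the headers in your results file.

```python
def has_contact(row):
    """Return True if the row has a non-empty email or telephone value."""
    email = (row.get("email") or "").strip()  # "email" is an assumed column name
    tel = (row.get("tel") or "").strip()      # "tel" is an assumed column name
    return bool(email or tel)


def drop_without_contact(rows):
    """Keep only rows that contain an email, a telephone number, or both."""
    return [row for row in rows if has_contact(row)]
```

A row is kept as soon as one of the two fields is filled in, so a result with a telephone number but no email would survive the filter.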

"Complete previous search" - this is a very powerful feature. Check it if you would like to resume your previous search. If your search engine scraper and email grabber has crashed, you can always restart it and resume the previous search. Likewise, if you would like to stop scraping for the day and power off your PC, you can do so without the fear of losing your data. Simply close the software and, the next time you start it, check the "Complete previous search" option.
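For readers curious about how a resume feature of this kind can work in general, here is a small Python sketch. It assumes progress is tracked per keyword in a JSON file; that is an assumption made for illustration, and the function names and file format are hypothetical rather than a description of how the software stores its state.

```python
import json
import os


def load_done(path):
    """Return the set of keywords already processed, or an empty set."""
    if not os.path.exists(path):
        return set()
    with open(path, "r", encoding="utf-8") as handle:
        return set(json.load(handle))


def save_done(path, done):
    """Write the processed keywords to disk so a restart can pick them up."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(sorted(done), handle)


def run_keywords(keywords, path, process):
    """Process each keyword once, skipping any finished in an earlier run."""
    done = load_done(path)
    for keyword in keywords:
        if keyword in done:
            continue
        process(keyword)
        done.add(keyword)
        save_done(path, done)  # checkpoint after every keyword
    return done
```

Because the checkpoint is written after every keyword, a crash or shutdown would lose at most the keyword that was in progress at the time.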

"Import" - you can import a CSV file from a previous search if you would like to use your existing results as a starting point for your search. The web scraper and email extractor will simply add newly scraped data on top of your existing database. This is very useful for building your sales leads.
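As a rough illustration of what building on an existing results file can look like, the Python sketch below merges new rows into previously exported rows. It makes two assumptions that the guide does not confirm: that the file has an `email` column, and that duplicate emails should be skipped. The function names are hypothetical.

```python
import csv
import io


def read_rows(csv_text):
    """Parse CSV text with a header line into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def merge_rows(existing, new, key="email"):
    """Append new rows to existing ones, skipping repeated key values.

    Comparison is case-insensitive; rows with an empty key are always kept.
    """
    seen = {row[key].lower() for row in existing if row.get(key)}
    merged = list(existing)
    for row in new:
        value = (row.get(key) or "").lower()
        if value and value in seen:
            continue  # already present in the imported results
        if value:
            seen.add(value)
        merged.append(row)
    return merged
```

To try it on a real export, you could pass the contents of your own CSV file to `read_rows` and then call `merge_rows` with the two lists.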

At the bottom of the data extractor, you will see the CPU usage, RAM usage, proxies status, threads, total keywords, total search engines and export folder. This gives you a visual reminder of your settings and an idea of how your scraper is performing in terms of resources.

Inside the help section, you can check for an update and report a bug or a problem that you are experiencing directly to our team. You can also do this on our website.
