Simple and free PHP example parser statistics keywords with wordstat Yandex.

It is clear that before promoting a site, you need to decide on the keywords. it is not difficult, but painstaking work. In order to find something worthwhile, you need to shovel a bunch of data. Therefore, you cannot do without process automation tools. In this note, I want to focus on PHP creation data parser with wordstat Yandex.

So. The main problem when parsing data from the wordstat Yandex keyword statistics service is the presence of captcha. It is not so difficult to get around it. It is enough to pass the fuid01 cookie generated when processing the captcha in the request. In other words, you need to go to the service, make a request, specifying the characters from the picture and get the content of the required cookie.

How to get the content of fuid01 cookie in Firefox browser?

Because I'm not going to aim at epics and other great works of mankind, I will only describe the process of obtaining the content of fuid01 cookies in Firefox browser(I am using version 8.0). In general, let's launch Firefox. We believe that the query in wordstat has already been made and the cookie has been created. Click the "Firefox" button in the upper left corner of the browser window. In the menu, select: Settings> Preferences(I am not confusing anything).

In the "Settings" window that opens, go to the "Privacy" tab. Here we are interested in the "History" block. In the list, select Firefox "will use your history storage settings" and click the "Show cookies ..." button that appears.

In the "Cookies" window, in the "Search" field, enter the name of the cookie we are interested in, i.e. "Fuid01". The list should display what was found. Select one of the proposed cookies and in the information field, select and copy its "Content".

How to work with PHP parser wordstat Yandex

You can download PHP parser wordstat Yandex for free here... I must say right away that this is just an example, the job of which is to parse keywords and display them on the screen, but all in order.

The first thing you need to understand is that all data is encoded in UTF-8. So if anything, don't forget to convert the data. Moreover, on some servers this may be a problem, in more detail. The next nuance is that the script will need support for the script to work. Otherwise, everything is quite simple.

We assign the content of the fuid01 cookie to the $ fuid01 variable. In fact, this value is set in curl_setopt () via CURLOPT_COOKIE, but for convenience I have deduced it separately. Next, we are interested in the $ params array - these are the variables passed in the query to wordstat Yandex. As an example, I limited myself to the simplest option, so there were no dynamics. In particular, only the first page of the issue is parsed: "page" => 1, the text value is obtained via GET, but for the region it is selected Moscow: "geo" => 1.

It is clear that the region identifier, if another is needed, will have to be clarified. To do this, go to wordstat Yandex, click on the link "Specify the region ..." and select the required one.

Having made a request, you need to look at the value of the required parameter in the URL. It should be noted that if more than one region is selected, their IDs will be listed separated by commas.

Next comes a request to the statistics service and parsing wordstat Yandex data. The latter has one small nuance. The fact is that wordstat Yandex displays statistics in the form of two tables: “what were they looking for with the word ...” and “what else were people looking for, who were looking for ...” - I used only the first one. However, there is nothing complicated there. Regular expressions are pretty simple. I think you can figure it out. Good luck!

) For the SE :: Yandex :: WordStat parser to work, Yandex accounts are required. Accounts can be registered using a parser or simply add existing accounts to the files / SE-Yandex / accounts.txt file in the format:

[email protected]; password
[email protected]; password2
...

Click to Expand ...

Collected data (top)

  • Number of impressions for the specified request
  • Statistics update date
  • List of all keywords associated with the specified and the number of their impressions per month
  • List of all additional keywords that users were looking for and the number of their impressions per month

Capabilities (top)

  • Parses the maximum number of results returned by the wordstat - 40 pages of 50 search items each
  • Supports selection of search region (with subgroups)
  • Can automatically substitute found keywords into queries (option Parse to level)
  • The ability to select several regions at once for assessment
  • The ability to bypass captchas using the AntiCaptcha service or any other API supporting them

Use cases (top)

  • Estimation of the amount of traffic by keyword (frequency)
  • Search for new keywords with similar topics
  • Collecting large databases of keywords of various topics
  • Any other options implying parsing Yandex.WordStat in one form or another

Inquiries (top)

  • Key words must be specified as queries, just as if they were entered directly into the Wordstat search form, for example:

windows moscow
"windows moscow"
! windows! moscow

Click to Expand ...

results (top)

  • As a result, the original query, the number of its impressions, the date of statistics update, a list of related keywords and their impressions per month, a list of additional keywords and their impressions per month are displayed.

Windows! Moscow - 10368, updated: 16/05/2013
keywords:
windows moscow: 32367
plastic windows moscow: 8994
pvc windows moscow: 4813
buy windows moscow: 2561
price windows moscow: 1706
Moscow window work: 1547
vacancies windows moscow: 1187
wooden windows moscow: 1087
service + one window Moscow: 1021
...
additional keywords:
production of pvc windows: 8512
windows rehau: 15686
salamander windows: 1576
windows kbe: 3798
windows kbe: 6089
windows kwe: 3227
balcony glazing: 83216
gazebos: 471213
glazing of loggias: 26366
office partitions: 18740
window assembly: 26223
...

Click to Expand ...

Possible settings (top)

Parameter Default value Description
Pages count 10 Number of pages to parse
Region AllSearch region
Remove + from keywords Remove plus sign (+) from found queries
Use AntiGate Determines whether to use AntiGate to bypass captchas
AntiGate preset defaultYou must first configure the Util :: AntiGate parser - specify your access key and other parameters, and then select the created preset here
AntiGate preset for Login defaultAntiGate preset for login. You must first configure the parser Util :: AntiGate with parameters, and then select the created preset here
Use Accounts
First sleep 50 Delay after the first request when using AntiGate to save captchas
Use session Retains good sessions for future reference
Mobile only Get statistics for mobile traffic only
Remove bad accounts Automatic deletion of accounts with an incorrect username / password or requiring confirmation by phone

Yandex Wordstat is one of the basic services for SEO-optimizers working in the Russian Internet. The standard tool of the system allows you to search for key phrases from the search results - high, medium and low frequency - using operators for this to increase search efficiency and setting up geolocation. Among SEOs, the process of collecting key queries that are relevant to the topic of the site is called parsing.

Experts carry out parsing Yandex to form a semantic core - a linguistic "backbone" for future content, which most accurately reflects the niche or service that the site offers. For this, there is its own search tool in Yandex - Yandex Wordstat Assistant, which can be linked to your account for free. With its help, it is convenient to collect and upload the collected keywords to an Excel spreadsheet.

You can use this tool, but only if you have a relatively small site where the semantic core is no more than a thousand requests. If the amount of work involved in collecting the semantic core is much greater (for example, for an online store with many products), then it will be unreasonably long to parse Yandex results manually using Wordstat Assistant. It is more expedient to do this with the help of special programs and online services - free, shareware and paid (for example, Key Collector).

Let's consider the basic principles of working with Wordstat, as well as popular tools for automatically collecting search queries.

How to parse correctly in Wordstat: an overview of programs for analyzing search results

Parsing Yandex SERPs is the process of collecting search queries on a given topic (using basic markers). With Yandex Wordstat Assistant, you can easily get basic information for the formation of the semantic core - search queries, the number of impressions per month for each phrase, as well as words similar in topic that help to expand the reach target audience. Parser Wordstat allows you to get information by words and by region, while the system focuses specifically on the sites.

If you want to perform an advanced search that includes not only Yandex, but also social networks and other search engines, you need to use universal tools, for example, A-Parser. This software product is easy to use, but to work effectively with it, you need to buy a proxy for parsing.

In whatever environment you work, use the universal scheme for collecting the semantic core:

  • formation of a basic list of markers (queries that are uniquely relevant to the site);
  • collection of keywords;
  • removal of "garbage" - unnecessary and accidentally included in the list of requests;
  • clustering of keywords by site sections or topics.

We consider the main algorithm for working with Wordstat and Yandex parsim for collecting semantics using the example of a real estate agency website. Let's say we need to compile a list of relevant keywords that correspond to the base marker "apartment rent".


Advice! Pay attention to the clues like “search along with this” in the next search results box - this will help you reach a wider audience when composing your SEO content.

To make it easier for yourself the task of manual parsing, use the operators Yandex. For example, by putting a "!" Sign at the end of a search phrase, you will fix the end of the search query, and the system will search for words in this particular word form. The operator "-" (minus) will allow you to remove unnecessary words, and the square brackets "" - to fix the order of words in the query.

Experts advise to combine manual and automatic selection of queries to compose a semantic core, especially for beginners. Using the standard Yandex Wordstat Assistant tool, you develop the skills of intuitive selection of search phrases that lead to the site target customers using medium and low frequency keys. High-frequency phrases don't always work, especially in a competitive niche.

If you do not have time for manual parsing in Yandex Wordstat, use special tools. Various things can be found on the Internet software, but most Russian-speaking SEO-optimization specialists do parsing of Yandex search results using Key Collector.

This is a desktop product that allows you to create and store projects for each site in the local computer memory, upload and save files and do keyword parsing in accordance with regional settings. The program requires linking to an account. To work with key search queries in the Key Collector, there are icons of the main search engines in Runet (in our case, it is a Yandex parser, although you can choose Google, Bing and others).

Other useful SEO services include:

  • Serpstat- a multifunctional platform for professionals with a trial version with limited functionality, as well as a paid subscription from $ 19 to $ 299 per month;
  • Ahrefs is a web service with many useful options, including niche monitoring, competitor analysis and site indexing improvement. To collect the semantic core, the Keywords Explorer tool is provided. You can test it from $ 7 per week;
  • Semrush- an analogue of Ahrefs in terms of functionality, more expensive in terms of tariffs (from $ 99 and above).

Experts say that Kay Collector is the most convenient and functional program, which makes the life of the optimizer much easier. It has many useful options for fine-tuning Yandex parser parameters (for example, search depth, selective search for queries by base frequency, etc.).

But the program has a nuance - it is paid. The cost of the license is 1800-1900 rubles for electronic and non-cash payments, respectively.

Advice! If for some reason you do not want to use this product, you can try its free analogue "Slovoyob". A simpler option is also suitable - Bukvarix - free service to collect keywords and form a semantic core.

Parsing Yandex Wordstat can be done independently and using special programs. Manual collection through the Wordstat Assistant tool pays off if your niche has a narrow focus and few competitors, and the list of search queries is relatively small. For large volumes of work, it is recommended to use special programs for parsing and analytics.

Download parser archive
This parser collects keywords and frequency into an Excel file from the Yandex wordstat service.
If your frequent tasks include the collection of statistics on keywords from the yandex waterstat service, then one of the ways to optimize the workflow is to delegate the task of parsing keywords to the parser.
This parser is not original in functionality, but original in the simplicity of settings and data retrieval by keywords.


The functionality of this parser allows you to collect key words from the wordstat.yandex.ru service, statistics of the requested keys collected, the exact frequency of keys, as well as ungroup the collected keys into clusters.

Parser settings:

1. Login from yandex.ru - in this field you must enter the address of your mailbox from Yandex mail

2. Password from yandex.ru - in this field you must enter the password from your mail on Yandex

3. File with keywords - this field specifies a file with a list of keywords (the file must be saved in utf-8 format, each key on a new line), if you plan to activate clustering of queries after collection, then this file should contain only 1 basic passphrase (all words are in lowercase only (in small letters))

4. File with negative keywords - this field specifies a file with a list of negative keywords (the file must be saved in utf-8 format, each negative word on a new line)

5. The depth of capture - to what depth to parse

6. Explore results - this option activates the stage of query clustering after the keys are collected by the base phrase (by activating this option, you must specify the file to save the study)

7. Check for exact frequency - this option activates the collection of the exact frequency for the collected keys

8. Deep Scan - this option activates deep scan

9. Key for captchas - optional

10. File for saving keys - Excel file, into which keywords from the wordstat.yandex.ru service will be saved

11. File for saving the study - Excel file in which the grouped clusters will be saved after clustering

If you have any questions or wishes - write to the mail [email protected]
skype - vipvodu
Download parser archive

An example of a file with clusters by key is a client base.

Advice:
1. Check the settings carefully before starting.
2. If the program, after pressing the button - start - turns off - then most likely something is wrong with the settings, or there are hanging processes after the previous interrupted parsing, hanging processes must either be killed in the task manager, or restart the computer.
3. Do not interrupt parsing, wait for the inscription - All data is saved