Search spam. What is search spam: types of search spam - how to identify them What is search spam

Hello everyone!

Each of us is familiar with the word SPAM. Moreover, everyone has encountered it and seen what it is. For example, in his e-mail, in SMS messages, and almost everywhere. This concept also applies to the search results of Yandex, Google and other systems.

Search spam is sites or pages created for the purpose of manipulating search results. And, as a result, deceiving users and search engines.

Since the ranking of sites in search results is influenced by many factors, manifestations of search spam “put pressure” on them and the web resource quickly takes high positions. Simple re-optimization, in other words.

Basically, on such resources it is impossible to find an answer to a search query, or it is possible, but difficult, but it’s easy to catch a virus or “accidentally” install, for example, Amigo 😀

Types of search spam

There are several obvious manifestations of search spam:

  • Oversaturation of text content keywords;
  • Reference "explosion". A sharp increase in the number of external links to the resource;
  • Re-optimization of meta tags, ALTs of images;
  • A large amount of useless content;
  • 100% copying of content from other sites;
  • Promotion by behavioral factors.

Each of us can “sin” a little and over-optimize or simply suddenly purchase a large number of external links. All of the above is punished by search engines, usually by exclusion from the index or forced demotion in the search results. So be extremely careful when promoting your resource.

More detailed information You can read about penalties for this or that violation in Yandex.

Special ones are responsible for tracking the actions listed above. For example, if you made a mistake in forming a link mass, then with your own hands you increase the chances of getting into Minusinsk. For example, it is responsible for spamming text.

Once you fall under the filter, it will be very difficult for you to regain trust search engines— you can spend a lot of time on this: from a month to several years. Moreover, it is often impossible to return to the previous positions in search results.

To avoid falling under the filter, you just need to follow certain requirements:

  1. Link only to high-quality resources that are interesting to your visitors;
  2. Do not abuse too much advertising. Also, you cannot place “shocking” advertising;
  3. Use external links sparingly. In details ;
  4. And finally, don’t deceive your visitors.

Other types of search spam

There are other types of search spam that you may also have encountered:

  • Doorway sites are web resources that are actively promoted in the search engine in order to automatically redirect traffic to the advertising page of another site;
  • Cloaking is pages and sites that provide different content to users and search engine robots in order to influence search engine rankings. In short, the robot sees one thing, and you see something completely different;
  • Hidden text. Generating text content that is invisible to visitors and rich in big amount keywords;
  • Clickjacking is the placement of invisible elements on a website, when clicked on, some action occurs;
  • Content malware, viruses. A fairly common occurrence - a person enters , for example, request download driver for wifi modem and ends up on some site where he is asked to download this very driver. But instead of what he was looking for, a virus or some kind of malicious program appears on the computer.

All of the above are severely punished. If you encounter similar manifestations of search spam, feel free to write a complaint to Yandex or Google technical support - it will not go unnoticed.

Every year the number of sites hosted on the Internet increases exponentially. As a result, competition for a place in the TOP greatly increases (especially for high-frequency queries).

Webmasters and optimizers are forced to use a variety of methods to promote their own sites (their resources) in an ever-increasing competition.

And some of these methods are partially or completely prohibited by search engines.

Many users know firsthand about spam in their website, but search spam Not everyone knows.

Search engine spam – what is it?

The common name for prohibited optimization techniques that some webmasters sometimes use is search engine spam.

This name is due to the fact that search results are spammed with pages with irrelevant content due to the use of dishonest promotion methods. In other words,

search spam is when a user’s request produces content that does not correspond to this request (in the user’s opinion) and which should not be in the TOP (in the search engine’s opinion).

The presence of such spam pages in search results negatively affects people’s attitude towards search engines and reduces their level of trust.

Types of search spam

What is considered search spam? Let us list its main types.

  1. Stuffing content with keywords and phrases

The text itself, as well as descriptions of pictures and video files, meta tags, etc., can be filled to capacity with keys.

All this is done in the hope that the search engine algorithm will consider the page more relevant to these keywords. In fact, this method of SEO optimization has not worked for a long time. Webmasters who use it are more likely to get a ban for their site than to increase its position in the search results.

  1. Automatic redirect

This is an instant redirection of users from one page to another.

In this case, when visiting a page of a website, a person is instantly redirected to another site.

Often the user does not even have time to notice the redirect itself (since it happens automatically and very quickly). Most often, after a redirect, a person ends up on a page with advertising content that is spammed with links.

  1. Cloaca

In this case, for each of the promoted pages, the webmaster creates two versions at once.

  • The first version of the page is intended for search engines,
  • the second version is for ordinary users.

Thus, cesspools are different content for search engines and for users.

A special mechanism tracks who exactly visits the site - a search engine robot or an ordinary person. Based on this, one or another version of the page is displayed.

The page for search engines is very carefully optimized, it does not contain all unnecessary elements and details, but there are a lot of keywords for search engines. The page for ordinary visitors is made normal, as convenient and beautiful as possible (in terms of design and appearance).

And it seems like “the wolves are fed and the sheep are safe,” that is, the search engines are happy and the users are happy. But in fact, by using cesspools, the webmaster deceives the search engine, which, in turn, does not forgive such things and bans “forking” pages.

  1. Swaping

This term means a complete replacement of the content of a website page immediately after its successful indexing in search engines. The primary task of a webmaster or optimizer using swap is to fill the page with unique and high-quality content, promote it in search results and get good traffic from search engines.

Then, after the next update (periodic update of the search engine), the webmaster completely changes the content of his page. Instead of unique content, text appears there, stuffed with keywords and links to promoted resources (sites).

It is clear that with the next update, search engines will detect the substitution and pessimize the page. But until then, it will continue to collect traffic (visitors), being in the TOP for some time.

  1. Invisible text and links
  • You can use very small fonts,
  • you can make the font color and page background color the same (for example, white text on a white background),
  • you can use special rules CSS styles to mask links.
  • You can insert single-pixel images containing a link, etc. into the page.

How search engines fight search spam

Search engines do not like search spam not only for deception, but mainly because search spam misleads the user, the user remains dissatisfied and therefore leaves (may leave) to look for information on his request to another search engine. Search engines fight for their users, so they try not to disappoint them and provide only high-quality information in response to their requests.

This approach implies an irreconcilable fight by search engines against search spam. Search engines try to find spam, remove it from their database and punish (ban) such a site or page.

As for the ways to detect search spam, there are only three of them.

1) Automatic

In this case, search engine spam is detected using search engine algorithms. Based on the signs of a particular type of spam, a search is made for sites that use dishonest methods of promotion, and their subsequent pessimization.

2) Semi-automatic

In this case, the task of search algorithms is to search for suspicious sites and pages. The final decision on whether to ban or pessimize a site is made by the moderator () of the search engine.

3) Manual

Here, checking the site for involvement in the use of search spam from beginning to end is carried out by a moderator (assessor). Most often, such checks occur on the basis of complaints received from the owners of competing sites.

We released new book"Content marketing in in social networks: How to get into your subscribers’ heads and make them fall in love with your brand.”

Search spam – deception of the user

What is search spam and how to recognize it? From the point of view of an ordinary person, spam is intrusive advertising that appears instead of information that the user is trying to find. At its core, search engine spam or webspam is an attempt to manipulate the results of generated search results in order to promote low-quality sites to the TOP 10. Their content is often uninformative or does not meet the user’s needs.

More videos on our channel - learn internet marketing with SEMANTICA

What are the types of search spam?

There is a certain classification of search engines regarding spam. Both Yandex and Google urge webmasters and optimizers to refrain from the promotion methods listed below.

1. Excessive number of key phrases in the text. This is an attempt to “pump up” the text with keywords as much as possible in order to inflate its position in the search. How to spot spam of this type? This can be done based on some signs:

  • presence of automatically generated text;
  • repeated repetition of certain phrases;
  • highlighting keys with tags , ;
  • the presence of hidden text that blends into the background of the page.

2. . This term refers to intermediate web pages that redirect the visitor to another site. Most often, a doorway is a one-page website optimized for a list of key phrases. Doorways are created using tools like DMI, SEoDOR.

3. Link spam. In order to gain weight, a webmaster may try to use link spam, which includes:

  • mass acquisition of hyperlinks from automatic exchanges;
  • spam links received from blogs, forums, guest books;
  • creating a network of small .

Search engine spam and its consequences

Search engines are improving their algorithms in such a way as to exclude spam Internet resources from search results whenever possible. The pessimization methods applied to unscrupulous webmasters depend on the type of violation. For example, excessive concentration of keywords in the text leads to a decrease in the results of issuing a single document. However, the rest of the site continues to function normally.

Doorways detected by the search engine will be banned. The fate of satellite sites developed to promote the main Internet resource depends on their quality. If the webmaster created a satellite using unique and more or less high-quality texts, then such a site may remain in search.

Excessive purchase of links threatens to pessimize the promoted web resource. In order to combat attempts to manipulate search results, Yandex launched the “Minusinsk” algorithm in May 2015.

Promoting a website in search engines is task No. 1 for any webmaster and optimizer. After all, the number of site visitors and, ultimately, the profit brought by this site depends on high positions on queries. You can achieve good positions in the search using permitted or unauthorized methods, the latter includes search spam, or as it is called in Google- "webspam".

If you open the “License to use the Yandex search engine”, then clause 3.7. This license defines search engine spam as follows: “Search spam” is an attempt to deceive the Service’s search engine and manipulate its results in order to change the position of a particular website in search results. Websites that use “search spam” may be lowered in ranking or excluded from the Service database due to the impossibility of their correct ranking.— Thus, Yandex regulates webspam as deception of the PS and manipulation of search results, without specifically saying what kind of manipulations are meant.

The Google Corporation includes the well-known department Webspam Team, which is commanded by the well-known Matt Katz and this department is engaged in the fight against search spam. One of the latest creations of this department is the Google Penguin filter, which has made a lot of noise since the spring of 2012.

Google classifies the following as webspam, among other things:

  • Doorways
  • Hidden text and hidden links
  • Link exchange schemes
  • Masking and hidden redirect
  • Pages filled with irrelevant keywords
  • Pages or domains with almost the same content
  • Link exchange schemes

For all this, the site can be lowered in ranking or even thrown out of the search databases. Google recommends reporting sites that use illegal search spam methods through this page. It is possible that in this way someone is getting rid of competitors in search results.

From the above, you can understand that webspam is the manipulation of content and links in order to obtain high positions. There are a lot of methods of webspam as such, and there is no point in listing them all in this article.

Spam rate

When ranking sites, an indicator such as the spam rate of an individual web page and the entire website is used. This coefficient is constantly recalculated depending on the incoming data and affects the ranking in conjunction with other factors.

Link search spam

Link webspam most often includes:

  • Creation of sites (site networks) specifically for links
  • Link exchange
  • Trash links from comments
  • Unmoderated links
  • Links hidden
  • Cross-cutting links not related to the subject of the site
  • Purchased links with key entry
  • Links for manipulating PR and TIC

Text search spam

Text search spam most often comes down to keyword spamming

  • page text
  • headers
  • meta tags
  • links
  • presence of ks in the domain
  • etc.

In conclusion, it is worth saying that search engines have always been determined to fight search spam and this fight continues to this day. Moreover, the successes of search engines in this matter are visible to the naked eye. But it’s up to you to decide which promotion methods to choose for your websites.

Rand Fishkin on anchors and the future of links

Ultimately, to deceive the user.

Main types

  • Unrelated to the content of the page, but popular in search queries words in the tags “meta keywords”, “description”, for example “ sex», « freebie" As a result, search engines began to analyze not only special tags, but also the text of the site itself.
  • “Pumping” text with keywords is an artificial increase in the frequency of a keyword or expression in the text and (or) the use of HTML markup elements (h1-3, strong, b, em, i) to artificially increase the weight of the keyword.
  • “Invisible text” is text that is invisible to a page visitor, but is indexed by a search engine. Apply a text color that matches the background color, 1 pixel text, blocks of text, with the “display:none” style.
  • Link spam - links that “inflate” the “link popularity” parameter and PageRank of the site. Since search engines, when responding to a request, are guided by the number of links available on other sites on this resource, then the idea came up to somehow increase the number of such links:
    1. Create small websites on free hosting, register them in large quantities thematic catalogs and link from them to the main one.
    2. Take part in the exchange of links.
    3. Buy links for money.
    4. Link spam from guest books, blogs, wikis, etc.

Search engines combat this by creating filters that include sites whose links are not taken into account when ranking.

  • Doorways are intermediate pages created to increase the weight of the page for link ranking or to organize Google bombs. In accordance with doorway technology, a special doorway page must be promoted in the search index. And from this page redirect to advertising. One advertisement can have an unlimited number of doorways. Search engines respond by removing sites from their database that have automatic redirects. To which spammers respond with a simple trick: they ask the visitor to click on the “Login to the site” button or something similar.
  • Masking, or “cloaking” - analysis query variables, in which the search engine is given site content that is different from what the user sees.

Consequences of using search spam

  • If early search engines could trust keywords and indications of update frequency, then, due to the active use of these methods to “deceive” search engines, later versions of search engines were forced to almost completely ignore these instructions, being critical of each of the pages of the site, which made it difficult to find “respectable” pages with rare content and specified keywords. For example, a page with the text of a medieval song and the keywords “Middle Ages, poetry, Eastern Europe”, which does not have a large number of links from other sites, and does not contain the words “Middle Ages, poetry” in the text, is unlikely to be found using these keywords.
Internet