THE BOOK OF BLOGS
This article is from:
http://en.wikipedia.org/wiki/Spamdexing

All text is available under the terms of the GNU Free Documentation License: http://en.wikipedia.org/wiki/Wikipedia:Text_of_the_GNU_Free_Documentation_License 

Spamdexing


Spamdexing is any of various methods to manipulate the relevancy or prominence of resources indexed by a search engine, usually in a manner inconsistent with the purpose of the indexing system. Search engines use a variety of algorithms to determine relevancy ranking. Some check whether the search term appears in the META keywords tag; others check whether it appears in the body text or URL of a web page. Many search engines check for instances of spamdexing and will remove suspect pages from their indices.

The rise of spamdexing in the mid-1990s made the leading search engines of the time less useful. Google's success at both producing better search results and combating keyword spamming, through its reputation-based PageRank link analysis system, helped it become the dominant search site late in the decade, a position it still holds. Although Google has not been rendered useless by spamdexing, it has not been immune to more sophisticated methods either. Google bombing is another form of search engine result manipulation, which involves placing hyperlinks that directly affect the rank of other sites.[1] Google first combated Google bombing algorithmically on January 25, 2007.

The earliest known reference to the term spamdexing is by Eric Convey in his article "Porn sneaks way back on Web," The Boston Herald, May 22, 1996, where he said:

The problem arises when site operators load their Web pages with hundreds of extraneous terms so search engines will list them among legitimate addresses. The process is called "spamdexing," a combination of spamming — the Internet term for sending users unsolicited information — and "indexing."[2]

Common spamdexing techniques can be classified into two broad classes: content spam and link spam.

Content spam

These techniques involve altering the logical view that a search engine has of the page's contents. They all target variants of the vector space model for information retrieval on text collections.

Hidden or invisible text:

  • Disguising keywords and phrases by making them the same (or almost the same) color as the background, using a tiny font size, or hiding them within the HTML code in places such as "noframes" sections, ALT attributes and "noscript" sections. This makes a page appear relevant to a web crawler and therefore more likely to be found. Example: A promoter of a Ponzi scheme wants to attract web surfers to a site where he advertises his scam. He places hidden text appropriate for a fan page of a popular music group on his page, hoping that the page will be listed as a fan site and receive many visits from music lovers. However, hidden text is not always spamdexing: it can also be used to enhance accessibility.
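As a rough illustration of how the color and font-size tricks above might be spotted, the following sketch inspects an inline style string. The function names, the default white page background, and the 1px cut-off are assumptions made for this example; real crawlers would also need to resolve stylesheets, which is not attempted here.

```python
def parse_inline_style(style: str) -> dict:
    """Split an inline CSS style string into a {property: value} dict."""
    declarations = {}
    for part in style.split(";"):
        if ":" in part:
            name, value = part.split(":", 1)
            declarations[name.strip().lower()] = value.strip().lower()
    return declarations


def looks_hidden(style: str, page_background: str = "#ffffff") -> bool:
    """Heuristic only: text color equals the background, or the font is tiny."""
    css = parse_inline_style(style)
    # Assumption: fall back to the page background when none is set inline.
    background = css.get("background-color", page_background.lower())
    if css.get("color") == background:
        return True
    size = css.get("font-size", "")
    if size.endswith("px"):
        try:
            return float(size[:-2]) <= 1.0  # hypothetical "tiny font" cut-off
        except ValueError:
            return False
    return False
```

A match is only a hint, since (as noted above) hidden text can also serve accessibility purposes.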

Keyword stuffing:

  • This involves the calculated placement of keywords within a page to raise the keyword count, variety, and density of the page. Older versions of indexing programs simply counted how often a keyword appeared, and used that to determine relevance levels. Most modern search engines have the ability to analyze a page for keyword stuffing and determine whether the frequency is consistent with other sites created specifically to attract search engine traffic.
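The simple counting approach attributed to older indexing programs can be sketched as a keyword density calculation. This is a minimal sketch, not how any particular search engine works; the function names and the 0.3 threshold are assumptions chosen for illustration.

```python
import re
from collections import Counter


def keyword_density(text: str) -> dict:
    """Return each word's share of the total word count in the text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if not words:
        return {}
    counts = Counter(words)
    total = len(words)
    return {word: count / total for word, count in counts.items()}


def stuffed_keywords(text: str, threshold: float = 0.3) -> list:
    """List words whose density exceeds an (arbitrary) threshold."""
    density = keyword_density(text)
    return sorted(word for word, share in density.items() if share > threshold)
```

For instance, `stuffed_keywords("cheap pills cheap pills cheap pills buy cheap pills now")` flags both "cheap" and "pills", each of which makes up 40% of the words.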

Meta tag stuffing:

  • Repeating keywords in the Meta tags, and using keywords that are unrelated to the site's content. This technique has been believed to be ineffective since 2005.

"Gateway" or doorway pages:

  • Creating low-quality web pages that contain very little content but are instead stuffed with very similar keywords and phrases. They are designed to rank highly within the search results, but serve no purpose to visitors looking for information. A doorway page will generally have "click here to enter" in the middle of it.

Scraper sites:

  • Scraper sites, also known as Made for AdSense sites, are created using various programs designed to 'scrape' search engine results pages or other sources of content and create 'content' for a website. The specific presentation of content on these sites is unique, but is merely an amalgamation of content taken from other sources, often without permission. These types of websites are generally full of advertising, or redirect the user to other sites.

Link spam

Link spam takes advantage of link-based ranking algorithms, such as Google's PageRank algorithm, which ranks a website higher the more other highly ranked websites link to it. These techniques also aim at influencing other link-based ranking techniques such as the HITS algorithm.
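To make the idea of link-based ranking concrete, here is a simplified power-iteration sketch in the spirit of PageRank. It is not Google's implementation; the function name, the damping factor of 0.85, the iteration count, and the handling of pages without outgoing links are all assumptions of this example.

```python
def pagerank(links: dict, damping: float = 0.85, iterations: int = 50) -> dict:
    """Simplified PageRank: `links` maps each page to the pages it links to."""
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page receives a small baseline share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its rank on, split across its outgoing links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Assumption: a page with no links spreads its rank evenly.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank
```

With `{"a": ["c"], "b": ["c"], "c": ["a"]}`, page "c" ends up ranked highest because two pages link to it, which is the property the link spam techniques below try to exploit.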

Link farms:

  • Creating tightly-knit communities of pages referencing each other, also known humorously as mutual admiration societies. [1]

Hidden links:

  • Putting links where visitors will not see them in order to increase link popularity.

"Sybil attack":

  • This is the forging of multiple identities with malicious intent, named after the famous multiple personality disorder patient Shirley Ardell Mason. A spammer may create multiple web sites at different domain names that all link to each other, such as fake blogs known as spam blogs.

Wiki spam:

  • Using the open editability of wiki systems to place links from the wiki site to the spam site. Often, the subject of the spam site is totally unrelated to the page on the wiki where the link is added. While many powerful tools exist to filter or block email spam, there are very few tools for blocking wikispam.

Spam in blogs:

  • This is the placing or solicitation of links randomly on other sites, placing a desired keyword into the hyperlinked text of the inbound link. Guest books, forums, blogs and any site that accepts visitors' comments are particular targets and are often victims of drive-by spamming, where automated software creates nonsense posts with links that are usually irrelevant and unwanted.

Spam blogs (also known as splogs):

  • A spam blog, in contrast, is a fake blog created exclusively with the intent of spamming. Spam blogs are similar in nature to link farms.

Page hijacking:

  • Page hijacking is achieved by creating a rogue copy of a popular website which shows contents similar to the original to a web crawler, but redirects web surfers to unrelated or malicious websites.

Referer log spamming:

  • When someone accesses a web page (the referee) by following a link from another web page (the referer), the referee is given the address of the referer by the person's internet browser. Some websites have a referer log which shows which pages link to that site. By having a robot randomly access many sites enough times, with a message or specific address given as the referer, that message or internet address then appears in the referer log of those sites that have referer logs. Since some search engines base the importance of sites on the number of different sites linking to them, referer-log spam may be used to increase the search engine rankings of the spammer's sites, by getting the referer logs of many sites to link to them.
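The referer mechanism described above can be illustrated by tallying the referer field of a web server access log. This sketch assumes log lines in the common "combined" layout (request, status, size, referer, user agent, with the last two in double quotes); the regular expression and function name are assumptions for this example, and other log layouts would need a different pattern.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Assumed layout: "REQUEST" STATUS SIZE "REFERER" "USER-AGENT" at end of line.
LOG_LINE = re.compile(
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"\s*$'
)


def referer_hosts(log_lines) -> Counter:
    """Count how often each host appears as the referer in the log."""
    hosts = Counter()
    for line in log_lines:
        match = LOG_LINE.search(line)
        if not match:
            continue  # line does not follow the assumed layout
        referer = match.group("referer")
        if referer in ("", "-"):
            continue  # the browser sent no referer
        host = urlparse(referer).hostname
        if host:
            hosts[host] += 1
    return hosts
```

A host that appears very often as a referer, yet does not actually link to the site, is a candidate for referer-log spam; checking for the actual link is left out of this sketch.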

Buying expired domains:

  • Some link spammers monitor DNS records for domains that will expire soon, then buy them when they expire and replace the pages with links to their pages.

Some of these techniques may be applied to create a Google bomb, that is, to cooperate with other users to boost the ranking of a particular page for a particular query.

Other types of spamdexing

Mirror websites:

  • Hosting of multiple websites all with conceptually similar content but using different URLs. Some search engines give a higher rank to results where the keyword searched for appears in the URL.

URL redirection:

  • Taking the user to another page without his or her intervention, e.g. using META refresh tags, Java, JavaScript or server-side redirects.
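Of the redirection methods listed, the META refresh tag is the easiest to spot in a page's markup. The following sketch uses Python's standard `html.parser` module to list such tags; the class and function names are assumptions for this example, and JavaScript or server-side redirects are not covered.

```python
from html.parser import HTMLParser


class MetaRefreshFinder(HTMLParser):
    """Collect (delay, url) pairs from <meta http-equiv="refresh"> tags."""

    def __init__(self):
        super().__init__()
        self.targets = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = {name: (value or "") for name, value in attrs}
        if attributes.get("http-equiv", "").lower() != "refresh":
            return
        # The content attribute typically looks like "0; URL=http://...".
        delay, _, rest = attributes.get("content", "").partition(";")
        rest = rest.strip()
        if rest.lower().startswith("url="):
            self.targets.append((delay.strip(), rest[4:].strip()))


def find_meta_refresh(html: str) -> list:
    """Return all META refresh redirects found in an HTML string."""
    finder = MetaRefreshFinder()
    finder.feed(html)
    return finder.targets
```

A refresh with a delay of zero seconds to a different site is the pattern most often associated with doorway-style redirection, although META refresh also has legitimate uses.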

Cloaking refers to any of several means to serve up a different page to the search-engine spider than will be seen by human users. It can be an attempt to mislead search engines regarding the content on a particular web site. However, cloaking can also be used to ethically increase accessibility of a site to users with disabilities, or to provide human users with content that search engines aren't able to process or parse. It is also used to deliver content based on a user's location; Google itself uses IP delivery, a form of cloaking, to deliver results.

A form of this is code swapping, that is, optimizing a page for top ranking and then swapping another page in its place once a top ranking is achieved.
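One way to look for cloaking is to request the same URL twice, once identifying as a search-engine spider and once as an ordinary browser, and compare the two responses. The sketch below covers only the comparison step, using the standard `difflib` module; fetching the pages is left out, and the function names and the 0.5 threshold are assumptions for illustration. A low similarity is only a hint, since legitimate sites also vary content by user or location, as noted above.

```python
from difflib import SequenceMatcher


def content_similarity(body_for_crawler: str, body_for_browser: str) -> float:
    """Return a similarity ratio between 0.0 (different) and 1.0 (identical)."""
    return SequenceMatcher(None, body_for_crawler, body_for_browser).ratio()


def looks_cloaked(body_for_crawler: str, body_for_browser: str,
                  threshold: float = 0.5) -> bool:
    """Flag a page when the two versions differ more than the threshold allows."""
    return content_similarity(body_for_crawler, body_for_browser) < threshold
```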

The following techniques are also widely acknowledged as being spam, or "black hat":

  • Doorway pages
  • Link farms
  • Googleating

References

  1. ^ Deconstructing Google bombs
  2. ^ Word Spy: spamdexing

See also

  • Google bomb
  • Google juice
  • Link farm
  • TrustRank
  • 302 Google Jacking
  • Index (search engine) - overview of search engine indexing technology

External links

To report spamdexed pages

  • Found on Google search engine results
  • Found on Yahoo! search engine results
  • Found on MSN search engine results

Search engine help pages for webmasters

  • Google's Webmaster Guidelines page
  • Yahoo!'s Search Engine Indexing page
  • MSN Search's Site Owner page

Other tools and information for webmasters

  • Online tool that detects spam techniques on web pages
  • A paper explaining various methods to determine webpage/blog spam
  • AIRWeb '05: First International Workshop on Adversarial Information Retrieval on the Web
  • AIRWeb 2006: Second International Workshop on Adversarial Information Retrieval on the Web
  • A list of open proxy and bot IPs. Ban IPs on this list to prevent comment spam. Updated weekly.