 What is LSI
Shawn Tracer
« Posted: March 04, 2008, 11:20:31 AM »


What is LSI
 by: Rakesh Ojha

What is Latent Semantic Indexing (LSI)?

Latent Semantic Indexing, or LSI, has changed the world of search engine optimization. One fine morning, SEO experts found that most of their best-ranking sites on Google were in jeopardy. Google had simply updated its crawler program to accommodate LSI and moved towards a more relevant ranking list!

LSI is a methodology that uses statistical probability and correlation to deduce the semantic distance between words. The underlying mathematics is complex, but the method can be readily applied to understand the relation between words in a paragraph or a document. Search engines use it when indexing a page in their database.

Delving deeper, LSI does not just scan a single document for keywords and list it in the database. It also studies a collection of documents and identifies the words they have in common. From this it infers the semantic relation between the words used in these documents. The process then finds which other documents include or make use of these semantically close words. According to latent semantic indexing, the resulting documents are indexed as related, or closely relevant, to a context.
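The first step described above, counting words across a collection and spotting the ones documents share, can be sketched in a few lines of Python. This is only an illustration: the function names and sample sentences are invented for the example, and a full LSI implementation would go on to factor this matrix (typically with a truncated singular value decomposition), which is not shown here.

```python
from collections import Counter

PUNCTUATION = ".,;:!?'\"()"

def tokenize(text):
    """Lowercase the text, split on whitespace, and strip basic punctuation."""
    words = (w.strip(PUNCTUATION).lower() for w in text.split())
    return [w for w in words if w]

def term_document_matrix(docs):
    """Return (vocab, matrix) where matrix[i][j] counts term i in document j."""
    counts = [Counter(tokenize(d)) for d in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for c in counts] for term in vocab]
    return vocab, matrix

def shared_terms(vocab, matrix, min_docs=2):
    """Terms that occur in at least `min_docs` documents of the collection."""
    return [
        term
        for term, row in zip(vocab, matrix)
        if sum(1 for n in row if n > 0) >= min_docs
    ]

# Hypothetical sample collection: two related documents and one unrelated one.
docs = [
    "effective habits build independence",
    "habits of effective people create synergy",
    "bake the bread until golden",
]
vocab, matrix = term_document_matrix(docs)
print(shared_terms(vocab, matrix))
```

Each row of the matrix is one word and each column is one document, so words that keep turning up in the same columns are the candidates for being "semantically close".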

LSI regards documents that share a certain proportion of frequently used words as semantically close. Documents with few words in common are considered semantically distant. LSI therefore introduces a graded measure of relatedness, rating the relevance of any document on a scale of 0 to 1. Unlike a regular keyword search, LSI can express how close one document is to another, or how relevant a document is to a particular context.
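One common way to get a closeness score between 0 and 1 is cosine similarity over word counts. The article does not say which measure a search engine actually uses, so treat the sketch below as an assumption: it compares raw word-count vectors directly, whereas LSI would make the same kind of comparison after reducing the counts to a smaller set of latent dimensions. The sample texts are made up.

```python
import math
from collections import Counter

def term_counts(text):
    """Count how often each lowercase word occurs in the text."""
    return Counter(text.lower().split())

def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(n * n for n in a.values()))
    norm_b = math.sqrt(sum(n * n for n in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

# Hypothetical documents: two on the same topic, one unrelated.
covey_a = term_counts("effective habits paradigm interdependence habits")
covey_b = term_counts("habits paradigm effective public victory")
recipe = term_counts("flour butter sugar oven")

print(round(cosine_similarity(covey_a, covey_b), 3))  # shares several words
print(cosine_similarity(covey_a, recipe))             # shares no words
```

Because word counts are never negative, the score stays between 0 (no words in common) and 1 (identical word proportions), which matches the scale described above.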

Let's consider an example. In a document that discusses Stephen Covey and his teachings, words such as 'effective', 'habits', 'interdependence', 'independence', 'synergic', 'paradigm', 'continuum', 'public victory', 'private victory', 'circle of influence' and so on would be found frequently. Once a search engine indexing tool that uses the LSI technique recognizes these commonly used words in a given set of documents, it can find other documents or Web pages on the net that contain the same set of keywords with similar frequency, and index them in the database alongside the relevant context (Stephen Covey and his teachings).

Now compare this simple method with the human brain's approach to searching for a context. If you are given a set of documents and asked to locate the ones that discuss a particular context, what will you do? Most people will look for what the sample documents have in common and use that observation to compare and classify the rest. LSI adds this kind of intelligence to otherwise lifeless crawler software.

Quite obviously, the LSI algorithm doesn't understand anything about the meaning of a word in a document. It just reads the pattern of usage of particular words and calculates the correlation of their occurrence, and hence their correlation with a particular context. Let's get into the practical side of it, that is, how it is applied in search engine techniques.
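To make "correlation of occurrence" concrete, here is a rough sketch that marks whether each word appears in each document and then computes the Pearson correlation between two words. The choice of Pearson correlation, the helper names and the sample documents are all assumptions made for illustration; the article does not specify the exact statistic, and nothing in the code knows what any word means.

```python
import math

def occurrence_vector(word, docs):
    """1 if the word appears in a document, else 0, for each document in order."""
    return [1 if word in doc.lower().split() else 0 for doc in docs]

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)

# Hypothetical collection: three documents on one topic, two on another.
docs = [
    "effective habits and paradigm",
    "paradigm shift builds effective habits",
    "habits of effective people",
    "preheat the oven for bread",
    "bread needs a hot oven",
]
habits = occurrence_vector("habits", docs)
effective = occurrence_vector("effective", docs)
oven = occurrence_vector("oven", docs)

print(pearson(habits, effective))  # the two words always appear together
print(pearson(habits, oven))       # the two words never appear together
```

Words that keep appearing in the same documents get a correlation near 1, and words that avoid each other get a negative value, purely from usage patterns.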

About The Author

Rakesh Ojha is a successful Internet marketer utilizing both pay-per-click marketing and search engine optimization to increase website traffic. To learn more, visit http://www.sem.mosaic-service.com.

Copyright © 2006-2023 TechnoWorldInc.com. All Rights Reserved.