Thursday, June 09, 2005

LSI...

I'm learning about Latent Semantic Indexing (LSI). Not in my free time, just at work.

In the coming years, search engines may start utilizing LSI to rank websites better based on LSI.

"Latent semantic indexing (LSI) is one of the most sophisticated modern attempts at high quality automatic indexing. It is based on co-occurrence clustering of terms and the identification of documents associated with these term clusters. By relying on co-occurrence data, LSI is also able to deal with the problem of the variety of terms that can be used to express similar concepts. For example, both ³lawyers² and ³attorneys² are likely to belong to the same cluster with related terms such as ³courts,² ³trials,² ³judges,² ³sentencing,² etc"

In other words: an intelligent synonym finder.

For more information (if you can't sleep)... National Institute for Technology and Liberal Education

No comments: