What is a search engine indexer?
A search engine is made up of many complex parts, which vary from search engine to search engine. Although there are two main parts, which all search engine have to have to call them self’s a search engine. Firstly a Spider/Crawler which we covered earlier, is what goes out in to the web and gathers the Data. Now logically something has to happen to this data for it to provide accurate results against a search engine query (what you type in the search box), this is where the indexer comes into play. The indexer takes the data collected by the spiders then analyzes, stores and catalogues the information. This is done so there is a set structure, for retrieval against a given query. Think of the indexer as a Super Liberian, taking a set of books analyzes them for there topic then storing them into relevant place on the shelf’s. They also place them within a category in a computer so when a customer ask for a book on a certain topic the Liberian can type the information in to the computer and give the exact location of the said book or books based on the topic requested.
That was a very loose interpretation of what a indexer does. In fact what a search engine indexer does is a lot more complicated than that, especially when it comes to cataloging information. First because of the shear scale of the information it has to categorize, as this much information and many process has to be undertook to obtain relevance of the said information to the query.
The major issue when it comes to relevance is the current technology the web is based on which is its natural language. Unlike are real world Liberian, a computer can not differentiate between the mean of the same word. For example if your search for “Polo”, a search engine can’t determine whether your searching for the sweet, the car type or the game with out more information, hence if you enter that into the search engine you will get an array of result based on different topics. This is the next stage of web semantic index coming to us in 2013, where the indexer will be able to differentiate between the above example. Don’t ask me how because it is still in the development stage but it is coming.
Related posts:
[...] to run effectively and efficiently. These main elements consist of four main areas such as spiders, indexer, users and a user friendly interface. Without these running effectively it can have a dramatic [...]
Pingback by What does search engines consist of? | Search Engine Optimisation (SEO) Feed from Position Gold Ltd — August 20, 2008 @ 10:31 am
[...] the results you have searched for. As a Spider downloads a small amount of website pages (no search engine indexes more than 16%), Search engines realise that their users want to have the most relative results and [...]
Pingback by What is a Search Engine Spider? | Search Engine Optimisation (SEO) Feed from Position Gold Ltd — August 20, 2008 @ 10:33 am
[...] Google was registered in September 1997 as http://www.Google.co.uk. By the end of 1998, Google had already indexed over 60 million pages and since then has grown [...]
Pingback by What is Google? | Search Engine Optimisation (SEO) Feed from Position Gold Ltd — August 20, 2008 @ 10:41 am