WebsiteGear Logo Log In
New User? Sign Up
About | Contact | FAQ
  Home Content Website Promotion Search Engine Optimization Thursday, July 24, 2008 
POPULAR ARTICLES
Nav Subdomain Configuration - How To Setup A Sub Domain
Nav Website Layout - Tips & Tricks
Nav Round Robin DNS Load Balancing
Nav Domain Configuration - How To Setup A Domain Name
Nav Introduction To Server Load Balancing
Nav Server Load Balancing Methods
Nav Tips On Using SubDomain
Nav Breadcrumb Navigation
FEATURED NEWS | POPULAR NEWS
View More News View More News
SPONSORED LINKS
Print| Email| Save| Discuss| Feeds


About Search Engines - Part II
Published: Friday, August 20, 2004


Indexing the Web Content

Similar to an index of a book, a search engine also extracts and builds a catalog of all the words that appear on each web page and the number of times it appears on that page etc. Indexing of web content is a challenging task assuming an average of 1000 words per web page and billions of such pages. Indexes are used for searching by keywords, therefore, it has to be stored in the memory of computers to provide quick access to the search results.

Indexing starts with parsing the website content using a parser. The parser can extract the relevant information from a web page by excluding certain common words (such as a, an, the - also known as stop words), HTML tags, Java Scripting and other bad characters. A good parser can also eliminate commonly occurring content in the website pages (such as navigation links) so that they are not counted as a part of the page's content.

Once the indexing is completed, the results are stored in memory, in a sorted order. This helps in retrieving the information quickly. Indexes are updated periodically as new content is crawled. Some indexes help create a dictionary (lexicon) of all words that are available for searching. Also a lexicon helps in correcting mistyped words by showing the corrected versions in a search result. A part of the success of the search engine lies in how the indexes are built and used. Various algorithms are used to optimize these indexes so that relevant results are found easily without much computing resource usage.

Storing the Web Content

In addition to indexing the web content, the individual pages are also stored in the search engine's database. Due to cheaper disk storage, the storage capacity of search engines is very huge, and often runs into terabytes of data. However, retrieving this data quickly and efficiently requires special distributed and scalable data storage functionality. The amount of data, that a search engine can store, is limited by the amount of data it can retrieve for search results. Google can index and store about 3 billion web documents. This capacity is far more than any other search engine during this time.

Search Algorithms and Results

Once user enters the search keywords, the search engine's search algorithm looks up the indexes for matches for the search keywords. Once it can match the keywords in the index, the search engine tries to provide the most relevant contents first. This relevance matching is achieved by various search engine algorithms and hence is the bread and butter of search engine's popularity. Among all the search engines on the internet, Google stands out from the rest because it can provide more relevant answers to search queries. The search algorithms, that are used to find the most relevant results from a hay stack of web content, are different from one another. That is why search results, for the same keywords, produces different results on various search engines.

Advanced search engines, like Google, use a relevance ranking system, where each web page is ranked based on various factors such as:
  1. Content analysis : The content of each webpage is evaluated for the keywords based on the number of occurrences, position in the page (such as title, meta tags, heading), font size, proximity between them etc.


  2. Linking structure : The links from an external page or website to this page are analyzed for keywords in the link structure. Also links from a popular website will lead to a higher ranking.


  3. Page ranking :This is a relative ranking of a website based on an algorithm that is used specifically by Google. The page rank denotes the ranking of a web page based on its popularity and quality of links, among various other factors. The basic idea behind a higher page rank is that it is easier to find the website on the internet.

Conclusion

The search results decide the fate of a search engine. Different search engines try to cater to different users. AskJeeves is known to be popular because it provides search results based on descriptive question like queries. Its engine is optimized to parse the user friendly search query for keywords, which are then internally used to perform the search. The user feels as if the question was processed by a human behind the computer. Search engine technology is evolving every day and new researches are carried out to provide more concept and descriptive based search queries. However, the same theory applies - "The search engine, which provides the most relevant results, will rule".
Previous Article About Internet Search Engines
Print| Email| Save| Discuss| Feeds
RELATED ARTICLES
Nav How Internet Search Engines Work
A search engine can provide links to relevant information based on your requirement or query. Learn how a search engine works in order to understand the basics of search engine positioning.
Nav Search Engine Optimization - SEO ideas to avoid
This article discusses some web page optimization tricks that webmasters use but might turn out to be harmful to the website.
Nav Search Engine Optimization - Tips & Tricks
Search Engine Optimization has gained a lot of attention in the last few years. This article will provide some tips on how to optimize your web pages for a better search engine ranking.
Nav Froogle Optimization - Optimizing For Google's Product Search Engine
Merchants who have pursued Froogle maximization for their products have reported significant traffic and associated sales increases. This article lists the important strategies involved.
Nav How to Maximize Paid Search Results
Try these proven strategies for pay-per-click marketing to produce your desired results.
RELATED NEWS
News Post China's Baidu.com says quarterly profit rises nearly 87 percent on growth in advertising
Baidu.com Inc., China's leading search engine, said Thursday its second-quarter profit soared 87 percent over the year-earlie...
News Post Response Mine Interactive Announces 50 Percent Growth
Managers Promoted to Lead Two New Divisions
News Post VCs Show Entrepreneurs the Money: $29.4 Billion in 2007
Entrepreneur Magazine Reveals 8th Annual VC 100
News Post 'The Dark Knight' Soars in Searches on Lycos Network During Record-Breaking Box Office Debut
The Lycos 50(TM) for Week Ending July 19, 2008
News Post GSS Migration Toolkit for Sun Communications Suite Enables Migration from IBM Lotus Notes and Domino to Web 2.0 and SaaS
MOUNTAIN VIEW, Calif. , July 24 /PRNewswire/ -- Global System Services Corporation (GSS) today announced the GSS Migration To...
Submit News | View More NewsView more news
RELATED CLASSIFIED ADS
Classified Ad Low Cost Search Search Engine submissions
Great News! Webcraft.in has become India's Youngest ISO 9001:2000 Certified IT Company. While the In ...
Classified Ad Search Engine Optimization Services
Offering qualified Search Engine Optimization Services that helps your website bring on the top of m ...
Classified Ad Search Engine Optimization Services
Not satisfied with your website's organic rankings in search engines? It may be due to you ...
Classified Ad Search engine optimization services
Search engine optimization is the important element of website. It is the process that makes your si ...
Classified Ad Search Engine Optimization Services
http://www.habinfotech.com/search-engine-optimization.htmlSearch Engine Optimization Services Pro ...
Post Free Ad | View More View more classifieds
RELATED FORUM POSTS
Forum Post Search Engine Optimization Made Easy
It is becoming more and more apparent that high search engine rankings is vital for getting massive ...
Forum Post Yahoo Search Marketing (Overture) Upgrades
Yahoo is upgrading its search marketing platform, which is used for text link ads (formerly Overture ...
Forum Post SEO tricks do to improve their rankings
A: Choose the best and most effective keywords for each page of your site. Optimize each page for 1 ...
Forum Post hosting and searchengine advertisement in one plan
Is there any sites which provide web hosting and search engine advertisement in one plan?Can anyone ...
Forum Post Top 10 Ranking Solutions
If you want your site to be noticed, make sure that your site stands in the top 10 rankings in the t ...
Add New Post | View More View more forum posts


Copyright © 2003-2008 WebsiteGear Inc. All rights reserved.
About | Advertise | Submit Content | Privacy | Agreement | Contact