Interview Question in SQL Server Indexing


 

Interview Question :: What can a web crawler read? Can they read cf include tags


I would like to know what google and yahoo webcrawlers can read.

Teh reason is, I have a website that has many html anchor text links but many more things on the site are much more complicated.

The navigation is javascript, the press releases and other important information are in a SQL database and that database is linked to the page using CF includes.

So what I really want to knowis :can a webcrawler folow the cf includes to the database information on a html page and index those links?

I am interested in reading some source information so please site your sources. Also, please explain things in plain english, not programming lingo.

Thank you!

10 points to best answer!
Answers to "What can a web crawler read? Can they read cf include tags"
RE: What can a web crawler read? Can they read cf include tags?

the only data that a webcrawler can read are those that you see when you do a view source on your browser. the cf tag is processed by the application server to generate the html page which is then sent out to the browser. so no, the webcrawler cannot read the cf tags.
 
Vote for this answer ::  
RE: What can a web crawler read? Can they read cf include tags?

The problem lies within your wording of the question. You are asking a very vague question of "What can a web crawler do?" and well, it can do anything the programmers want it to do. In general, Google and Yahoo for instance, there are certain tags and commands you can give to spiders (another name for web crawlers) to ignore pages, or other little things. Other than that though they really only provide basic functions such as reading HTML for links, indexing the text on a page, and following the links. They'll have similar ones for images of course. Your best bet is to look at google and yahoo and see if they list any documentation on their spiders, but don't bet on it being too detailed as it is a competitive market.
 
Vote for this answer ::  
Update Alert Setting