<?xml version="1.0" encoding="utf-8" ?><rss version="2.0" xml:base="https://www.webmaster-forums.net/crss/node/1019021" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title></title>
    <link>https://www.webmaster-forums.net/crss/node/1019021</link>
    <description></description>
    <language>en</language>
          <item>
    <title></title>
    <link>https://www.webmaster-forums.net/web-database-development/how-search-engines-work#comment-1113578</link>
    <description> &lt;p&gt;Woh, neato!&lt;/p&gt;
&lt;p&gt;The ID of this TWF thread is 19021.  The URL of that doc includes 1921.&lt;/p&gt;
 </description>
     <pubDate>Wed, 04 Sep 2002 18:11:13 +0000</pubDate>
 <dc:creator>Mark Hensler</dc:creator>
 <guid isPermaLink="false">comment 1113578 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/web-database-development/how-search-engines-work#comment-1113577</link>
    <description> &lt;p&gt;Looks like I have something to read for my boring 1-4PM class today. &lt;img src=&quot;https://www.webmaster-forums.net/misc/smileys/wink.png&quot; title=&quot;Wink&quot; alt=&quot;Wink&quot; class=&quot;smiley-content&quot; /&gt;&lt;/p&gt;
 </description>
     <pubDate>Wed, 04 Sep 2002 18:08:14 +0000</pubDate>
 <dc:creator>Mark Hensler</dc:creator>
 <guid isPermaLink="false">comment 1113577 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/web-database-development/how-search-engines-work#comment-1113575</link>
    <description> &lt;p&gt;hey thanks for the reply Peter, and sorry i didnt acknowledge it sooner.  &lt;/p&gt;
&lt;p&gt;I ran across this purely by chance last night, &lt;a href=&quot;http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm&quot; class=&quot;bb-url&quot;&gt;The Anatomy of a Large-Scale Hypertextual Web Search Engine&lt;/a&gt; which was written by the founders of Google.  I found it fascinating, and am now enlightened.&lt;/p&gt;
 </description>
     <pubDate>Wed, 04 Sep 2002 17:13:44 +0000</pubDate>
 <dc:creator>ROB</dc:creator>
 <guid isPermaLink="false">comment 1113575 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/web-database-development/how-search-engines-work#comment-1112681</link>
    <description> &lt;p&gt;I&#039;m not a search engine expert, but I would imagine what the big search engines do, isn&#039;t terribly different from what you or I would do to search our site, just taken to the next level. All the data would be stored in efficiently indexed tables (system level) in database farms and the data would be extracted by efficient search algorithms.&lt;/p&gt;
&lt;p&gt;They might do stuff like ignore common words (the, and, etc) to improve search speed. They also might not re-execute similar searches, that is, if I logged into Google and searched for &quot;Thai Food&quot;, then you log in a few hours later and search for &quot;Thai Food&quot;, you wouldn&#039;t be searching the whole database again, just getting the results I already fetched (in html). It would only re-execute the query if it is older than a set amount of time (say a day).&lt;/p&gt;
&lt;p&gt;As for set-up, that really depends on what you&#039;re storing, but a basic set-up could be:&lt;/p&gt;
&lt;p&gt;SearchMain table:&lt;br /&gt;
SearchMainID (PK Identity)&lt;br /&gt;
PageName varchar(50)&lt;br /&gt;
PageTitle varchar(50)&lt;br /&gt;
PageIndexDate smalldatetime&lt;/p&gt;
&lt;p&gt;SearchItem table:&lt;br /&gt;
SearchItemID (PK Identity)&lt;br /&gt;
ItemMainID (from SearchMain table)&lt;br /&gt;
PageContent (text)&lt;/p&gt;
&lt;p&gt;I separated into 2 tables to improve performance, there may be times when only the main table needs to be used, by separating the PageContent info into another table performance is kept high (by avoiding the cursor having to scan through a slow text field when it doesn&#039;t have too).&lt;/p&gt;
&lt;p&gt;Have fun!&lt;/p&gt;
 </description>
     <pubDate>Mon, 19 Aug 2002 19:52:54 +0000</pubDate>
 <dc:creator>Peter J. Boettcher</dc:creator>
 <guid isPermaLink="false">comment 1112681 at https://www.webmaster-forums.net</guid>
  </item>
  </channel>
</rss>
