<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Square Galaxy &#187; Search Engines</title>
	<atom:link href="http://squaregalaxy.com/tag/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://squaregalaxy.com</link>
	<description>A blog by Jacob</description>
	<lastBuildDate>Thu, 26 Aug 2010 19:14:39 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Aggregate your searches with OpenSearch</title>
		<link>http://squaregalaxy.com/tech/aggregate-your-searches-with-opensearch/</link>
		<comments>http://squaregalaxy.com/tech/aggregate-your-searches-with-opensearch/#comments</comments>
		<pubDate>Fri, 10 Nov 2006 09:38:03 +0000</pubDate>
		<dc:creator>Jacob</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[aggregation]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[opensearch]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://jacob.peargrove.com/tech/2006/search-engines/aggregate-your-searches-with-opensearch/</guid>
		<description><![CDATA[The rise of RSS and Atom feeds brought increased usage of the word, aggregate, which meant to gather many articles together to one destination. This turns out to be a really great thing. No longer do I have to go from site to site to check on all my favorite news. I can use an [...]]]></description>
			<content:encoded><![CDATA[<p>The rise of <acronym title="Really Simple Syndication">RSS</acronym> and Atom feeds brought increased usage of the word, <em>aggregate</em>, which meant to gather many articles together to one destination.  This turns out to be a really great thing.  No longer do I have to go from site to site to check on all my favorite news.  I can use an aggregator, and bring all the news from all my favorite sites together to one point.  Not only can I do this with news, but also blog posts, photos, audio clips, events, alerts, and the list goes on.<br />
<span id="more-718"></span><br />
Now the time has arrived to bring aggregation to search.  Why should I have to go to all of my various different search engines to search for a term, when I should be able to have all the results from all my my favorite search engine aggregated into one single application?</p>
<p>Search result aggregation is not a new thing.  The <a href="http://www.metacrawler.com">MetaCrawler search engine</a> has been doing this sort of thing for years.  Unfortunately, the search information is bounded and limited to just a couple of pre-determined set of engines.</p>
<p>OpenSearch is a set of standards which opens up search aggregation.  Any search engine can share their search results using the OpenSearch standards, so that search aggregators can pull it all together. All the OpenSearch information and documentation is actually pretty well written (unlike other specifications which are dozens of pages long and too difficult to get through) and are available at <a href="http://www.opensearch.org">www.OpenSearch.org</a>.</p>
<p>One example is that blog searches can be aggregated. I operate <a href="http://ldssearch.com/~blog">a</a> <a href="http://jacob.peargrove.com/blog">few</a> <a href="http://jacob.peargrove.com/tech">different</a> <a href="http://jacob.peargrove.com/religion">blogs</a>, each one with its own search function.  If I were to want to look for the word &#8220;oregon&#8221; in my blogs, I would have to visit each one individually and use their individual search engines.   I could try using a third party search engine limited to the scope of only my blogs, but if that search engine hasn&#8217;t visited all the posts in all my blogs, then I&#8217;m not getting great search results back.  Now suppose that each of my blogs&#8217; search functions supported OpenSearch.  Then I could simply use a search aggregator and search all of them at once, and all their results displayed all at once.  Bingo.</p>
<p>OpenSearch version 1.1 uses standard technology and is composed of two main parts.  There is the OpenSearch description file which tells an aggregator or search application how to perform a search.  This is what Firefox 2 and IE7 use for their search plugin format.  The description file is <acronym title="eXtensible Markup Language">XML</acronym> with a fairly simple vocabulary.  The second part is the search results, which uses <acronym title="Really Simple Syndication">RSS</acronym> and Atom format standards along with a few custom tags to return the search data.  Using <acronym title="Really Simple Syndication">RSS</acronym> and Atom makes it easier to write search clients and servers which use OpenSearch because there are already libraries for the standards and developers are already familiar with the technology.</p>
<p>So when you use a search aggregator or application, such as A9.com, the application uses the description files to properly format search queries.  The queries are sent to the search servers, which return the search results as an Atom or <acronym title="Really Simple Syndication">RSS</acronym> formated file.  The application then parses and displays in some meaningful way the search results from the various sources.</p>
<p>In the past we were limited to using either one generic monstrous search engine like Google, or bouncing around to many different tiny little specific search engines.  With OpenSearch, we should be able to instead search many specific search engines without having to bounce around.</p>]]></content:encoded>
			<wfw:commentRss>http://squaregalaxy.com/tech/aggregate-your-searches-with-opensearch/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Another biggest month ever</title>
		<link>http://squaregalaxy.com/blogging/another-biggest-month-ever/</link>
		<comments>http://squaregalaxy.com/blogging/another-biggest-month-ever/#comments</comments>
		<pubDate>Wed, 27 Sep 2006 06:06:13 +0000</pubDate>
		<dc:creator>Jacob</dc:creator>
				<category><![CDATA[Blogging]]></category>
		<category><![CDATA[dance]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://jacob.peargrove.com/blog/2006/general/another-biggest-month-ever/</guid>
		<description><![CDATA[Last month I announced that August 2006 was the biggest month ever for this website. August&#8217;s reign didn&#8217;t even last a month as September, with 4 days left in the month, just passed up August. Well over 1000 users came in so far this month through Google alone, which is a 30% increase over all [...]]]></description>
			<content:encoded><![CDATA[<p>Last month I announced that <a href="http://jacob.peargrove.com/blog/2006/general/biggest-month-ever/">August 2006 was the biggest month ever</a> for this website.  August&#8217;s reign didn&#8217;t even last a month as September, with 4 days left in the month, just passed up August.  Well over 1000 users came in <em>so far</em> this month through Google alone, which is a 30% increase over all of last month.  The <a href="http://jacob.peargrove.com/blog/category/photos/">photos category</a> was the most popular, while posts about <a href="http://jacob.peargrove.com/blog/2004/technical/search-engines/">search engines</a> and <a href="http://jacob.peargrove.com/blog/2005/sociality/halloween-dances/">Halloween dances</a> were the most popular posts.</p>
<p>Thank you all very much for reading.  I appreciate every visitor, including the rude ones, but not really the ones that spam.</p>]]></content:encoded>
			<wfw:commentRss>http://squaregalaxy.com/blogging/another-biggest-month-ever/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Introducing LDSsearch.com</title>
		<link>http://squaregalaxy.com/religion/introducing-ldssearchcom/</link>
		<comments>http://squaregalaxy.com/religion/introducing-ldssearchcom/#comments</comments>
		<pubDate>Fri, 19 May 2006 07:58:09 +0000</pubDate>
		<dc:creator>Jacob</dc:creator>
				<category><![CDATA[Religion]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://jacob.peargrove.com/blog/2006/technical/internet/introducing-ldssearchcom/</guid>
		<description><![CDATA[Update:This search engine no longer exists as it wasn&#8217;t fulfilling its goals. For work, I&#8217;ve been working on a new search engine called LDSsearch.com. It is finally to the point where I feel like I can start showing it off and get feedback on it. This search engine only indexes Internet content which has been [...]]]></description>
			<content:encoded><![CDATA[<p><ins datetime="2008-09-24T16:18:29+00:00">Update:</ins>This search engine no longer exists as it wasn&#8217;t fulfilling its goals.</p>
<p>For <a href="http://moregoodfoundation.org">work</a>, I&#8217;ve been working on a new search engine called <a href="http://LDSsearch.com">LDSsearch.com</a>.  It is finally to the point where I feel like I can start showing it off and get feedback on it.</p>
<div style="float:left;margin:0.8em;border-style:solid;border-width:1px;border-color:black;"><img id="image293" src="http://squaregalaxy.com/wp-content/uploads/2006/05/ldssearchshot.jpg" alt="LDSsearch.com screenshot"/></div>
<p>This search engine only indexes Internet content which has been identified to be <acronym title="Latter-day Saints">LDS</acronym>-friendly material.  The idea is to help members, and investigators to find accurate information about the mormon religion without having to sort through anti-mormon sites.</p>
<p><a href="http://LDSsearch.com">LDSsearch.com</a> uses the open-source search engine, <a href="http://lucene.apache.org/nutch/">Nutch</a>. Nutch made it faster and easier to set up a search engine. We&#8217;re using version 0.8-dev, which is in need of some very good documentation.</p>
<p>Please take a look at it, and tell me what you think by commenting on this post.</p>]]></content:encoded>
			<wfw:commentRss>http://squaregalaxy.com/religion/introducing-ldssearchcom/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Search Engines</title>
		<link>http://squaregalaxy.com/tech/search-engines/</link>
		<comments>http://squaregalaxy.com/tech/search-engines/#comments</comments>
		<pubDate>Sat, 20 Nov 2004 08:03:20 +0000</pubDate>
		<dc:creator>Jacob</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[soft]]></category>

		<guid isPermaLink="false">http://jacob.peargrove.com/blog/?p=32</guid>
		<description><![CDATA[I&#8217;ve been researching different search engines that can be used to index a limited set of websites. I&#8217;ve come up with these four as possible candidates: ht://Dig does the job, but it has problems updating an index, and perhaps with database corruption. It will search pdf, word, and other documents if given proper parsers. The [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been researching different search engines that can be used to index a limited set of websites.  I&#8217;ve come up with these four as possible candidates:</p>
<p><a href="http://htdig.sf.net">ht://Dig</a> does the job, but it has problems updating an index, and perhaps with database corruption.  It will search pdf, word, and other documents if given proper parsers.</p>
<p><a href="http://tsep.sourceforge.net/">The Search Engine Project</a> is a search engine that runs from php and indexes into MySQL (v4+).</p>
<p><a href="http://www.sitellite.org/index/siteSearch">SiteSearch</a> uses Lucene (java based, from Apache Jakarta) as the indexer, has a php front end, and provides search and click statistics.</p>
<p><a href="http://www.bitmechanic.com/projects/spindle/">Spindle</a> also uses Lucene.   I don&#8217;t know much more than that.</p>]]></content:encoded>
			<wfw:commentRss>http://squaregalaxy.com/tech/search-engines/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
