|
Search Appliance13 September 2008 While working in Scotland at Digital Routes, we had a client that needed a search solution for a large number of confidential documents. The first implementation was a disaster: the programmer didn't know what a hash table was and created a buggy O(n3) mess. My rewrite was somewhat more successful, it continues to run stably and scalably six years later. But eventually the client's requirements changed. Now they need to index PDF and Word documents in real-time. Solution: Google's Mini Search Appliance.
"If you install the search appliance in an office, place it in an area where any noise produced by the cooling fan in the search appliance will not be disturbing."
Since I happen to be a Google engineer, the GSA source code is available to me. So I've taken the opportunity to submit several patches (for the stuff I can fix) and file a bug report (for the stuff I can't). It's great to be at a company where if you see something wrong you can just reach in and fix it, even if it is not your department. I trust that future users of the appliance will have a better experience. In the mean time, be sure to order some ear plugs when ordering your Google Search Appliance. This website is my personal blog. The opinions expressed here are my own, and not my employer's. This normally goes without saying, but this post is a bit more pointed than usual. Also note that this post only addresses the one-time network configuration, not the (more important) crawl and search results. |