Neil's News

Search Appliance

13 September 2008

While working in Scotland at Digital Routes, we had a client that needed a search solution for a large number of confidential documents. The first implementation was a disaster: the programmer didn't know what a hash table was and created a buggy O(n3) mess. My rewrite was somewhat more successful, it continues to run stably and scalably six years later. But eventually the client's requirements changed. Now they need to index PDF and Word documents in real-time. Solution: Google's Mini Search Appliance.

[Google Search Appliance box] For some inexplicable reason, Digital Routes' hosting provider is unable or unwilling to enter the IP, subnet mask and gateway information into the appliance's web-based setup. The work-around is to ship the new appliance to me in California so that I can configure it then ship it on to the host in Quebec.

[Google Search Appliance server] The Google Mini is certainly a beautiful machine. The instructions are straight-forward, the cables are colour-coded, and the setup process is well thought out. Then I pushed the power button. Good grief that thing is loud. The cooling fan on that little box sounds like a jet engine. I was half expecting to see a stack of Mach diamonds appear out the back vent. Belatedly I found this note buried in the GSA documentation:

"If you install the search appliance in an office, place it in an area where any noise produced by the cooling fan in the search appliance will not be disturbing."

[Broken images on Google Search Appliance] The admin screens are also uncharacteristically clunky. The main menu has twelve broken image icons on it. Another page drops some (but not all) of your form data if any field is invalid. Using the back link to edit a previous screen takes 82 seconds to process. The user experience is certainly sub-optimal.

Since I happen to be a Google engineer, the GSA source code is available to me. So I've taken the opportunity to submit several patches (for the stuff I can fix) and file a bug report (for the stuff I can't). It's great to be at a company where if you see something wrong you can just reach in and fix it, even if it is not your department. I trust that future users of the appliance will have a better experience. In the mean time, be sure to order some ear plugs when ordering your Google Search Appliance.

This website is my personal blog. The opinions expressed here are my own, and not my employer's. This normally goes without saying, but this post is a bit more pointed than usual. Also note that this post only addresses the one-time network configuration, not the (more important) crawl and search results.

< Previous | Next >

 
-------------------------------------
Legal yada yada: My views do not necessarily represent those of my employer or my goldfish.