##### ### ## ### ######## ### ## ### ###TM ####### ####### ####### ######## ####### ####### ####### ### ### ### ### ### ### ### ### ### ## ## ## ## ## ### ## ## ## ## ## ### ### ### ## ### ### ### ## ### ## ####### ####### ## ######## ### ####### ## ####### ##### ### ## ######## ### ### ## #### ### ##### Welcome to corzoogle! corz.org 2004-> itstory.. aka 'version history'.. aka 'changes'.. last updated: MONDAY Oct 29, 2007 v0.9.5b [current live testing beta] * improved URL encoding in results. * fixed entity encoding in result snippets, even really bad HTML pages should produce validating snippets. You still need to go fix your dodgy pages, though! v0.9.4b * improvements to the mailer functions, should be more robust under various server setups * fixed stray tag I introduced in the last version. v0.9.3b * minor stylistic improvements * improved mail notification * fixes in distributed version (removed certain corz.org localizations) v0.9.2b * fixed potential php notices for missing files (optional bot agents file so the mailer knows who *not* to notify you about). v0.9.1b you can now configure the location of the hits file. if you cannot create a writable file in corzoogle's directory, perhaps you can use the system temp directory, or something like that. You can now add a notice to corzoogle, which will display under the main search form. you can use it to tell your users anything. improved the recent hits stylings, with its own css class, and fully- configurable character widths, you can put it anywhere on the page, and do anything with it. I also improved its arithmetic, and it should now remain within whatever limit you have set, e.g. 80 characters. tip: use half the character value as an em value for the css class' width, so for 80 characters, we might use.. .recent-hits { width: 40em; float: right; text-align: right; font-size: small; } corzoogle will now perform intelligent transformation of any hits from your corzblog installation. It can also create "flat links", if you use and prefer that. v0.9b corzoogle now outputs 100% Strict XHTML compliant web code, in nice
s with css controlling all the positional and style elements. Have fun! v0.8.4.2b fixed occasional issue with snippets being *much* longer than they should have been. This version hasn't been officially released as it is in a state of flux, though runs happily here at corz.org and can be had by a simple email. v0.8.4b started integrating the general site xhtml upgrade. v0.8rc3 [current release] a few minor surface changes. everything is nice and stable. v0.8rc2 fixed the bug with the search between tags. I really don't know how that > got deleted! but it was a while ago! certain documents with plain old tags were being ignored in the search results. this only applied to case-sensitive search-between tags, the default setting. results snippets will now always begin at proper word boundries. corzoogle will now automatically select the query terms in the input box, so you can begin another search immediately, if you need to, by simply typing. a small but significant improvement. fixed the form focus error some browsers gave by basically just removing the second form altogether, it just caused problmes anyhow. v0.7.5b6 - v0.8rc1 minor changes, too busy to document! v0.7.5b5 altered slightly the way corzoogle strips undesirable characters from the input query. instead of replacing them with a space, they are simply removed. a search for need w[[ords would search for need+words, previously we would have got results for need+ords, the "w" having been removed as a single letter. added "extension mangling".. Let's say you have a php site, and you keep your content in files ending .htm, but if they are accessed directly, you redirect (with htaccess) to the php container page. now you can have corzoogle alter the extension for you, so the link goes directly to the php file. a hit for "darkstat.htm" would link directly to "darkstat.php" another use for mangling is .comment files. rather than the raw comment file, you want the user to load the page it is attached to. so a hit for "darkstat.php.comment" will present a link directly to "darkstat.php#comments". the "real" filename, as always, is available by hovering the mouse over the title. this feature will be disabled by default, to enable, simply uncomment (remove the "//" from in front of) the line beginning.. //$mangle = array(... put the ".. " back in the end of snippet pieces in multiple query searches, no more weird words in the snippets. added a preference for the length of the "latest hits" links. the default is enough hits to fill 81 chracters, but you can change that now. removed some limitations of the search_between feature. will be more accurate when using mixed start and end tags. if your end tag doesn't exist, but the start tag did, corzoogle scans right to the end of the document. you could even enter a bogus tag to have corzoogle always do that. v0.7.5b4 added a preference for "embedding", for folks that want to run corzoogle from inside another page. corzoogle will return the results in the same space. added some notes about the $search_between preference, and how it relates to $search_in_tags. added multiple corzblog searching. previously we were limited to a single blog installation. you no longer need to specify this, corzoogle will return correct results for any number of blog installations automatically. I spotted this limitation after I started my devblog. if you use a name other than "arc" for your corzblog archives, this functionality will break, though you could edit in whatever name you do use. (why would anyone want to change that?) v0.7.5b3 fixed a bug I introduced in 0.7.5b1, where certain queries, matching words right at the end of the text could produce unexpected results, usually putting the entire page into the preview. fixed a minor bug where certain documents could potentially be scored twice for their filename. added facility to set initial scoring weight in a multiple-query term. added switch to force corzoogle to only search documents containing your chosen $search_between tags, particularly for folks that use custom $search_between tags. v0.7.5b2 added "notify" facility for searches. you can be emailed every time someone searches your site! you might want to put a filter on that! this facility is disabled by default, and completely untested; please report back any interesting findings. $search_between tags can now be case-insensitive, if you need that. v0.7.5b1 small performance improvements when searching for queries with -not words "search between" tags can be case-sensitive (default) or case-insensitive, the latter is marginally slower, but potentially useful. v0.7.4 added (ugly) highlights to "exact" word matches. they are disabled by default (they were sort of in there already, some one wanted them working) This looks like someone drew a marker pen through all the exact hits *eew* added a switch for searching inside html tags. I'll set this to false, which produces neater reults. if you are using coroogle to "grep" your site for editing, or searching web-development technical documentation, or such-like, you'll likely want to set this to true. preview snippets can now be presented with the query terms in context, in other words, the query term appears some way into the text, and not right at the start of the snippet. aids comprehension. of course, how far in the term appears, is configurable in the prefs. corzoogle now attempts to create the "recent hits" file if it's not there, which it did originally (dunno how that got switched off). There's still a message if this wasn't possible (likely). fixed a wee bug where html tags were being passed to the "past-hits" and users could create interesting html structures at the foot of results pages. added a few "googies" to the distribution; customised Apache manual index page with built-in corzoogle search, is one. v0.7.3.6 added < and > to the removed characters. You won't get unexpected html structures in the results now. Searching for "