Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: CCC Search Engine alpha (some stats)

Author: Joshua Shriver

Date: 10:51:28 12/02/05

Go up one level in this thread


Perl for the actual processing and PHP for the front end.

My biggest problem so far has been with the zip format. I can read/write
.tar.bz2 fine (and 1/4 the size of the zips) but I want the code to be
compatible with the files that are stored in the CCC archive.

I'm not sure if this'll ever be a website, my first idea was to create a Linux
(*BSD) offline viewer with search capabilities, then there was the discussion of
a new online version so figured I'd just hit 2 birds with 1 stone.

Hopefully in Jan I should have enough to get my own website started and I can
host it there or give my code to CCC for local usage (whichever works).

All of the posts till this month are about 1.2gigs which isn't that bad (or
220megs if tar.bz2)

The actual PHP lookup code won't be that hard to write, the real guts are in the
indexing program which in itself has to do a little AI and A LOT of string
processing (which is why I used Perl).

Once I get something more stable I'll gladly send you a copy.
BTW thanks :) if it wasn't or your ftp site I wouldnt have been able to get the
old posts, I leeched about 500megs worth of zips to get the archive.

-Josh

>
>What technology are you using for the search engine part of it?
>What technology are you using for the front end?



This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.