Scorpion is an application built on top of Gwen and a Pears database that will classify a web page according to the classification data in the database. Scorpion is dependent on classes from the Pears distribution, the Gwen distribution and the Dbutils distribution. These can be found at http://purl.org/oclc/research/software Quick Test Instructions: 1) Make sure the following scripts' sh-bang lines point to bash on your system. Executing 'which bash' will tell you where it is. The defaults are shown in parentheses. -) scorpion/setup.sh (#!/bin/bash) -) scorpion/PDB/LCC/test.sh (#!/bin/bash) -) scorpion/PDB/LCC/makeScorpionPDB.sh (#!/bin/bash) 2) Run setup.sh. This script changes some pathnames in the configuration files to be correct for where you've installed Scorpion. It also creates a file with some common shell variables set. This file will be used by some of the other scripts. 3) Put the Jakarta Commons Logging jar file in the scorpion/lib directory 4) cd into the scorpion/PDB/LCC directory. Run 'makeScorpionPDB.sh lccSample' . This is a fairly CPU intensive program, so you may want to shut down other large applcations first. 5) Run test.sh to test the demo database. It will classify the file scorpion/demo/scorpion.input. An HTML fragment with the results of the classification will be placed in scorpion/demo/scorpion.output.html See docs/index.html for more detailed instructions on how to get the application running with your classification database. Devon Smith smithde@oclc.org Copyright (c) 2000-2002 OCLC Online Computer Library Center, Inc. and other contributors. All rights reserved. See the included LICENSE file for more information.