COSC_4P82_Assignment_1/lib/beagle-3.0.3/examples/GP/spambase
Brett 4d4e8018fd ignore 2024-02-14 16:22:25 -05:00
..
MSVCPP init 2024-02-13 21:21:51 -05:00
config ignore 2024-02-14 16:22:25 -05:00
spambase init 2024-02-13 21:21:51 -05:00
templates init 2024-02-13 21:21:51 -05:00
AUTHORS init 2024-02-13 21:21:51 -05:00
COPYING init 2024-02-13 21:21:51 -05:00
INSTALL init 2024-02-13 21:21:51 -05:00
Makefile.am init 2024-02-13 21:21:51 -05:00
Makefile.cvs init 2024-02-13 21:21:51 -05:00
Makefile.in init 2024-02-13 21:21:51 -05:00
README init 2024-02-13 21:21:51 -05:00
acinclude.m4 init 2024-02-13 21:21:51 -05:00
aclocal.m4 init 2024-02-13 21:21:51 -05:00
bootstrap ignore 2024-02-14 16:22:25 -05:00
configure ignore 2024-02-14 16:22:25 -05:00
configure.ac init 2024-02-13 21:21:51 -05:00
spambase.kdevelop init 2024-02-13 21:21:51 -05:00

README

+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+

SPAM e-mail database (spambase): Machine learning using strongly-typed GP
with Open BEAGLE

Copyright (C) 2001-2003
by  Christian Gagne <cgagne@gmail.com>
and Marc Parizeau <parizeau@gel.ulaval.ca>

+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+


Getting started
===============

  Example is compiled in binary 'spambase'. Usage options is described by
  executing it with command-line argument '-OBusage'. The detailed help can
  also be obtained with argument '-OBhelp'.

Objective
=========

  Find a program the will successfully predict whether a given e-mail is spam
  or not from some extracted features.

Comments
========

  The evolved programs works on floating-point values AND Booleans values.
  The programs must return a Boolean value which must be true if e-mail is
  spam, and false otherwise. Don't expect too much from this program as
  it is quite basic and not oriented toward performance. It is there mainly
  to illustrate the use of strongly-typed GP with Open BEAGLE.

Terminal set
============

  IN0, IN1, ...  up to IN56, the e-mail features.      [floating-point]
  0 and 1, two Boolean constants.                      [Boolean]
  Ephemeral constants randomly generated in $[0,100]$  [floating-point]

Function set
============
  AND               [Inputs: Booleans,        Output: Boolean]
  OR                [Input:  Boolean,         Output: Boolean]
  NOT               [Inputs: Booleans,        Output: Boolean]
  +                 [Inputs: floating-points, Output: floating-point]
  -                 [Inputs: floating-points, Output: floating-point]
  *                 [Inputs: floating-points, Output: floating-point]
  /                 [Inputs: floating-points, Output: floating-point]
  <                 [Inputs: floating-points, Output: Booleans]
  ==                [Inputs: floating-points, Output: Booleans]
  if-then-else      [1st Input: Boolean, 2nd & 3rd Input: floating-points,
                     Output: floating-point]

Fitness cases
=============

  A random sample of 400 e-mails over the database, re-chosen for
  each fitness evaluation.

Hits
====

  Number of correct outputs obtained over the 400 fitness cases.

Raw fitness
===========

  Ignored (always 0).


Standardized fitness
====================

  Rate of correct outputs over the fitness cases where
  the desired output was 0 (non-spam).

Adjusted fitness
================

  Rate of correct outputs over the fitness cases where
  the desired output was 1 (spam).

Normalized fitness
==================

  Rate of correct outputs obtained over all the 400 fitness cases.

Stopping criteria
=================

  When the best individual scores 400 hits or when the evolution reaches
  the maximum number of generations.

Reference
=========

  Machine learning repository,
  http://www.ics.uci.edu/~mlearn/MLRepository.html