Unix administration - Training spamassassin with previously scanned messages


Author Training spamassassin with previously scanned messages
pbeckhelm@gmail.com

2007-02-20, 1:26 am

Greetings folks.

Quick question I can't seem to find the answer to anywhere else:

Can you train SpamAssassin with messages that have already been
scanned by SA and that have SA headers in them?

Ex:

>From user@example.com Wed Dec 27 18:04:42 2006

Received: from localhost by host.example.net
    with SpamAssassin (2.63 2004-01-11);
    Wed, 27 Dec 2006 18:04:42 -0800
From: user@example.com
To: undisclosed-recipients:;
Subject: V|@@GRA!!!!
Date: Wed, 27 Dec 2006 18:04:29 -0800 (PST)
Message-Id: <20061228020429.44D9CA0@host.example.net>
X-Spam-Flag: YES
X-Spam-Checker-Version: SpamAssassin 2.63 (2004-01-11) on
    host.example.net
X-Spam-Level: *********
X-Spam-Status: Yes, hits=9.6 required=5.0 tests=DRUGS_ERECTILE,
    DRUGS_ERECTILE_OBFU,FORGED_YAHOO_RCVD,MSGID_FROM_MTA_SHORT,
    NO_REAL_NAME,PLING_PLING,SARE_OBFU_VIAGRA autolearn=no
    version=2.63
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----------=_4593263A.ECE2B3E4"

This is a multi-part message in MIME format.

------------=_4593263A.ECE2B3E4
Content-Type: text/plain
Content-Disposition: inline
Content-Transfer-Encoding: 8bit

Spam detection software, running on the system "host.example.net", has
identified this incoming email as possible spam. The original message
has been attached to this so you can view it (if it isn't spam) or block
similar future email. If you have any questions, see
postmaster@example.com for details.

Content preview: testing [...]

Content analysis details: (9.6 points, 5.0 required)

 pts rule name              description
---- ---------------------- --------------------------------------------------
 0.3 NO_REAL_NAME           From: does not include a real name
 1.7 SARE_OBFU_VIAGRA       BODY: found apparent obfuscation of word
                            used in spam
 3.3 MSGID_FROM_MTA_SHORT   Message-Id was added by a relay
 0.5 FORGED_YAHOO_RCVD      'From' yahoo.com does not match 'Received'
                            headers
 1.5 DRUGS_ERECTILE_OBFU    Obfuscated reference to an erectile drug
 1.0 DRUGS_ERECTILE         Refers to an erectile drug
 1.3 PLING_PLING            Subject has lots of exclamation marks



------------=_4593263A.ECE2B3E4
Content-Type: message/rfc822; x-spam-type=original
Content-Description: original message before SpamAssassin
Content-Disposition: inline
Content-Transfer-Encoding: 8bit

Return-Path: <user@example.com>
X-Original-To: domains@example.com
Delivered-To: domains@localhost.example.com
Received: from host2.example.com (unknown [172.27.2.210])
    by host.example.net (Postfix) with SMTP id 44D9CA0
    for <domains@example.com>; Wed, 27 Dec 2006 18:04:29 -0800 (PST)
Subject: V|@@GRA!!!!
Message-Id: <20061228020429.44D9CA0@host.example.net>
Date: Wed, 27 Dec 2006 18:04:29 -0800 (PST)
From: user@example.com
To: undisclosed-recipients:;
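
For reference, the report wrapper in the example above carries the untouched original as a message/rfc822 part tagged x-spam-type=original. Below is a minimal sketch of pulling that part back out by hand, using only Python's standard email package. The function name unwrap_sa_report is made up for illustration, and this is not SpamAssassin's own code, just one way to look at what is inside the wrapper.

```python
from email import message_from_string
from email.message import Message


def unwrap_sa_report(raw: str) -> Message:
    """Return the original message from a SpamAssassin report wrapper.

    If no part tagged x-spam-type=original is found, the parsed input
    message is returned unchanged. (Hypothetical helper, for illustration.)
    """
    msg = message_from_string(raw)
    for part in msg.walk():
        if (part.get_content_type() == "message/rfc822"
                and part.get_param("x-spam-type") == "original"):
            payload = part.get_payload()
            # The parser stores a message/rfc822 body as a one-element
            # list containing the nested Message object.
            if isinstance(payload, list) and payload:
                return payload[0]
    return msg
```

The returned object can be turned back into text with as_string() if the bare original is needed elsewhere. Strip the leading mbox ">From" line before parsing, since it is not a header.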

Rainer Frey

2007-02-20, 7:18 am

pbeckhelm@gmail.com wrote:

> Greetings folks.
>
> Quick question I can't seem to find the answer to anywhere else:
>
> Can you train SpamAssassin with messages that have already been
> scanned by SA and that have SA headers in them?


You can. From the sa-learn man page (SA 3.1.x):

SpamAssassin remembers which mail messages it has learnt already, and will
not re-learn those messages again, unless you use the --forget option.
Messages learnt as spam will have SpamAssassin markup removed, on the fly.

If you make a mistake and scan a mail as ham when it is spam, or vice
versa, simply rerun this command with the correct classification, and the
mistake will be corrected. SpamAssassin will automatically 'forget' the
previous indications.
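
In practice this means pointing sa-learn straight at the stored mail. Here is a rough sketch in Python, assuming the caught messages sit in an mbox file or a directory. The helper names and the example path are hypothetical; the flags --spam, --ham, --forget and --mbox are the ones listed in the sa-learn man page.

```python
import subprocess
from typing import List


def sa_learn_command(kind: str, path: str, mbox: bool = False) -> List[str]:
    """Build an sa-learn command line (hypothetical helper).

    kind is "spam", "ham" or "forget"; set mbox=True when path is an
    mbox file rather than a single message or a directory of messages.
    """
    if kind not in ("spam", "ham", "forget"):
        raise ValueError("kind must be 'spam', 'ham' or 'forget'")
    cmd = ["sa-learn", "--" + kind]
    if mbox:
        cmd.append("--mbox")
    cmd.append(path)
    return cmd


def learn(kind: str, path: str, mbox: bool = False) -> None:
    """Run sa-learn; raises CalledProcessError if it exits non-zero."""
    subprocess.run(sa_learn_command(kind, path, mbox), check=True)


if __name__ == "__main__":
    # Example path is an assumption; substitute your own spam folder.
    print(" ".join(sa_learn_command("spam", "/path/to/caught-spam", mbox=True)))
```

To correct a wrong classification, run it again with the other kind, as the man page excerpt describes.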
setuidzero@gmail.com

2007-02-20, 1:17 pm

On Feb 19, 11:46 pm, Rainer Frey <rainerf...@web.de> wrote:
> pbeckh...@gmail.com wrote:
>
> You can:
>
> from sa-learn man page (SA 3.1.x)
> SpamAssassin remembers which mail messages it has learnt already, and will
> not re-learn those messages again, unless you use the --forget option.
> Messages learnt as spam will have SpamAssassin markup removed, on
> the fly.
>
> If you make a mistake and scan a mail as ham when it is spam, or vice
> versa, simply rerun this command with the correct classification, and the
> mistake will be corrected. SpamAssassin will automatically 'forget' the
> previous indications.


Fabulous, problem solved! Much appreciated
