venta: (Default)
[personal profile] venta
For those of you who are still occasionally amused by auto-thesaurusised spam. I know I am :)

These lozenges are merely equal standard lozenges but they are specially developed to be spoiled and soluble below the lingua. The pills is took up at the rima oris and goes into the bloodstream in real time alternatively of moving through with the breadbasket. This effects in a quicker much more mighty effect which run up to 44 hours!

Date: 2005-02-24 10:58 am (UTC)
From: [identity profile] mejoff.livejournal.com
Auto-thesaurusised? It looks more like it was badly translated from [insert oriental language here]. Some Google searching should reveal vast amounts of such, mainly instruction manuals for electronics. I love them, they're my favourite! :)

Date: 2005-02-24 11:02 am (UTC)
From: [identity profile] mejoff.livejournal.com
www.engrish.com

Date: 2005-02-24 11:04 am (UTC)
From: [identity profile] venta.livejournal.com
From less convoluted examples, I'd come up with the idea that it might be auto- (or just badly-)thesaurusised to avoid simplistic spam filters.

If I get mail, of only a few sentences, which contains the words "tablet" and "hard" and "penis", then it's a reasonable bet it's spam. If it contains "pill" and "rock" and "member", then the filter doesn't trigger. So, I then add those words to my filter... so the spammer starts using different synonyms. Which get less and less synonymous as time goes on.
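
A minimal sketch of that cat-and-mouse, in Python. Everything here is an assumption made for illustration: the function name `is_spam`, the `max_words` cut-off standing in for "only a few sentences", and the two example messages. It is not how any particular mail filter works.

```python
import re

def is_spam(text, blocked_combos, max_words=60):
    """Return True if a short message contains every word of any blocked combination."""
    words = re.findall(r"[a-z']+", text.lower())
    if len(words) > max_words:
        return False  # only short messages are judged (assumed cut-off)
    present = set(words)
    # A combination matches when all of its words appear in the message.
    return any(combo <= present for combo in blocked_combos)

blocked = [{"tablet", "hard", "penis"}]
original = "This tablet keeps your penis hard"
reworded = "This pill keeps your member rock solid"

print(is_spam(original, blocked))  # True
print(is_spam(reworded, blocked))  # False: the synonyms slip past

# The filter owner adds the new words, and the cycle starts again.
blocked.append({"pill", "rock", "member"})
print(is_spam(reworded, blocked))  # True
```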

I haven't done any looking into this, it was just my theory. Admittedly, though, at least some of the above example shows signs of bad auto-translation as well.

Date: 2005-02-24 11:07 am (UTC)
From: [identity profile] lanfykins.livejournal.com
I am reminded of the old BabelFish game.

Comes, friendly bomb, and autumn now is not the grass grazes the cow group in its not suitable manner there mire, the death

Date: 2005-02-24 11:08 am (UTC)
From: [identity profile] venta.livejournal.com
Aye, I was just remembering that. I may knock up a quiz based on that principle when I've the time.

Date: 2005-02-24 11:50 am (UTC)
From: [identity profile] wimble.livejournal.com
I'm using (well, I was: my connection to home has fallen over!) SpamAssassin, which has a Bayesian filter in it. That does essentially the same job as you're describing, in adding specific words to the spam filter.

But it has the advantage of adding all the words in the spam message, giving them automagically adjusted weightings. So even lingua goes in.
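
As a rough illustration of "all the words, with automatically adjusted weightings", here is a toy Bayesian-style scorer in Python. It is a sketch under stated assumptions, not SpamAssassin's implementation: the class `TinyBayes`, the add-one smoothing, and the training messages are all made up for the example.

```python
import math
import re
from collections import Counter

class TinyBayes:
    """Toy word-weighting filter; hypothetical, for illustration only."""

    def __init__(self):
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.messages = {"spam": 0, "ham": 0}

    @staticmethod
    def tokens(text):
        # Each distinct word counts once per message.
        return set(re.findall(r"[a-z']+", text.lower()))

    def learn(self, text, label):
        """Record every word of the message under 'spam' or 'ham'."""
        self.messages[label] += 1
        self.counts[label].update(self.tokens(text))

    def weight(self, word):
        """Smoothed log-odds that a message containing `word` is spam."""
        p_spam = (self.counts["spam"][word] + 1) / (self.messages["spam"] + 2)
        p_ham = (self.counts["ham"][word] + 1) / (self.messages["ham"] + 2)
        return math.log(p_spam / p_ham)

    def score(self, text):
        """Sum of word weights; positive leans spam, negative leans ham."""
        return sum(self.weight(w) for w in self.tokens(text))

f = TinyBayes()
f.learn("soluble below the lingua, a mighty effect", "spam")
f.learn("lozenges dissolve under the lingua", "spam")
f.learn("see you at the pub after work", "ham")
f.learn("the meeting moved to after lunch", "ham")

print(f.weight("lingua") > 0)   # True: only ever seen in spam
print(f.weight("the"))          # 0.0: equally common in both
print(f.score("see you at the pub") < 0)  # True
```

Nobody has to decide that "lingua" is suspicious; it picks up a weight simply by turning up in messages that were learned as spam.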

When you add this to the normal spam detection rules (duff dates, silly html colours, "Dear friend", etc), and finally, the network spam centres (DCC, Pyzor, which hold checksums of spam mails, in much the same way that freedb does for CDs), it rapidly adapts to changing content.
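
The checksum idea can be sketched like this in Python. Note the assumptions: DCC and Pyzor each use their own digest schemes and are queried over the network, neither of which is reproduced here. The SHA-1 of a whitespace- and case-normalised body, the in-memory `known_spam` set, and the function names are stand-ins for illustration.

```python
import hashlib

def digest(body):
    """SHA-1 of the body with case and whitespace normalised (an assumed scheme)."""
    normalised = " ".join(body.lower().split())
    return hashlib.sha1(normalised.encode("utf-8")).hexdigest()

# Stand-in for the shared database of reported spam checksums.
known_spam = set()

def report(body):
    """Someone who received the spam reports its checksum."""
    known_spam.add(digest(body))

def seen_before(body):
    """Everyone else checks incoming mail against the reported checksums."""
    return digest(body) in known_spam

report("Dear friend,\n   BUY now")
print(seen_before("dear friend, buy   NOW"))     # True: same mail, different layout
print(seen_before("Dear friend, buy tomorrow"))  # False: different content
```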

So far, this year, 44 spams have got through the filter. I don't know how many it's blocked.

Date: 2005-02-24 11:57 am (UTC)
From: [identity profile] onebyone.livejournal.com
Do you know how many false positives there have been?

Date: 2005-02-24 12:07 pm (UTC)
From: [identity profile] wimble.livejournal.com
Zero :)

My alleged spam goes into a "caughtspam" folder, which I periodically browse through and then delete.

All the messages scoring above 5 are classed as spam, and those scoring above 10 are automatically added to the Bayesian database. When I flush the folder, I tell it to learn all the rest (i.e. those between 5 and 10), which is usually about half of the content.
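
In Python terms, the two thresholds amount to something like the following. The constant and function names are made up for the sketch, and treating a score of exactly 5 or 10 as falling in the lower band is an assumption taken from the word "above".

```python
SPAM_THRESHOLD = 5.0        # above this: filed in "caughtspam"
AUTOLEARN_THRESHOLD = 10.0  # above this: also fed to the Bayesian database

def route(score):
    """Decide what happens to a message with the given spam score."""
    if score > AUTOLEARN_THRESHOLD:
        return "caughtspam, learned automatically"
    if score > SPAM_THRESHOLD:
        return "caughtspam, learned when the folder is flushed"
    return "inbox"

for s in (2.1, 7.5, 14.0):
    print(s, "->", route(s))
```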

Ah... It also auto-whitelists addresses, so recognised non-spammers are given a certain amount more leeway in their content. And that really does improve things (given that I get quite a lot from LJ Notify!)
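
One way the "leeway" could work, sketched in Python: pull each message's score part of the way towards the sender's past average, so a usually clean sender needs a much worse message to cross the threshold. The class `SenderHistory`, the `factor` of 0.5, and the example address are assumptions for illustration, not a description of the actual configuration.

```python
from collections import defaultdict

class SenderHistory:
    """Hypothetical per-sender score history used to soften new scores."""

    def __init__(self, factor=0.5):
        self.factor = factor              # how far to pull towards the average
        self.totals = defaultdict(float)  # sum of raw scores per sender
        self.counts = defaultdict(int)    # number of messages per sender

    def adjust(self, sender, score):
        """Return the score nudged towards the sender's mean, then record it."""
        if self.counts[sender]:
            mean = self.totals[sender] / self.counts[sender]
            adjusted = score + (mean - score) * self.factor
        else:
            adjusted = score  # no history yet, so no leeway
        self.totals[sender] += score
        self.counts[sender] += 1
        return adjusted

history = SenderHistory()
history.adjust("notify@example.com", 0.0)
history.adjust("notify@example.com", 0.0)
# A spammy-looking notification from a sender with a clean record:
print(history.adjust("notify@example.com", 8.0))  # 4.0, under a threshold of 5
```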

Date: 2005-02-24 12:01 pm (UTC)
From: [identity profile] venta.livejournal.com
I don't know how my spam filtering works - it's done by my mail provider. I don't bin spam, though, I just have it marked, as (a) I'm worried about false positives and (b) I quite like reading the bilge from time to time. I think I've had three bits of spam unmarked since I got the blacktreacle address (nearly a year).

My notification of my vodafone bill used to get marked as spam, but after a couple of cases of me saying "no, no, I want this" it seems to have got itself sorted out.

Date: 2005-02-24 12:10 pm (UTC)
From: [identity profile] wimble.livejournal.com
I've never liked the idea of not knowing, and the filtering being done by the provider: they might (potentially, for the paranoid amongst you) be deleting "spam" before it ever gets to you. And if they've got a false positive, you'd quite likely never know.

I get quite a lot of spam sent to
a) my ntl address, since it's an obvious domain, and I've got a very short username.
b) directly to my mailserver at home, since it'll simply accept email.

Date: 2005-02-24 12:05 pm (UTC)
From: [identity profile] maviscruet.livejournal.com
Good lord, your title, I recognise the song.

It is a Billy Bragg song, right?

Or am I mad?

Date: 2005-02-24 12:06 pm (UTC)
From: [identity profile] venta.livejournal.com
One notional kudo to that man. It is indeed Billy Bragg, from Walk Away, Renée.

Date: 2005-02-24 12:55 pm (UTC)
From: [personal profile] zotz
The most illegible bachelor in town . . .
