First page Back Continue Last page Overview Graphics
So it's Naive Bayesian underneath?
Yes, CRM114 uses a Naïve Bayesian classifier.
The better feature set created by the SBPH feature hash gives better performance.
Hashing and bucketing keeps all evidence from the training phase for use during classification
Phrases in colloquial English are much more standardized than words alone- this makes filter evasion much harder