Sometimes, when we discuss our anomaly detection solutions with potential customers or analysts, we notice that there is still a lack of knowledge out there about the power and nature of intelligent anomaly detection. Often we are asked: "Is anomaly detection not just a fancy name for baselining?" Of course it is not, but it is understandable why many people would think that. After all, the first systems which claimed to be able to detect anomalies on the network were indeed based on baselines, or worse, on manually specified rules. And even today there are still systems out there, which try to utilize baselining to detect network anomalies.
We believe that this technique is very much flawed, and that an intelligent approach to anomaly detection is needed. This is why we employ artificial intelligence, by using specialized neural networks in netDeFlect, Esphion's anomaly detection solution.
But let me outline this issue in more detail.
Baselining: What is it and what are the problems?
The idea of baselining is simple: Observe the network and record certain statistics about the network. Then find averages, minima and maxima for those statistics. After that is done, continue to observe and raise an alarm if the observed statistic deviates out of the established range.
For example: Let's say you observe the packet rate on your network for a week. You find that during that week you see a maximum rate of X pps. You can then monitor to see if the packet rate ever exceeds X, which might be indicative of a DDoS attack. If X is exceeded, you can sound an alarm. In effect, X has become the baseline, on which you base continuous anomaly detection.
This is a very simple example, but illustrates an important point about all baselining techniques: They rely on historical data. In other words, the network has been observed, and something that was typical about the network during the observation period has been retained and is used from now on.
The problem then is obvious: If the behavior of your network changes for legitimate reasons, then such a baselined system will not be able to differentiate between an anomaly that needs to be investigated and a normal, legitimate increase in network activity. An additional disadvantage is that anomalies can only be detected for the baselines which have been established.
There are many variations and advanced techniques which have been developed over time, in order to make baselining more suitable to deal with a real and changing world. However, the core problem always remains: Reliance on historical data, which inevitably leads to false positives, or missed anomalies.
Using intelligence, rather than historical data
While there is always a place for baselining as a supportive technology, the core detection of anomalies has to be much more intelligent than that. We are using specialized neural networks with great success. Neural networks, as you know, mimic the way in which the human brain works. This is the level of intelligence that is needed to be good in the field.
These neural networks look at the traffic in an entirely different way. Rather than comparing current data with historical data they look at how the traffic interacts, and how different types of traffic relate to each other. The neural networks allow us to look into the network at this very moment, without any concern for what it was like in the past. They enable us to examine how things are changing not just if they are different from something that was observed a while back.
This intelligence allows the neural networks to produce accurate results, even if the overall network usage profile has changed, for example when new applications have been enabled, or the user base has increased. A baselining system has graet difficulties with a situation like this. A truly intelligent system, just like an experienced network engineer, does not.
Juergen