While using Isolation Forest for anomaly detection in data should we train the model with only normal data or mix of both normal as well as outlier data? Also what is the best algorithm for anomaly detection for multivariate data? I want minimum false positives.
- I am looking at contamination level less than 5% .
- Also what is the best ML algorithm for anomaly detection for multivariate data so that it gives minimum false positives.
Note: I know that false positives reduction is a matter of tuning the model but I wanted to know the most efficient algorithm. from blogs I have understood that IsolationForest is one of the newest and most efficient unsupervised anomaly detection algorithm.