VMware Research | Anomaly Detection

Introduction

Anomaly detection algorithms that intuitive, rigorous and scalable.

Summary

Monitoring large volumes of data and finding anomalous behavior in them is a ubiquitous challenge. The data are of typically high-dimensional, heterogeneous (categorical and numerical), and contain irrelevant attributes and noise. Labels are often scarce and/or expensive, hence unsupervised learning methods are called for.

Our goal is to come up with algorithms that

Make minimal generative assumptions, and hence apply broadly.
Give rigorous guarantees, and whose outcomes are easily interpretable.
Can handle heterogeneous datasets.
Are highly performant and scale to large volumes and high dimensions.

Researchers

Udi Wieder

Senior Researcher

Palo Alto, CA, US

Parikshit Gopalan

Senior Researcher

Palo Alto, CA, US

External Researchers

Roie Levin
Vatsal Sharan

Related Publications

Multicalibrated partitions for importance weights March, 2022

PIDForest: Anomaly Detection and Certification via Partial Identification August, 2019

Efficient Anomaly Detection via Matrix Sketching December, 2018

Category

Graduated Research Projects

Research Areas

Algorithms
Machine Learning
Statistics