﻿---
title: Machine learning analysis function reference
description: The machine learning features include analysis functions that provide a wide variety of flexible ways to analyze data for anomalies. When you create anomaly...
url: https://www.elastic.co/elastic/docs-builder/docs/3016/explore-analyze/machine-learning/anomaly-detection/ml-functions
products:
  - Elasticsearch
  - Machine Learning
applies_to:
  - Elastic Cloud Serverless: Generally available
  - Elastic Stack: Generally available
---

# Machine learning analysis function reference
The machine learning features include analysis functions that provide a wide variety of flexible ways to analyze data for anomalies.
When you create anomaly detection jobs, you specify one or more detectors, which define the type of analysis that needs to be done. If you are creating your job by using machine learning APIs, you specify the functions in detector configuration objects. If you are creating your job in Kibana, you specify the functions differently depending on whether you are creating single metric, multi-metric, or advanced jobs.
Most functions detect anomalies in both low and high values. In statistical terminology, they apply a two-sided test. Some functions offer low and high variations (for example, `count`, `low_count`, and `high_count`). These variations apply one-sided tests, detecting anomalies only when the values are low or high, depending one which alternative is used.
You can specify a `summary_count_field_name` with any function except `metric`. When you use `summary_count_field_name`, the machine learning features expect the input data to be pre-aggregated. The value of the `summary_count_field_name` field must contain the count of raw events that were summarized. In Kibana, use the **summary_count_field_name** in advanced anomaly detection jobs. Analyzing aggregated input data provides a significant boost in performance. For more information, see [Aggregating data for faster performance](https://www.elastic.co/elastic/docs-builder/docs/3016/explore-analyze/machine-learning/anomaly-detection/ml-configuring-aggregation).
If your data is sparse, there may be gaps in the data which means you might have empty buckets. You might want to treat these as anomalies or you might want these gaps to be ignored. Your decision depends on your use case and what is important to you. It also depends on which functions you use. The `sum` and `count` functions are strongly affected by empty buckets. For this reason, there are `non_null_sum` and `non_zero_count` functions, which are tolerant to sparse data. These functions effectively ignore empty buckets.
- [Count functions](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/machine-learning/ml-count-functions)
- [Geographic functions](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/machine-learning/ml-geo-functions)
- [Information content functions](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/machine-learning/ml-info-functions)
- [Metric functions](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/machine-learning/ml-metric-functions)
- [Rare functions](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/machine-learning/ml-rare-functions)
- [Sum functions](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/machine-learning/ml-sum-functions)
- [Time functions](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/machine-learning/ml-time-functions)