--- title: Regular expression syntax description: A regular expression is a way to match patterns in data using placeholder characters, called operators. Elasticsearch supports regular expressions in... url: https://www.elastic.co/elastic/docs-builder/docs/3028/reference/query-languages/query-dsl/regexp-syntax products: - Elasticsearch --- # Regular expression syntax A [regular expression](https://en.wikipedia.org/wiki/Regular_expression) is a way to match patterns in data using placeholder characters, called operators. Elasticsearch supports regular expressions in the following queries: - [`regexp`](https://www.elastic.co/elastic/docs-builder/docs/3028/reference/query-languages/query-dsl/query-dsl-regexp-query) - [`query_string`](https://www.elastic.co/elastic/docs-builder/docs/3028/reference/query-languages/query-dsl/query-dsl-query-string-query) Elasticsearch uses [Apache Lucene](https://lucene.apache.org/core/)'s regular expression engine to parse these queries. ## Reserved characters Lucene’s regular expression engine supports all Unicode characters. However, the following characters are reserved as operators: ``` . ? + * | { } [ ] ( ) " \ ``` Depending on the [optional operators](#regexp-optional-operators) enabled, the following characters may also be reserved: ``` # @ & < > ~ ``` To use one of these characters literally, escape it with a preceding backslash or surround it with double quotes. For example: ``` \@ \\ "john@smith.com" ``` The backslash is an escape character in both JSON strings and regular expressions. You need to escape both backslashes in a query, unless you use a language client, which takes care of this. For example, the string `a\b` needs to be indexed as `"a\\b"`: ```json { "my_field": "a\\b" } ``` This document matches the following `regexp` query: ```json { "query": { "regexp": { "my_field.keyword": "a\\\\.*" } } } ``` ## Standard operators Lucene’s regular expression engine does not use the [Perl Compatible Regular Expressions (PCRE)](https://en.wikipedia.org/wiki/Perl_Compatible_Regular_Expressions) library, but it does support the following standard operators. Matches any character. For example: ``` ab. ``` Repeat the preceding character zero or one times. Often used to make the preceding character optional. For example: ``` abc? ``` Repeat the preceding character one or more times. For example: ``` ab+ ``` Repeat the preceding character zero or more times. For example: ``` ab* ``` Minimum and maximum number of times the preceding character can repeat. For example: ``` a{{2}} a{2,4} a{2,} ``` OR operator. The match will succeed if the longest pattern on either the left side OR the right side matches. For example: ``` abc|xyz ``` Forms a group. You can use a group to treat part of the expression as a single character. For example: ``` abc(def)? ``` Match one of the characters in the brackets. For example: ``` [abc] ``` Inside the brackets, `-` indicates a range unless `-` is the first character or escaped. For example: ``` [a-c] [-abc] [abc\-] ``` A `^` before a character in the brackets negates the character or range. For example: ``` [^abc] [^a-c] [^-abc] [^abc\-] ``` Character range classes such as `[a-c]` do not behave as expected when using `case_insensitive: true` — they remain case sensitive. For example, `[a-c]+` with `case_insensitive: true` will match strings containing only the characters 'a', 'b', and 'c', but not 'A', 'B', or 'C'. Use `[a-zA-Z]` to match both uppercase and lowercase characters.This is due to a known limitation in Lucene's regular expression engine. See [Lucene issue #14378](https://github.com/apache/lucene/issues/14378) for details. ## Optional operators You can use the `flags` parameter to enable more optional operators for Lucene’s regular expression engine. To enable multiple operators, use a `|` separator. For example, a `flags` value of `COMPLEMENT|INTERVAL` enables the `COMPLEMENT` and `INTERVAL` operators. ### Valid values Enables all optional operators. Alias for the `ALL` value. Enables the `~` operator. You can use `~` to negate the shortest following pattern. For example: ``` a~bc ``` Enables the `#` (empty language) operator. The `#` operator doesn’t match any string, not even an empty string. If you create regular expressions by programmatically combining values, you can pass `#` to specify "no string." This lets you avoid accidentally matching empty strings or other unwanted strings. For example: ``` #|abc ``` Enables the `<>` operators. You can use `<>` to match a numeric range. For example: ``` foo<1-100> foo<01-100> ``` Enables the `&` operator, which acts as an AND operator. The match will succeed if patterns on both the left side AND the right side matches. For example: ``` aaa.+&.+bbb ``` Enables the `@` operator. You can use `@` to match any entire string. You can combine the `@` operator with `&` and `~` operators to create an "everything except" logic. For example: ``` @&~(abc.+) ``` Disables all optional operators. ## Unsupported operators Lucene’s regular expression engine does not support anchor operators, such as `^` (beginning of line) or `$` (end of line). To match a term, the regular expression must match the entire string.