elasticsearch
Loading

Painless language specification (syntax)

Serverless Stack

Painless is a scripting language designed for security and performance in Elasticsearch.
Painless syntax closely resembles Java syntax while providing additional scripting-focused features:

  • Dynamic typing
  • Map and list accessor shortcuts
  • Array Initializers
  • Object simplified object manipulation

Built on the Java Virtual Machine (JVM), Painless compiles directly into bytecode and runs in a controlled sandbox environment optimized for Elasticsearch scripting requirements.

For information about basic constructs that Painless and Java share, refer to corresponding topics in the Java Language Specification. However, understanding the differences in Painless is essential for effective script development.

Painless scripts are parsed and compiled using the ANTLR4 and ASM libraries. Scripts are compiled directly into Java Virtual Machine (JVM) bytecode and executed against a standard JVM.

Painless compilation process
  1. Script input: Painless code is embedded in JSON queries.
  2. Compilation: ANTLR4 and ASM libraries parse and compile the script into JVM bytecode.
  3. Execution: Bytecode runs on the standard Java Virtual Machine.

This documentation presents the Painless language syntax in an educational format, optimized for developer understanding. The underlying grammar, however, is implemented using more concise ANTLR4 rules for efficient parsing.

Painless syntax availability and behavior vary by execution context, ensuring scripts are safe within their intended environment. Each context directly affects script functionality since they provide a set of variables, API restrictions, and the expected data types a script will return.

This context-aware design allows Painless to optimize performance and security for each use case. For comprehensive information about available contexts and their specific capabilities, refer to Painless contexts.

  • Available variables: Context-specific variables such as doc, ctx, _source, or specialized objects depending on the execution context
  • API allowlist: Permitted classes, methods, and fields from Java and Elasticsearch APIs
  • Return type expectations: Expected script output format and type constraints

Understanding context-syntax relationships is essential for effective Painless development. For detailed information about context-syntax patterns and practical examples, refer to Painless syntax-context bridge.

Painless integration-points

Double-click to expand the image.

Each integration point corresponds to a specific Painless context with distinct capabilities and variable access patterns.

Painless has implemented key differences from Java in order to optimize security and scripting performance:

  • Dynamic typing with def: Runtime type determination for flexible variable handling
  • Enhanced collection access: Direct syntax shortcuts for Maps (map.key) and Lists (list[index])
  • Stricter casting model: Explicit type conversions prevent runtime errors
  • Reference vs content equality: == calls .equals()method, === for reference comparison
  • Security restrictions: No reflection APIs, controlled class access through allowlists
  • Automatic safety controls: Loop iteration limits and recursion prevention

These differences ensure safe execution while maintaining familiar Java-like syntax for developers.

Painless integrates with Lucene’s expression language, enabling JavaScript syntax for high-performance mathematical operations and custom functions within Elasticsearch.

  • Performance optimization: Compiles directly to bytecode for native execution speed
  • Mathematical functions: Access to specialized mathematical functions for scoring calculations
  • Field access: Streamlined doc\['field'].value syntax for document field operations
  • Restricted to mathematical expressions and field access operations
  • No complex control flow or custom function definitions
  • Limited to numeric and boolean data types

JavaScript expressions through Lucene provide a specialized, high-performance option designed specifically for custom ranking and sorting functions. For comprehensive information about Javascript expression capabilities, syntax examples, and usage patterns, refer to Lucene Expressions Language.