---
title: Thai tokenizer
description: The thai tokenizer segments Thai text into words, using the Thai segmentation algorithm included with Java. Text in other languages in general will be...
url: https://www.elastic.co/elastic/docs-builder/docs/3016/reference/text-analysis/analysis-thai-tokenizer
products:
  - Elasticsearch
---

# Thai tokenizer
The `thai` tokenizer segments Thai text into words, using the Thai segmentation algorithm included with Java. Text in other languages is generally tokenized the same way the [`standard` tokenizer](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/text-analysis/analysis-standard-tokenizer) would tokenize it.
<warning>
  This tokenizer may not be supported by all JREs. It is known to work with Sun/Oracle and OpenJDK. If your application needs to be fully portable, consider using the [ICU Tokenizer](https://www.elastic.co/elastic/docs-builder/docs/3016/reference/elasticsearch/plugins/analysis-icu-tokenizer) instead.
</warning>


## Example output

```json
{
  "tokenizer": "thai",
  "text": "การที่ได้ต้องแสดงว่างานดี"
}
```

Sending the above request body to the analyze API (`POST _analyze`) produces the following terms:
```text
[ การ, ที่, ได้, ต้อง, แสดง, ว่า, งาน, ดี ]
```


## Configuration

The `thai` tokenizer is not configurable.
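
Although the tokenizer itself takes no parameters, it can still be combined with token filters in a custom analyzer. The following is a minimal sketch of an index-creation request body, not a recommended configuration: the analyzer name `my_thai_analyzer` is a hypothetical placeholder, and the choice of the built-in `lowercase` and `decimal_digit` filters is an assumption made for illustration.

```json
{
  "settings": {
    "analysis": {
      "analyzer": {
        "my_thai_analyzer": {
          "type": "custom",
          "tokenizer": "thai",
          "filter": [ "lowercase", "decimal_digit" ]
        }
      }
    }
  }
}
```

Here `lowercase` only affects any non-Thai text mixed into the input, and `decimal_digit` folds Thai digits to their ASCII equivalents; either filter can be dropped or replaced depending on the content being indexed.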