Docs
ues
Elasticsearch
Create Cluster

Create a Cluster

On the console, find ES Service Elasticsearch under the Big Data category in Products and Services to enter the product page.

Creation

The page will list all the regions and availability zones where the service can be created. After selecting the required availability zone, service version, disk type, node type, disk size, number of nodes, and filling in some initialization information, the service cluster can be quickly deployed.

Cluster Type Selection

**Basic **

Dedicated Master Node

When creating initially, you can choose the cluster type:

* Basic type is the situation where the cluster does not have a separate master node, that is, all nodes are configured as:

`node.master: true`

`node.data: true`

* The dedicated master node type is the situation where the cluster has a separate master node

The configuration of the master node is:

`node.master: true`

`node.data: false`

The configuration of the data node is:

`node.master: false`

`node.data: true`

If your cluster has a large number of nodes and high node configurations, consider using the master node separation type. This ensures that the master node uses a lower configuration to avoid resource waste, and guarantees that the data node uses a higher configuration to improve data processing capabilities. The number of master nodes for the master node separation type is 3, the number of nodes is 5-20, the master node configuration used is 2 cores 8G, and the master node configuration used when the number of nodes is greater than 20 is 4 cores 8G.

It’s worth noting that the basic type does not currently support adding master nodes.

Node Specifications, Node Quantity Selection and Payment Method

Among the selectable node specifications

Classified by disk and specifications:

Disk typeIdentifier
RSSD cloud disko.es
SSD cloud diskn.es

Selectable node specifications detailed configuration (refer to the latest node “UES Price” navigation page):

ModelNameConfiguration
Memory-intensiveo.es4m.medium2 cores 8G
Memory-intensiveo.es4m.xlarge4 cores 16G
Memory-intensiveo.es4m.2xlarge8 cores 32G
Memory-intensiveo.es4m.4xlarge16 cores 64G
Regular modelo.es2m.medium2 cores 4G
Regular modelo.es2m.xlarge4 cores 8G
Regular modelo.es2m.2xlarge8 cores 16G
Regular modelo.es2m.4xlarge16 cores 32G
Regular modelo.es2m.8xlarge32 cores 64G

Compute Specifications

Before choosing the computing specifications, you need to first pay attention to the disk space of the UES node, and then pay attention to the main data stored in the cluster, data increment, etc., to select the node specifications corresponding to the appropriate disk size.

Disk Size

The main factors affecting the storage capacity of Elasticsearch service are:

  • Number of replicas: replicas help increase data reliability, but at the same time, it also increases storage costs. The default and recommended number of replicas is 1. For some scenes that can tolerate data loss caused by exceptions, consider setting the number of replicas to 0.
  • Data expansion: In addition to the original data, ES needs to store indexes, column storage data, etc. After applying encoding compression and other technologies, it generally expands by 10%.
  • Internal overhead: ES occupies about 20% of disk space for segment merging, ES Translog, logs, etc.
  • Reserved by the operating system: by default, the operating system reserves 5%, used for key process handling, system recovery, preventing disk fragmentation issues, etc.
  • Safety threshold: usually, at least 15% safety threshold is reserved.

Therefore, the actual space occupied by data in Elasticsearch can be estimated using the following formula:

Disk total = Source data × (1 + Number of replicas) × (1 + Data expansion) / (1 - Internal task overhead) / (1 - Operating system reserved) / (1 - Safety threshold)
           = Source data × (1 + Number of replicas) × 1.7

Number of Nodes

The choice of node quantity is closely related to index shards, data redundancy, fault transfer, fault recovery, processing efficiency, etc. If you have a deep understanding and proficient use of Elasticsearch, you should be able to choose a reasonable node configuration and quantity; if you are still learning about Elasticsearch, you can choose based on actual business needs and the amount of data to be processed.

To ensure the high availability of the cluster, the basic type cluster defaults to a minimum of 3 nodes, and the master node separation type cluster defaults to 3 master nodes, with a minimum of 2 data nodes.

After successful creation, subsequent clusters can be expanded nodes (does not support master node creation and expansion).

Payment Selection

The billing mode is divided into hourly, monthly, and annual payment. The console supports modifying the billing type.

Set Kibana Account Password

Kibana is an open-source analytics and visualization platform and is a commonly used management tool for Elasticsearch. You can use Kibana to search, view, and interact with index data in the cluster, and use various charts, tables, maps, etc. Kibana can easily display advanced data analysis and visualization.

Kibana makes understanding large amounts of data easy.

The product defaults to enabling the Kibana service and adding access control. You need to set the account password when creating the cluster.

Confirm payment

After the selection and filling in of cluster information is completed, click Create to go to the payment confirmation page, complete the payment and create.

Cluster Deployment

Wait for deployment. The deployment time may vary depending on the scale of the cluster, the deployment time is approximately between 2 and 5 minutes.

During deployment, the cluster status of the cluster in the list will be displayed as “blue” status creating, and after successful creation, the cluster status will dynamically change to “green” status running.

If the creation status remains for a long time, you can click refresh to reacquire the cluster status.