8.1.0 Deployment Failed

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: 8.1.0部署失败

| username: herowk

[TiDB Usage Environment] Production Environment / Testing / PoC
[Target Machine Operating System] Rocky Linux 9.4
[TiDB Version] v8.1.0
[Reproduction Path] tiup cluster deploy tidb-test v8.1.0 ./topology.yaml --user tidbuser -p
[Encountered Problem: Problem Phenomenon and Impact]
Enabling component pd
Enabling instance 10.0.5.199:2379
Failed to enable unit: Unit file pd-2379.service does not exist.

Error: failed to enable/disable pd: failed to enable: 10.0.5.199 pd-2379.service, please check the instance’s log(/data1/tidb/tidb-deploy/pd-2379/log) for more detail.: executor.ssh.execute_failed: Failed to execute command over SSH for ‘tidbuser@10.0.5.199:22’ {ssh_stderr: Failed to enable unit: Unit file pd-2379.service does not exist.
, ssh_stdout: , ssh_command: export LANG=C; PATH=$PATH:/bin:/sbin:/usr/bin:/usr/sbin; /usr/bin/sudo -H bash -c “systemctl daemon-reload && systemctl enable pd-2379.service”}, cause: Process exited with status 1

[Resource Configuration] Enter TiDB Dashboard - Cluster Info - Hosts and take a screenshot of this page
[Attachments: Screenshots/Logs/Monitoring]

| username: xfworld | Original post link

Failed to execute command over SSH for ‘tidbuser@10.0.5.199:22’ {ssh_stderr: Failed to enable unit: Unit file pd-2379.service does not exist.

SSH error occurred.

| username: herowk | Original post link

Not really…

| username: herowk | Original post link

Pre-installation Check:
herowk@ubuntu-portable:~/02Works/00Practice/tidb$ tiup cluster check ./topology.yaml --apply --user tidbuser -p
Input SSH password:

  • Detect CPU Arch Name

    • Detecting node 10.0.5.199 Arch info … Done
    • Detecting node 10.0.5.202 Arch info … Done
    • Detecting node 10.0.5.203 Arch info … Done
    • Detecting node 10.0.5.204 Arch info … Done
    • Detecting node 10.0.5.201 Arch info … Done
    • Detecting node 10.0.5.200 Arch info … Done
    • Detecting node 10.0.5.205 Arch info … Done
  • Detect CPU OS Name

    • Detecting node 10.0.5.199 OS info … Done
    • Detecting node 10.0.5.202 OS info … Done
    • Detecting node 10.0.5.203 OS info … Done
    • Detecting node 10.0.5.204 OS info … Done
    • Detecting node 10.0.5.201 OS info … Done
    • Detecting node 10.0.5.200 OS info … Done
    • Detecting node 10.0.5.205 OS info … Done
  • Download necessary tools

    • Downloading check tools for linux/amd64 … Done
  • Collect basic system information

    • Getting system info of 10.0.5.199:22 … Done
    • Getting system info of 10.0.5.202:22 … Done
    • Getting system info of 10.0.5.203:22 … Done
    • Getting system info of 10.0.5.204:22 … Done
    • Getting system info of 10.0.5.201:22 … Done
    • Getting system info of 10.0.5.200:22 … Done
    • Getting system info of 10.0.5.205:22 … Done
  • Check time zone

    • Checking node 10.0.5.202 … Done
    • Checking node 10.0.5.203 … Done
    • Checking node 10.0.5.204 … Done
    • Checking node 10.0.5.201 … Done
    • Checking node 10.0.5.200 … Done
    • Checking node 10.0.5.205 … Done
    • Checking node 10.0.5.199 … Done
  • Check system requirements

    • Checking node 10.0.5.199 … Done
    • Checking node 10.0.5.202 … Done
    • Checking node 10.0.5.203 … Done
    • Checking node 10.0.5.204 … Done
    • Checking node 10.0.5.201 … Done
    • Checking node 10.0.5.200 … Done
    • Checking node 10.0.5.205 … Done
  • Cleanup check files

    • Cleanup check files on 10.0.5.199:22 … Done
    • Cleanup check files on 10.0.5.202:22 … Done
    • Cleanup check files on 10.0.5.203:22 … Done
    • Cleanup check files on 10.0.5.204:22 … Done
    • Cleanup check files on 10.0.5.201:22 … Done
    • Cleanup check files on 10.0.5.200:22 … Done
    • Cleanup check files on 10.0.5.205:22 … Done
      Node Check Result Message

10.0.5.199 cpu-cores Pass number of CPU cores / threads: 8
10.0.5.199 cpu-governor Warn Unable to determine current CPU frequency governor policy, auto fixing not supported
10.0.5.199 memory Pass memory size is 16384MB
10.0.5.199 selinux Pass SELinux is disabled
10.0.5.199 thp Pass THP is disabled
10.0.5.199 command Pass numactl: policy: default
10.0.5.199 os-version Pass OS is Rocky Linux 9.4 (Blue Onyx)
10.0.5.202 os-version Pass OS is Rocky Linux 9.4 (Blue Onyx)
10.0.5.202 cpu-cores Pass number of CPU cores / threads: 8
10.0.5.202 cpu-governor Warn Unable to determine current CPU frequency governor policy, auto fixing not supported
10.0.5.202 memory Pass memory size is 32768MB
10.0.5.202 selinux Pass SELinux is disabled
10.0.5.202 thp Pass THP is disabled
10.0.5.202 command Pass numactl: policy: default
10.0.5.202 timezone Pass time zone is the same as the first PD machine: Asia/Shanghai
10.0.5.203 os-version Pass OS is Rocky Linux 9.4 (Blue Onyx)
10.0.5.203 cpu-cores Pass number of CPU cores / threads: 8
10.0.5.203 cpu-governor Warn Unable to determine current CPU frequency governor policy, auto fixing not supported
10.0.5.203 memory Pass memory size is 32768MB
10.0.5.203 selinux Pass SELinux is disabled
10.0.5.203 thp Pass THP is disabled
10.0.5.203 command Pass numactl: policy: default
10.0.5.203 timezone Pass time zone is the same as the first PD machine: Asia/Shanghai
10.0.5.204 memory Pass memory size is 32768MB
10.0.5.204 selinux Pass SELinux is disabled
10.0.5.204 thp Pass THP is disabled
10.0.5.204 command Pass numactl: policy: default
10.0.5.204 timezone Pass time zone is the same as the first PD machine: Asia/Shanghai
10.0.5.204 os-version Pass OS is Rocky Linux 9.4 (Blue Onyx)
10.0.5.204 cpu-cores Pass number of CPU cores / threads: 8
10.0.5.204 cpu-governor Warn Unable to determine current CPU frequency governor policy, auto fixing not supported
10.0.5.201 cpu-cores Pass number of CPU cores / threads: 32
10.0.5.201 cpu-governor Warn Unable to determine current CPU frequency governor policy, auto fixing not supported
10.0.5.201 memory Pass memory size is 65536MB
10.0.5.201 selinux Pass SELinux is disabled
10.0.5.201 thp Pass THP is disabled
10.0.5.201 command Pass numactl: policy: default
10.0.5.201 timezone Pass time zone is the same as the first PD machine: Asia/Shanghai
10.0.5.201 os-version Pass OS is Rocky Linux 9.4 (Blue Onyx)
10.0.5.200 os-version Pass OS is Rocky Linux 9.4 (Blue Onyx)
10.0.5.200 cpu-cores Pass number of CPU cores / threads: 8
10.0.5.200 cpu-governor Warn Unable to determine current CPU frequency governor policy, auto fixing not supported
10.0.5.200 memory Pass memory size is 16384MB
10.0.5.200 selinux Pass SELinux is disabled
10.0.5.200 thp Pass THP is disabled
10.0.5.200 command Pass numactl: policy: default
10.0.5.200 timezone Pass time zone is the same as the first PD machine: Asia/Shanghai
10.0.5.205 selinux Pass SELinux is disabled
10.0.5.205 thp Pass THP is disabled
10.0.5.205 command Pass numactl: policy: default
10.0.5.205 timezone Pass time zone is the same as the first PD machine: Asia/Shanghai
10.0.5.205 os-version Pass OS is Rocky Linux 9.4 (Blue Onyx)
10.0.5.205 cpu-cores Pass number of CPU cores / threads: 4
10.0.5.205 cpu-governor Warn Unable to determine current CPU frequency governor policy, auto fixing not supported
10.0.5.205 memory Pass memory size is 16384MB

  • Try to apply changes to fix failed checks
    • Applying changes on 10.0.5.199 … Done
    • Applying changes on 10.0.5.202 … Done
    • Applying changes on 10.0.5.203 … Done
    • Applying changes on 10.0.5.204 … Done
    • Applying changes on 10.0.5.201 … Done
    • Applying changes on 10.0.5.200 … Done
    • Applying changes on 10.0.5.205 … Done
| username: herowk | Original post link

Resolved:
Only made two configurations:

sudo vi /etc/systemd/system/disable-transparent-huge-pages.service
# File content
[Unit]
Description=Disable Transparent Huge Pages (THP)
DefaultDependencies=no
After=sysinit.target local-fs.target
Before=mongod.service

[Service]
Type=oneshot
ExecStart=/bin/sh -c 'echo never | tee /sys/kernel/mm/transparent_hugepage/enabled > /dev/null'

[Install]
WantedBy=basic.target
sudo systemctl daemon-reload 
sudo systemctl enable disable-transparent-huge-pages 
sudo systemctl start disable-transparent-huge-pages 

Edited /etc/fstab to disable swap.

| username: 健康的腰间盘 | Original post link

Saved, you never know when it might come in handy.

| username: 友利奈绪 | Original post link

Missing configuration content.

| username: zhaokede | Original post link

Bookmark it.

| username: TiDBer_QYr0vohO | Original post link

Sure, please provide the Chinese text you need translated into English.

| username: tony5413 | Original post link

Disable transparent hugepages and disable swap.

| username: TiDBer_LM | Original post link

Bookmarking this, might come in handy.

| username: 呢莫不爱吃鱼 | Original post link

Mark, for reference.

| username: Jack-li | Original post link

Saved.

| username: zhh_912 | Original post link

Got it.