Failed to Deploy TiFlash on Single TiDB Server

Note:
This topic has been translated from a Chinese forum by GPT and might contain errors.

Original topic: TIDB单服务器部署tiflash失败

| username: 健康的腰间盘

Single server deployment test, the memory should be sufficient, the CPU supports AVX2, the ports do not appear to be occupied, but TiFlash just won’t start.

| username: 源de爸 | Original post link

Could you post the logs? Are there any errors?

| username: TiDBer_QYr0vohO | Original post link

There should be logs. There is an issue with your “more” command.

| username: zhanggame1 | Original post link

Check the TiFlash related startup logs with systemctl.

| username: 健康的腰间盘 | Original post link

Sorry, the previous method of checking the logs was incorrect. Here is the error log.

| username: 健康的腰间盘 | Original post link

Sorry, the previous method of checking the logs was incorrect. Here are the logs:

| username: DBAER | Original post link

It seems difficult to find the leader, but the TiKV cluster is all normal.

| username: TiDBer_QYr0vohO | Original post link

Check if port 3930 is being used.

| username: tidb菜鸟一只 | Original post link

Check if there are any processes occupying this address?

| username: 健康的腰间盘 | Original post link

I checked with netstat -tuln | grep 3930, and there is no process occupying it.

| username: TiDBer_嘎嘣脆 | Original post link

Are they all single instances?

| username: TiDBer_QYr0vohO | Original post link

Could you share the topology?

| username: 健康的腰间盘 | Original post link

No, all components are deployed on a single server.

| username: tidb菜鸟一只 | Original post link

Please send the configuration file. Maybe it’s not configured correctly.

| username: 健康的腰间盘 | Original post link

global:
user: “tidb”
ssh_port: 22
deploy_dir: “/tidb-deploy”
data_dir: “/tidb-data”

monitored:
node_exporter_port: 9100
blackbox_exporter_port: 9115

server_configs:
tidb:
instance.tidb_slow_log_threshold: 300
tikv:
readpool.storage.use-unified-pool: false
readpool.coprocessor.use-unified-pool: true
pd:
replication.enable-placement-rules: true
replication.location-labels: [“host”]
tiflash:
logger.level: “info”

pd_servers:

  • host: 10.0.30.91

tidb_servers:

  • host: 10.0.30.91

tikv_servers:

  • host: 10.0.30.91
    port: 20160
    status_port: 20180
    config:
    server.labels: { host: “logic-host-1” }

  • host: 10.0.30.91
    port: 20161
    status_port: 20181
    config:
    server.labels: { host: “logic-host-2” }

  • host: 10.0.30.91
    port: 20162
    status_port: 20182
    config:
    server.labels: { host: “logic-host-3” }

tiflash_servers:

  • host: 10.0.30.91

monitoring_servers:

  • host: 10.0.30.91

grafana_servers:

  • host: 10.0.30.91
| username: 健康的腰间盘 | Original post link

Posted below

| username: tidb菜鸟一只 | Original post link

Are you running so many services on a single machine with 16GB of memory? I suggest using tiup playground for testing. It’s quite normal that you can’t start so many services…

| username: 健康的腰间盘 | Original post link

It looks like there’s still 6.5G of free memory.

| username: TiDBer_QYr0vohO | Original post link

Can you check the system logs at /var/log/message?

| username: TiDBer_QYr0vohO | Original post link

There are a bit too many services. Check the system logs for any OOM errors during that time.