Setup with Ansible

Quick Start

INFO

https://github.com/spantaleev/matrix-docker-ansible-deploy/blob/master/docs/quick-start.md

Prerequisites

INFO

https://github.com/spantaleev/matrix-docker-ansible-deploy/blob/master/docs/prerequisites.md

Install deps

bash

# On Control Node
pipx install --include-deps ansible
pip install passlib
pipx install rust-just # or: sudo apt install just
sudo apt install pwgen # generate a secret with: pwgen -s 64 1

Configure firewall
Configure DNS settings

ess-helm

Install PostgreSQL on a dedicated server (install pg)
Set up a database for Synapse and a database for MAS
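
A minimal sketch of the database setup, assuming hypothetical role and database names (synapse_user, synapse, mas_user, mas) and placeholder passwords; none of these come from the notes above. Synapse expects a UTF8 database with the C collation, so that database is created from template0. The script only writes the SQL to a temporary file and prints it, so it can be reviewed before anything touches the server.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical names for illustration; replace with your own.
SYNAPSE_DB=synapse
SYNAPSE_USER=synapse_user
MAS_DB=mas
MAS_USER=mas_user

sql_file=$(mktemp)

# Write the SQL to a scratch file so it can be reviewed before it is applied.
cat > "$sql_file" <<SQL
CREATE ROLE ${SYNAPSE_USER} LOGIN PASSWORD 'CHANGE_ME';
CREATE ROLE ${MAS_USER} LOGIN PASSWORD 'CHANGE_ME';
-- Synapse expects UTF8 encoding with the C collation.
CREATE DATABASE ${SYNAPSE_DB} ENCODING 'UTF8' LC_COLLATE 'C' LC_CTYPE 'C' TEMPLATE template0 OWNER ${SYNAPSE_USER};
CREATE DATABASE ${MAS_DB} OWNER ${MAS_USER};
SQL

cat "$sql_file"
```

Apply the file as the postgres user with psql -f, and replace CHANGE_ME with real secrets, for example from pwgen -s 64 1.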

Enable password authentication

conf

# ~/17/data/pg_hba.conf
host all all 10.33.12.111/32 scram-sha-256

bash

# as the postgres user
/usr/pgsql-17/bin/pg_ctl reload

ess-helm get started

bash

curl -sfL https://get.k3s.io | sh -s - --flannel-iface eth1

Config elementWeb/synapse ...

Install

bash

helm upgrade --install --namespace "ess" ess \
  oci://ghcr.io/element-hq/ess-helm/matrix-stack \
  -f ~/ess-config-values/hostnames.yaml \
  -f ~/ess-config-values/tls.yaml \
  -f ~/ess-config-values/postgresql.yaml \
  --wait

bash

# rocky linux
dnf install git tar # required to install Helm

WARNING

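kubectl create namespace ess fails with: The connection to the server localhost:8080 was refused

kubectl falls back to localhost:8080 when it finds no kubeconfig, so copy the k3s kubeconfig into place:

bash

mkdir -p ~/.kube
cp /etc/rancher/k3s/k3s.yaml ~/.kube/config
kubectl create namespace ess

If helm upgrade fails, check the events: kubectl get events -n ess --sort-by=.metadata.creationTimestamp

The self-hosted PostgreSQL may be unreachable.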

pg

bash

# /home/postgres/17/data/postgresql.conf
listen_addresses = 'xx.xx.xx.xx'

json

[
   "postgres",
   "-c",
   "max_connections=500",
   "-c",
   "shared_buffers=2611MB",
   "-c",
   "effective_cache_size=7834MB",
   "-c",
   "maintenance_work_mem=652MB",
   "-c",
   "checkpoint_completion_target=0.9",
   "-c",
   "wal_buffers=16MB",
   "-c",
   "default_statistics_target=100",
   "-c",
   "random_page_cost=1.1",
   "-c",
   "effective_io_concurrency=200",
   "-c",
   "work_mem=1337kB",
   "-c",
   "huge_pages=off",
   "-c",
   "min_wal_size=1024MB",
   "-c",
   "max_wal_size=4096MB",
   "-c",
   "max_worker_processes=16",
   "-c",
   "max_parallel_workers=16",
   "-c",
   "max_parallel_workers_per_gather=4",
   "-c",
   "max_parallel_maintenance_workers=4"
]

Server tuning

For reference:

# /etc/sysctl.conf

# file descriptor limit
fs.file-max=100000

# this sets the max, so that we can bump the JVB UDP single port buffer size.
net.core.rmem_max=10485760
net.core.netdev_max_backlog=100000

bash

prlimit --pid 1234 --nofile=100000:200000 # change max open files (soft:hard) for a running process
prlimit --pid 1234 --nofile=100000: # hard limit omitted, only the soft limit changes

bash

sudo sysctl -p

# /etc/security/limits.conf
* soft nofile 100000
# optional:
# * hard nofile 200000

bash

less /proc/717594/limits # show the limits of a given process

xxxx

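After editing /etc/security/limits.conf there is usually no need to reboot the whole system, but the user session has to log in again before the change takes effect.

How to make the change take effect immediately

Log in again (the simplest way)
bash
```
exit
ssh user@server
```
Or simply log out and log back in.
Switch user with su
bash
```
su - username
```
This starts a new login session that picks up the new limits.conf settings.
Check that it took effect
bash
```
ulimit -a
```
Check whether values such as nofile and nproc have been updated.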

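Special cases

System services (such as Nginx or MySQL) may need a restart. Services started by systemd take their limits from the unit file (LimitNOFILE) rather than from limits.conf:
bash
```
systemctl restart nginx
```
For SSH connections, the SSH service may need a restart:
bash
```
systemctl restart sshd
```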

To change the ulimit of an already running process, use prlimit:

bash

prlimit --pid 1234 --nofile=100000:200000

This changes the process's file descriptor limits on the fly, without a reboot.
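
To see what prlimit does to a live process, the sketch below starts a throwaway sleep process, lowers only its soft nofile limit, and reads the result back from /proc. The sleep process, the value 64, and the soft_nofile helper are illustrative assumptions. Raising the hard limit of another process generally needs root, which is why the sketch lowers the soft limit instead.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Start a throwaway process that stands in for the real service.
sleep 60 &
pid=$!

# Hypothetical helper: print the soft "Max open files" value of a PID.
soft_nofile() {
  awk '/^Max open files/ {print $4}' "/proc/$1/limits"
}

before=$(soft_nofile "$pid")

# Lower only the soft limit; the trailing colon leaves the hard limit untouched.
prlimit --pid "$pid" --nofile=64:

after=$(soft_nofile "$pid")
echo "soft nofile of PID $pid: $before -> $after"

kill "$pid"
```

For a real service, replace the sleep process with the service's PID and run prlimit as root when the hard limit has to go up.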


fs.file-max vs nofile

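Not quite the same thing: fs.file-max and the soft max open files limit (ulimit -Sn) restrict file descriptors at different levels:

1. `fs.file-max` (system-wide limit)

Purpose: caps the total number of file handles the Linux kernel will allocate, shared by all processes.
How to check:
bash
```
sysctl fs.file-max
```

How to change: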

bash

sysctl -w fs.file-max=1000000
echo "fs.file-max = 1000000" >> /etc/sysctl.conf
sysctl -p

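Scope: the whole system; all processes share this limit.

2. `ulimit -Sn` (per-process soft limit)

Purpose: caps the number of file descriptors a single process may have open. It can be raised up to the hard limit (ulimit -Hn).
How to check:
bash
```
ulimit -Sn
```

How to change: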

bash

ulimit -Sn 100000

Or:

bash

sudo nano /etc/security/limits.conf

Add:

plaintext

* soft nofile 100000
* hard nofile 200000

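Scope: ulimit changes only the current shell and the processes it starts, and limits.conf applies to new login sessions. Neither changes the system-wide total.

3. Summary

✅ fs.file-max is the system-wide limit: the total number of file handles the Linux kernel will allocate.
✅ ulimit -Sn is the per-process limit: the number of file descriptors a single process may open.
✅ The per-process limit is bounded by fs.nr_open rather than by fs.file-max. Once the system-wide total reaches fs.file-max, new opens start to fail, so a nofile value above fs.file-max is of no use.

On a server that has to handle many concurrent WebSocket or database connections, raise both fs.file-max and ulimit -Sn so that processes do not hit either limit.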
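
The levels described above can be read side by side. A small sketch, assuming a Linux host with /proc mounted; the variable names are illustrative, and the values shown are those of the shell that runs the script, not of any service.

```bash
#!/usr/bin/env bash
set -euo pipefail

soft=$(ulimit -Sn)                      # per-process soft limit of this shell
hard=$(ulimit -Hn)                      # per-process hard limit of this shell
nr_open=$(cat /proc/sys/fs/nr_open)     # upper bound for any per-process nofile value
file_max=$(cat /proc/sys/fs/file-max)   # system-wide cap on allocated file handles

# file-nr holds three fields: allocated handles, unused handles, and the maximum.
read -r allocated _ _ < /proc/sys/fs/file-nr

printf '%-28s %s\n' "soft nofile (ulimit -Sn)" "$soft"
printf '%-28s %s\n' "hard nofile (ulimit -Hn)" "$hard"
printf '%-28s %s\n' "fs.nr_open" "$nr_open"
printf '%-28s %s\n' "fs.file-max" "$file_max"
printf '%-28s %s\n' "handles currently allocated" "$allocated"
```

If the allocated count is close to fs.file-max, the system-wide limit is the bottleneck. If a single process fails with "Too many open files" while the count is low, its nofile limit is the one to raise.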

pg ulimit

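Raising ulimit in a shell has no effect on PostgreSQL, because services started by systemd do not inherit the ulimit settings of a shell session and use systemd's own resource limits instead. LimitNOFILE has to be set explicitly in PostgreSQL's systemd configuration.

Solution

Check the file descriptor limit of the PostgreSQL process
bash
```
cat /proc/$(pgrep -u postgres -o postgres)/limits | grep "Max open files"
```
This prints the effective max open files limit of the running PostgreSQL process.
Edit the systemd configuration by creating a drop-in for the PostgreSQL service:
bash
```
sudo systemctl edit postgresql-17
```
Add:
plaintext
```
[Service]
LimitNOFILE=100000
```

Reload systemd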

bash

sudo systemctl daemon-reload
sudo systemctl restart postgresql-17

Verify that the change took effect

bash

cat /proc/$(pgrep -u postgres -o postgres)/limits | grep "Max open files"

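Why does `ulimit` not take effect?

ulimit -n only affects the current shell session, and services started by systemd do not inherit a shell's ulimit.
LimitNOFILE is a systemd-level resource limit and must be set explicitly in the service unit.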


Adding a k3s node

https://docs.k3s.io/zh/quick-start

bash

# on agent(worker) node
hostnamectl set-hostname k3s-worker-1
nano /etc/hosts
reboot
curl -sfL https://get.k3s.io | K3S_URL=https://x.x.x.x:6443 K3S_TOKEN=xxxx sh -s - --flannel-iface eth1

# on master node
k3s kubectl get nodes -o wide
kubectl label nodes k3s-worker-1 node.type=worker

# on pg servers
vi /var/lib/pgsql/17/data/pg_hba.conf
systemctl reload postgresql-17
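
The pg_hba.conf step above presumably adds a rule for the new node's private IP, in the same form as the existing 10.33.12.111/32 entry. A minimal sketch of doing that idempotently; the add_hba_entry helper and the address 10.33.12.112 are hypothetical, and the demo runs against a scratch file rather than the real pg_hba.conf.

```bash
#!/usr/bin/env bash
set -euo pipefail

# add_hba_entry FILE IP: append a scram-sha-256 rule for one host
# unless exactly that rule is already present.
add_hba_entry() {
  local file=$1 ip=$2
  local rule="host all all ${ip}/32 scram-sha-256"
  if grep -qxF "$rule" "$file"; then
    echo "already present: $rule"
  else
    echo "$rule" >> "$file"
    echo "added: $rule"
  fi
}

# Demo on a scratch copy that starts with the existing rule.
hba=$(mktemp)
echo "host all all 10.33.12.111/32 scram-sha-256" > "$hba"

add_hba_entry "$hba" 10.33.12.112   # hypothetical private IP of the new worker
add_hba_entry "$hba" 10.33.12.112   # second call changes nothing

cat "$hba"
```

On the pg server, point the helper at the real pg_hba.conf and follow it with systemctl reload postgresql-17 as shown above.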

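Private network interface

Reference: BandwagonHost (搬瓦工) private IP

bash

ip a # the eth1 entry with a 10.xxxx address is the private interface

If flannel-iface was not passed as a command-line argument when k3s was installed, it can be changed in either of the following 2 ways: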

systemd

server: /etc/systemd/system/k3s.service

ExecStart=/usr/local/bin/k3s \
    server \
    --flannel-iface=eth1

agent: /etc/systemd/system/k3s-agent.service

ExecStart=/usr/local/bin/k3s \
    agent \
    --flannel-iface=eth1

bash

sudo systemctl daemon-reload
sudo systemctl restart k3s # on agent nodes: sudo systemctl restart k3s-agent

k3s config.yaml

yml

# server/agent: /etc/rancher/k3s/config.yaml
flannel-iface: eth1

bash

sudo systemctl restart k3s

traefik config

yml

# /var/lib/rancher/k3s/server/manifests/traefik-custom.yaml
# https://docs.k3s.io/helm#customizing-packaged-components-with-helmchartconfig
# k3s applies the change automatically once the file is saved
apiVersion: helm.cattle.io/v1
kind: HelmChartConfig
metadata:
  name: traefik
  namespace: kube-system
spec:
  valuesContent: |-
    deployment:
      enabled: true
      kind: DaemonSet

User status

| Status | MAS | Synapse |
| --- | --- | --- |
| locked | User account is temporarily disabled, cannot log in but account data is preserved. Can be unlocked by admin. | User is temporarily suspended, cannot perform actions but account exists. Can be unlocked via admin API. |
| deactivated | User account is permanently disabled, all sessions invalidated. Cannot be reactivated through MAS interface. | User account is permanently disabled, removed from rooms, profile cleared. Can only be reactivated via admin API with data loss. |

How to reactivate a user that was deactivated through the Synapse admin API when MAS is enabled

js

fetch("https://xxxx/_synapse/admin/v2/users/@xxx:yyy.zzz", {
  method: "PUT",
  headers: {
    "Authorization": "Bearer YOUR_ADMIN_TOKEN",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    deactivated: false // ⚠️ do not set the password field; this applies to OIDC/MAS mode
  })
})
  .then(response => response.json())
  .then(data => console.log("Response:", data))
  .catch(error => console.error("Error:", error));
