【Question Title】: How to run a celery worker with a Django app scalable by AWS Elastic Beanstalk?
【Posted】: 2017-04-30 21:50:32
【Question Description】:

How can I use Django with AWS Elastic Beanstalk so that it also runs tasks with celery on the main node only?

【Question Discussion】:

Tags: django amazon-web-services celery amazon-elastic-beanstalk django-celery


【Solution 1】:

This is how I set up celery with django on Elastic Beanstalk, with scalability working fine.

Please keep in mind that the 'leader_only' option of container_commands works only on environment rebuild or deployment of the app. If the service works long enough, the leader node may be removed by Elastic Beanstalk. To deal with that, you may have to apply instance protection to your leader node. Check: http://docs.aws.amazon.com/autoscaling/latest/userguide/as-instance-termination.html#instance-protection-instance

Add a bash script for the celery worker and beat configuration.

Add file root_folder/.ebextensions/files/celery_configuration.txt:

#!/usr/bin/env bash

# Get django environment variables
celeryenv=`cat /opt/python/current/env | tr '\n' ',' | sed 's/export //g' | sed 's/$PATH/%(ENV_PATH)s/g' | sed 's/$PYTHONPATH//g' | sed 's/$LD_LIBRARY_PATH//g' | sed 's/%/%%/g'`
celeryenv=${celeryenv%?}

# Create celery configuration script
celeryconf="[program:celeryd-worker]
; Set full path to celery program if using virtualenv
command=/opt/python/run/venv/bin/celery worker -A django_app --loglevel=INFO

directory=/opt/python/current/app
user=nobody
numprocs=1
stdout_logfile=/var/log/celery-worker.log
stderr_logfile=/var/log/celery-worker.log
autostart=true
autorestart=true
startsecs=10

; Need to wait for currently executing tasks to finish at shutdown.
; Increase this if you have very long running tasks.
stopwaitsecs = 600

; When resorting to send SIGKILL to the program to terminate it
; send SIGKILL to its whole process group instead,
; taking care of its children as well.
killasgroup=true

; if rabbitmq is supervised, set its priority higher
; so it starts first
priority=998

environment=$celeryenv

[program:celeryd-beat]
; Set full path to celery program if using virtualenv
command=/opt/python/run/venv/bin/celery beat -A django_app --loglevel=INFO --workdir=/tmp -S django --pidfile /tmp/celerybeat.pid

directory=/opt/python/current/app
user=nobody
numprocs=1
stdout_logfile=/var/log/celery-beat.log
stderr_logfile=/var/log/celery-beat.log
autostart=true
autorestart=true
startsecs=10

; Need to wait for currently executing tasks to finish at shutdown.
; Increase this if you have very long running tasks.
stopwaitsecs = 600

; When resorting to send SIGKILL to the program to terminate it
; send SIGKILL to its whole process group instead,
; taking care of its children as well.
killasgroup=true

; if rabbitmq is supervised, set its priority higher
; so it starts first
priority=998

environment=$celeryenv"

# Create the celery supervisord conf script
echo "$celeryconf" | tee /opt/python/etc/celery.conf

# Add configuration script to supervisord conf (if not there already)
if ! grep -Fxq "[include]" /opt/python/etc/supervisord.conf
  then
  echo "[include]" | tee -a /opt/python/etc/supervisord.conf
  echo "files: celery.conf" | tee -a /opt/python/etc/supervisord.conf
fi

# Reread the supervisord config
supervisorctl -c /opt/python/etc/supervisord.conf reread

# Update supervisord in cache without restarting all services
supervisorctl -c /opt/python/etc/supervisord.conf update

# Start/Restart celeryd through supervisord
supervisorctl -c /opt/python/etc/supervisord.conf restart celeryd-beat
supervisorctl -c /opt/python/etc/supervisord.conf restart celeryd-worker
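The tr/sed pipeline at the top of the script flattens /opt/python/current/env into the single comma-separated line that supervisord expects for its environment= key, while escaping % characters that supervisord would otherwise treat as interpolation markers. As a rough, illustrative Python equivalent of that pipeline (a hypothetical helper, not part of the deployment):

```python
def flatten_env(env_text: str) -> str:
    """Rough Python equivalent of the tr/sed chain above."""
    s = env_text.replace("\n", ",")            # tr '\n' ','
    s = s.replace("export ", "")               # sed 's/export //g'
    s = s.replace("$PATH", "%(ENV_PATH)s")     # keep PATH via supervisord env expansion
    s = s.replace("$PYTHONPATH", "")           # drop self-references supervisord can't expand
    s = s.replace("$LD_LIBRARY_PATH", "")
    s = s.replace("%", "%%")                   # escape % for supervisord
    return s[:-1] if s.endswith(",") else s    # ${celeryenv%?} drops the trailing comma
```

This is only meant to make the transformation readable; the actual deployment keeps doing it in bash.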

Take care of script execution during deployment, but only on the leader node (leader_only: true). Add file root_folder/.ebextensions/02-python.config:

container_commands:
  04_celery_tasks:
    command: "cat .ebextensions/files/celery_configuration.txt > /opt/elasticbeanstalk/hooks/appdeploy/post/run_supervised_celeryd.sh && chmod 744 /opt/elasticbeanstalk/hooks/appdeploy/post/run_supervised_celeryd.sh"
    leader_only: true
  05_celery_tasks_run:
    command: "/opt/elasticbeanstalk/hooks/appdeploy/post/run_supervised_celeryd.sh"
    leader_only: true

File requirements.txt:

celery==4.0.0
django_celery_beat==1.0.1
django_celery_results==1.0.1
pycurl==7.43.0 --global-option="--with-nss"

Configure celery for the Amazon SQS broker (get your desired endpoint from the list: http://docs.aws.amazon.com/general/latest/gr/rande.html) in root_folder/django_app/settings.py:

...
CELERY_RESULT_BACKEND = 'django-db'
CELERY_BROKER_URL = 'sqs://%s:%s@' % (aws_access_key_id, aws_secret_access_key)
# Due to an error in the lib, the N. Virginia region was used temporarily; set it to Ireland ("eu-west-1") after the fix.
CELERY_BROKER_TRANSPORT_OPTIONS = {
    "region": "eu-west-1",
    'queue_name_prefix': 'django_app-%s-' % os.environ.get('APP_ENV', 'dev'),
    'visibility_timeout': 360,
    'polling_interval': 1
}
...
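One gotcha with the broker URL above: kombu parses it as a URL, so AWS credentials containing characters such as "/" or "+" must be URL-encoded before being embedded. A small sketch (hypothetical helper name) using the standard library:

```python
from urllib.parse import quote

def sqs_broker_url(aws_access_key_id: str, aws_secret_access_key: str) -> str:
    """Build an SQS broker URL with URL-safe credentials.

    safe="" also encodes "/", which quote() would otherwise keep.
    """
    return "sqs://%s:%s@" % (
        quote(aws_access_key_id, safe=""),
        quote(aws_secret_access_key, safe=""),
    )
```

For example, a secret key containing a slash becomes %2F in the URL instead of breaking kombu's URL parsing.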

Celery configuration for the django_app Django app.

Add file root_folder/django_app/celery.py:

from __future__ import absolute_import, unicode_literals
import os
from celery import Celery

# set the default Django settings module for the 'celery' program.
os.environ.setdefault('DJANGO_SETTINGS_MODULE', 'django_app.settings')

app = Celery('django_app')

# Using a string here means the worker doesn't have to serialize
# the configuration object to child processes.
# - namespace='CELERY' means all celery-related configuration keys
#   should have a `CELERY_` prefix.
app.config_from_object('django.conf:settings', namespace='CELERY')

# Load task modules from all registered Django app configs.
app.autodiscover_tasks()

Modify file root_folder/django_app/__init__.py:

from __future__ import absolute_import, unicode_literals

# This will make sure the app is always imported when
# Django starts so that shared_task will use this app.
from django_app.celery import app as celery_app

__all__ = ['celery_app']

Also check:

【Discussion】:

  • Could you take a look at this question? I followed your example but got the following error: stackoverflow.com/questions/43481540/…
  • @BorkoKovacev Thanks, I've updated the fix for the supervisorctl restart setting.
  • @smentek Small edit - adding | sed 's/%/%%/g' to the celeryenv line helps prevent an issue some people run into with this configuration, see stackoverflow.com/questions/41231489/…
  • "If the service works long enough, the leader node may be removed by Elastic Beanstalk." -> You can protect specific instances from being removed by the load balancer.
  • Thanks for mentioning instance protection.
【Solution 2】:

This is how I extended @smentek's answer to allow for multiple worker instances and a single beat instance - the same thing applies where you have to protect your leader. (I don't have an automated solution for that yet.)

Please note that envvar updates to EB through the EB cli or the web interface are not reflected by celery beat or the workers until the app server has restarted. This caught me off guard once.

A single celery_configuration.sh file outputs two scripts for supervisord. Note that celery-beat has autostart=false, otherwise you end up with many beats after an instance restart:

# get django environment variables
celeryenv=`cat /opt/python/current/env | tr '\n' ',' | sed 's/export //g' | sed 's/$PATH/%(ENV_PATH)s/g' | sed 's/$PYTHONPATH//g' | sed 's/$LD_LIBRARY_PATH//g' | sed 's/%/%%/g'`
celeryenv=${celeryenv%?}

# create celery beat config script
celerybeatconf="[program:celeryd-beat]
; Set full path to celery program if using virtualenv
command=/opt/python/run/venv/bin/celery beat -A lexvoco --loglevel=INFO --workdir=/tmp -S django --pidfile /tmp/celerybeat.pid

directory=/opt/python/current/app
user=nobody
numprocs=1
stdout_logfile=/var/log/celery-beat.log
stderr_logfile=/var/log/celery-beat.log
autostart=false
autorestart=true
startsecs=10

; Need to wait for currently executing tasks to finish at shutdown.
; Increase this if you have very long running tasks.
stopwaitsecs = 10

; When resorting to send SIGKILL to the program to terminate it
; send SIGKILL to its whole process group instead,
; taking care of its children as well.
killasgroup=true

; if rabbitmq is supervised, set its priority higher
; so it starts first
priority=998

environment=$celeryenv"

# create celery worker config script
celeryworkerconf="[program:celeryd-worker]
; Set full path to celery program if using virtualenv
command=/opt/python/run/venv/bin/celery worker -A lexvoco --loglevel=INFO

directory=/opt/python/current/app
user=nobody
numprocs=1
stdout_logfile=/var/log/celery-worker.log
stderr_logfile=/var/log/celery-worker.log
autostart=true
autorestart=true
startsecs=10

; Need to wait for currently executing tasks to finish at shutdown.
; Increase this if you have very long running tasks.
stopwaitsecs = 600

; When resorting to send SIGKILL to the program to terminate it
; send SIGKILL to its whole process group instead,
; taking care of its children as well.
killasgroup=true

; if rabbitmq is supervised, set its priority higher
; so it starts first
priority=999

environment=$celeryenv"

# create files for the scripts
echo "$celerybeatconf" | tee /opt/python/etc/celerybeat.conf
echo "$celeryworkerconf" | tee /opt/python/etc/celeryworker.conf

# add configuration script to supervisord conf (if not there already)
if ! grep -Fxq "[include]" /opt/python/etc/supervisord.conf
  then
  echo "[include]" | tee -a /opt/python/etc/supervisord.conf
  echo "files: celerybeat.conf celeryworker.conf" | tee -a /opt/python/etc/supervisord.conf
fi

# reread the supervisord config
/usr/local/bin/supervisorctl -c /opt/python/etc/supervisord.conf reread
# update supervisord in cache without restarting all services
/usr/local/bin/supervisorctl -c /opt/python/etc/supervisord.conf update

Then in container_commands we only restart beat on the leader:

container_commands:
  # create the celery configuration file
  01_create_celery_beat_configuration_file:
    command: "cat .ebextensions/files/celery_configuration.sh > /opt/elasticbeanstalk/hooks/appdeploy/post/run_supervised_celeryd.sh && chmod 744 /opt/elasticbeanstalk/hooks/appdeploy/post/run_supervised_celeryd.sh && sed -i 's/\r$//' /opt/elasticbeanstalk/hooks/appdeploy/post/run_supervised_celeryd.sh"
  # restart celery beat if leader
  02_start_celery_beat:
    command: "/usr/local/bin/supervisorctl -c /opt/python/etc/supervisord.conf restart celeryd-beat"
    leader_only: true
  # restart celery worker
  03_start_celery_worker:
    command: "/usr/local/bin/supervisorctl -c /opt/python/etc/supervisord.conf restart celeryd-worker"

【Discussion】:

  • I wonder how you deployed this on AWS. Did you use worker environments as shown here: docs.aws.amazon.com/elasticbeanstalk/latest/dg/…? What do you mean by a beat instance? Running beat just sends tasks to the queue, so I don't understand why you'd want a separate machine for that. Do you have a separate EC2 instance running the web app?
  • How do you set this up? How do you make sure you won't have multiple instances of celery running when scaling happens?
  • Multiple celery worker instances are fine. You only want one beat, though. Honestly, I stopped using Elastic Beanstalk a while ago and moved everything to kubernetes; I recommend you do the same. @GregHolst Worker environments ended up being unsuitable for some reason.
【Solution 3】:

If someone is following smentek's answer and getting the error:

05_celery_tasks_run: /usr/bin/env bash does not exist.

Know that, if you are on Windows, your problem might be that the "celery_configuration.txt" file has WINDOWS EOL when it should have UNIX EOL. If using Notepad++, open the file and go to "Edit > EOL Conversion > Unix (LF)". Save, redeploy, and the error is gone.
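As an alternative to fixing the line endings in an editor, a small Python sketch (hypothetical helper names) does the same CRLF → LF conversion:

```python
from pathlib import Path

def to_unix_eol(data: bytes) -> bytes:
    """Convert Windows CRLF line endings to Unix LF."""
    return data.replace(b"\r\n", b"\n")

def convert_file(path: str) -> None:
    """Rewrite a file in place with Unix line endings."""
    p = Path(path)
    p.write_bytes(to_unix_eol(p.read_bytes()))
```

For example, convert_file(".ebextensions/files/celery_configuration.txt") before committing; this is equivalent to the sed -i 's/\r$//' step used in Solution 2's container_commands.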

Also, a couple of warnings for hobbyists like me:

  • Be sure to include "django_celery_beat" and "django_celery_results" in "INSTALLED_APPS" in your settings.py file.

  • To check for celery errors, connect to your instance with "eb ssh" and then run "tail -n 40 /var/log/celery-worker.log" and "tail -n 40 /var/log/celery-beat.log" (where "40" refers to the number of lines you want to read from the file, starting from the end).
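If you are inspecting those logs programmatically rather than over SSH, the tail -n behavior can be sketched with a bounded deque (hypothetical helper name):

```python
from collections import deque
from typing import Iterable, List

def tail_lines(lines: Iterable[str], n: int = 40) -> List[str]:
    """Return the last n items, like tail -n; a deque with maxlen
    keeps only the most recent n lines while streaming."""
    return list(deque(lines, maxlen=n))
```

Usage would be something like: with open("/var/log/celery-worker.log") as f: print("".join(tail_lines(f, 40))).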

Hope this helps someone; it would have saved me a few hours!

【Discussion】:
