Magnolia is "crash looping"

Symptom

A CustomerMagnoliaCrashLooping alert is firing. Kubernetes has restarted a Magnolia pod more than three times within 15 minutes.

CustomerMagnoliaCrashLooping alerts are sent to subscribers via email.

Kubernetes will restart a pod's container if it exceeds its memory limit. The Magnolia JVM's heap is bounded by its max heap setting, so the heap alone typically stays within the limit, but the JVM also consumes a small amount of non-heap memory (usually about 200 MB) that can vary over time. Other containers running in the Magnolia pod may also consume memory, but they usually use very small amounts (tens of MB). Temporary filesystems may use memory as well.
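If a pod is being killed for exceeding its memory limit, Kubernetes records the termination reason as OOMKilled. A quick way to check this is to look at the container's last terminated state. The command below is a sketch; the container name magnolia-helm is taken from the alert expression, and the namespace and pod placeholders come from the alert.

    # Show why the magnolia-helm container last terminated;
    # "OOMKilled" means the memory limit was exceeded
    kubectl -n <namespace from alert> get pod <Magnolia pod from alert> \
      -o jsonpath='{.status.containerStatuses[?(@.name=="magnolia-helm")].lastState.terminated.reason}'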

Observations

Here are the details on the alert:

Alert: CustomerMagnoliaCrashLooping

Expression

increase(kube_pod_container_status_restarts_total{container="magnolia-helm"}[15m]) > 3

Delay: 2 minutes

Labels: team: customer

Annotations

  • source

  • summary

  • description

  • tenant

  • cluster_id

  • cluster_name

  • pod

  • instance

Check readiness and liveness probe config for Magnolia pod

The alert will note the affected Magnolia pod.

You can view the probe configuration for the Magnolia pod in Rancher or with kubectl.

kubectl -n <namespace from alert> describe pod <Magnolia pod from alert>

Look for the "Liveness" and "Readiness" sections in the output:

    Liveness:       http-get http://:liveness-port/livez delay=240s timeout=10s period=10s #success=1 #failure=4
    Readiness:      http-get http://:liveness-port/readyz delay=2s timeout=1s period=2s #success=1 #failure=3
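The Events section at the end of the describe output also lists recent probe failures and restarts. If you prefer to query events directly, a command along these lines works too (a sketch using the same placeholders as above):

    # Recent events for the pod, including liveness/readiness probe failures
    kubectl -n <namespace from alert> get events \
      --field-selector involvedObject.name=<Magnolia pod from alert> \
      --sort-by=.lastTimestamp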

Check bootstrapper container log output for Magnolia pod

The liveness and readiness probes actually check the bootstrapper instead of Magnolia. The bootstrapper then checks Magnolia and returns a result depending on Magnolia’s state.

The bootstrapper’s log shows the results of liveness and readiness checks. You can view the bootstrapper log in the customer’s cockpit or in Grafana.
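You can also tail the bootstrapper container's log with kubectl. This is a sketch: the container name bootstrapper is an assumption and may differ in your chart, so confirm it against the pod's container list first.

    # Confirm the bootstrapper container's name in the pod
    kubectl -n <namespace from alert> get pod <Magnolia pod from alert> \
      -o jsonpath='{.spec.containers[*].name}'

    # Follow the bootstrapper log and watch the liveness/readiness check results
    kubectl -n <namespace from alert> logs <Magnolia pod from alert> -c bootstrapper --tail=200 -f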

Be careful when adjusting the readiness and liveness probes for the Magnolia pod: don't set very long delays or failure thresholds until you have verified that Magnolia really needs more time to start up.

Solutions

This section provides solutions that should help resolve the issue in most cases.

Stop failing readiness check

Magnolia may take longer to start up and pass its readiness probe for a variety of reasons (Lucene indexing, large JCR repository, lots of module startup tasks).

You can allow Magnolia more time to start up by (see the sketch after this list):

  • increasing the failureThreshold Helm chart value for readiness to increase the number of failed readiness checks tolerated

  • increasing the initialDelaySeconds Helm chart value for readiness to increase the time before readiness is checked

  • increasing the periodSeconds Helm chart value for readiness to increase the time between readiness checks
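For example, a values override along these lines would tolerate up to 10 minutes of failing readiness checks after an initial 60-second delay before the pod is marked unready. This is only a sketch: the parameter names come from this page, but the exact key path (shown here as magnolia.readinessProbe) depends on your Helm chart version, so verify it against the chart's values.yaml.

    # Sketch of a readiness probe override (verify the key path against your chart)
    magnolia:
      readinessProbe:
        initialDelaySeconds: 60   # wait 60s after container start before the first check
        periodSeconds: 10         # check every 10s
        failureThreshold: 60      # tolerate 60 failed checks (600s) before marking the pod unready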

Stop failing liveness check

Magnolia may take longer to start up and pass its liveness probe for a variety of reasons (Lucene indexing, large JCR repository, lots of module startup tasks).

You can allow Magnolia more time to start up by (see the sketch after this list):

  • increasing the failureThreshold Helm chart value for liveness to increase the number of failed liveness checks tolerated

  • increasing the initialDelaySeconds Helm chart value for liveness to increase the time before liveness is checked

  • increasing the periodSeconds Helm chart value for liveness to increase the time between liveness checks
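A comparable override for the liveness probe might look like the sketch below; the key path is again chart-dependent. Keep the earlier caution in mind: failed liveness checks restart the container, so only raise these values once you have verified that Magnolia genuinely needs more startup time, and apply the override through your usual values file or deployment process rather than ad hoc.

    # Sketch of a liveness probe override (verify the key path against your chart)
    magnolia:
      livenessProbe:
        initialDelaySeconds: 300  # wait 5 minutes after container start before the first check
        periodSeconds: 10         # check every 10s
        failureThreshold: 6       # tolerate 6 failed checks (60s) before the container is restarted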
