Magnolia is "crash looping"
Symptom
A CustomerMagnoliaCrashLooping alert is firing. Kubernetes has restarted a Magnolia pod at least three times within 15 minutes.
CustomerMagnoliaCrashLooping alerts are sent to subscribers via email.
Kubernetes will restart a pod if it exceeds its memory limit. The Magnolia JVM heap cannot grow beyond its configured maximum (the JVM max heap setting), but the JVM also consumes a small amount of non-heap memory (usually about 200 MB) that can vary over time. Other containers running in the Magnolia pod also consume memory, though usually only tens of MB. Memory-backed temporary filesystems may use memory as well.
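To confirm whether the restarts are caused by the memory limit, check the termination reason of the previous container. A minimal sketch, substituting the namespace and pod from the alert:
kubectl -n <namespace from alert> get pod <Magnolia pod from alert> \
  -o jsonpath='{.status.containerStatuses[*].lastState.terminated.reason}'
A reason of OOMKilled points to the memory limit; other reasons (or none at all) usually point to failing probes or an application error instead.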
Observations
Here are the details on the alert:
Alert: CustomerMagnoliaCrashLooping
Expression:
Delay:
Labels:
Annotations:
Check readiness and liveness probe config for Magnolia pod
The alert will note the affected Magnolia pod.
You can view the probe configuration for the Magnolia pod in Rancher or with kubectl:
kubectl -n <namespace from alert> describe pod <Magnolia pod from alert>
Look for the "Liveness" and "Readiness" sections in the output:
Liveness: http-get http://:liveness-port/livez delay=240s timeout=10s period=10s #success=1 #failure=4
Readiness: http-get http://:liveness-port/readyz delay=2s timeout=1s period=2s #success=1 #failure=3
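The pod's recent events can also show why Kubernetes restarted the container. As a sketch (again substituting the namespace and pod from the alert), list the events for the pod and look for messages such as "Liveness probe failed" or "Back-off restarting failed container":
kubectl -n <namespace from alert> get events \
  --field-selector involvedObject.name=<Magnolia pod from alert> \
  --sort-by=.lastTimestamp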
Check bootstrapper container log output for Magnolia pod
The liveness and readiness probes actually check the bootstrapper instead of Magnolia. The bootstrapper then checks Magnolia and returns a result depending on Magnolia’s state.
The bootstrapper’s log shows the results of liveness and readiness checks. You can view the bootstrapper log in the customer’s cockpit or in Grafana.
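If you have kubectl access, you can also read the log directly from the pod. A sketch, assuming the container is named bootstrapper (check the container names in the describe output above if it differs):
kubectl -n <namespace from alert> logs <Magnolia pod from alert> -c bootstrapper --tail=200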
Be careful when adjusting the readiness and liveness probes for the Magnolia pod: don’t set very long delays or failure thresholds until you have verified that Magnolia really needs more time to start up.
Solutions
This section provides solutions that should help resolve the issue in most cases.
Stop failing readiness check
Magnolia may take longer to start up and pass its readiness probe for a variety of reasons (Lucene indexing, large JCR repository, lots of module startup tasks).
You can allow Magnolia more time to start up by (see the sketch after this list):
- increasing the failureThreshold Helm chart value for readiness, to increase the number of failed readiness checks tolerated
- increasing the initialDelaySeconds Helm chart value for readiness, to increase the time before readiness is first checked
- increasing the periodSeconds Helm chart value for readiness, to increase the time between readiness checks
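A minimal sketch of raising these readiness values with Helm. The release name, chart reference, and value paths (magnoliaPublic.readinessProbe.*) are assumptions; check the chart’s values.yaml for the exact keys your chart version exposes, and pick numbers that match how long Magnolia actually needs:
helm -n <namespace from alert> upgrade <release name> <Magnolia chart> \
  --reuse-values \
  --set magnoliaPublic.readinessProbe.failureThreshold=10 \
  --set magnoliaPublic.readinessProbe.initialDelaySeconds=60 \
  --set magnoliaPublic.readinessProbe.periodSeconds=10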
Stop failing liveness check
Magnolia may take longer to start up and pass its liveness probe for a variety of reasons (Lucene indexing, large JCR repository, lots of module startup tasks).
You can allow Magnolia more time to start up by (see the sketch after this list):
- increasing the failureThreshold Helm chart value for liveness, to increase the number of failed liveness checks tolerated
- increasing the initialDelaySeconds Helm chart value for liveness, to increase the time before liveness is first checked
- increasing the periodSeconds Helm chart value for liveness, to increase the time between liveness checks
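The liveness values can be raised the same way. Again, the value paths (magnoliaPublic.livenessProbe.*) are assumptions; verify them against the chart’s values.yaml before applying:
helm -n <namespace from alert> upgrade <release name> <Magnolia chart> \
  --reuse-values \
  --set magnoliaPublic.livenessProbe.failureThreshold=6 \
  --set magnoliaPublic.livenessProbe.initialDelaySeconds=300 \
  --set magnoliaPublic.livenessProbe.periodSeconds=15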