Disable logging in `ClusterFormationFailureHelper` on shutdown. #125244

JeremyDahlgren · 2025-03-19T18:04:15Z

Modifies Coordinator to enable logging in ClusterFormationFailureHelper when started and disables logging in ClusterFormationFailureHelper when stopped. The warning scheduler handling and invariant check in the Coordinator are left as is, with the logging boolean set independently, eliminating the need to hold the mutex in doStop() when Coordinator.stop() is called when the Node is shutdown.

Closes #105559.

Modifies Coordinator to enable logging in ClusterFormationFailureHelper when started and disables logging in ClusterFormationFailureHelper when stopped. The warning scheduler handling and invariant check in the Coordinator are left as is, with the logging boolean set independently, eliminating the need to hold the mutex in doStop() when Coordinator.stop() is called when the Node is shutdown. Closes elastic#105559.

JeremyDahlgren · 2025-03-19T18:09:21Z

server/src/main/java/org/elasticsearch/cluster/coordination/ClusterFormationFailureHelper.java

+    public void setLoggingEnabled(boolean enabled) {
+        this.loggingEnabled = enabled;


Initially it seemed that a

synchronized(mutex) { clusterFormationFailureHelper.stop(); }

block in Coordinator::doStop() would be sufficient, but besides needing to hold the lock when the Node is being shut down, it seems a race condition could exist where the failure helper is stopped on shutdown but another thread immediately calls becomeCandidate() and starts it back up?

Also if calls to ClusterFormationFailureHelper start() and stop() were not synchronized it looks like there is the potential for a NullPointerException in start() if the warningScheduler is set to null between the assignment of a new WarningScheduler instance and the next line that calls scheduleNextWarning (Intellij warns about this):

public void start() { assert warningScheduler == null; warningScheduler = new WarningScheduler(); warningScheduler.scheduleNextWarning(); } public void stop() { warningScheduler = null; }

elasticsearchmachine · 2025-03-19T18:10:54Z

Hi @JeremyDahlgren, I've created a changelog YAML for you.

elasticsearchmachine · 2025-03-19T18:26:46Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

DaveCTurner

Neat solution, LGTM

…c#125244) Modifies Coordinator to enable logging in ClusterFormationFailureHelper when started and disables logging in ClusterFormationFailureHelper when stopped. The warning scheduler handling and invariant check in the Coordinator are left as is, with the logging boolean set independently, eliminating the need to hold the mutex in doStop() when Coordinator.stop() is called when the Node is shutdown. Closes elastic#105559. * Update docs/changelog/125244.yaml

elasticsearchmachine added the v9.1.0 label Mar 19, 2025

JeremyDahlgren commented Mar 19, 2025

View reviewed changes

JeremyDahlgren added Team:Distributed Coordination Meta label for Distributed Coordination team >bug :Distributed Coordination/Cluster Coordination Cluster formation and cluster state publication, including cluster membership and fault detection. labels Mar 19, 2025

Update docs/changelog/125244.yaml

ba1af37

JeremyDahlgren requested review from DaveCTurner and DiannaHohensee March 19, 2025 18:17

JeremyDahlgren marked this pull request as ready for review March 19, 2025 18:26

DaveCTurner approved these changes Mar 20, 2025

View reviewed changes

JeremyDahlgren merged commit a3dc918 into elastic:main Mar 20, 2025
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Disable logging in `ClusterFormationFailureHelper` on shutdown. #125244

Disable logging in `ClusterFormationFailureHelper` on shutdown. #125244

Uh oh!

JeremyDahlgren commented Mar 19, 2025 •

edited

Loading

Uh oh!

JeremyDahlgren Mar 19, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Mar 19, 2025

Uh oh!

elasticsearchmachine commented Mar 19, 2025

Uh oh!

DaveCTurner left a comment

Uh oh!

Uh oh!

Uh oh!

		public void setLoggingEnabled(boolean enabled) {
		this.loggingEnabled = enabled;

Disable logging in ClusterFormationFailureHelper on shutdown. #125244

Disable logging in ClusterFormationFailureHelper on shutdown. #125244

Uh oh!

Conversation

JeremyDahlgren commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JeremyDahlgren Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

elasticsearchmachine commented Mar 19, 2025

Uh oh!

elasticsearchmachine commented Mar 19, 2025

Uh oh!

DaveCTurner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Disable logging in `ClusterFormationFailureHelper` on shutdown. #125244

Disable logging in `ClusterFormationFailureHelper` on shutdown. #125244

JeremyDahlgren commented Mar 19, 2025 •

edited

Loading

JeremyDahlgren Mar 19, 2025 •

edited

Loading