These technical questions and issues came up during a recent OI 22.1 > 23.1 upgrade. The aim is to help other customers facing the same issues.
We're looking at the prerequisite steps for upgrading OI (DX Platform) on-premise from 22.1 to 23.1.
Issue #1
We're confused about the step "Back Up the Elasticsearch Cluster Snapshot"
1) What does this step do? It looks like it reconfigures es-utils to back up the ao_ indices that weren't previously being backed up via a snapshot. But when does that backup actually run? It seems we'd have to wait until 11 PM, so I'm unclear how this helps us if we're going to upgrade immediately.
Regarding Issue #1, you are right. The snapshot / backup steps are a precaution only. A file-system backup is generally not recommended for Elasticsearch.
2) One of the steps asks us to query the snapshots from ES. When we did this, the most recent ES snapshot was from Jan. 23rd. Looking at the es-utils pod log, we see this message repeatedly:
tail jarvis-es-utils.logs:
INFO [main] UtilityController:165 - Kron service has not started yet. Will try again in 5 sec.
INFO [main] UtilityController:165 - Kron service has not started yet. Will try again in 5 sec.
INFO [main] UtilityController:165 - Kron service has not started yet. Will try again in 5 sec.
3) When looking at the jarvis-kron pod it "seems" to be running.
Bottom line, I'm not sure a) what we should be doing, b) why, and c) whether their environment is healthy. In this regard, it seems it is not.
Finally, can't we get an ES backup by just backing up the ES directories on NFS via Linux rather than going through all of this?
i.e. this is only a precaution, right?
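For anyone who wants to check the snapshot state before upgrading, here is a minimal sketch in Python. It assumes the standard Elasticsearch "get snapshot" REST endpoint (GET /_snapshot/<repository>/_all) is reachable. The base URL and repository name are placeholders, and the function names are illustrative and not part of the product. It reports the newest successful snapshot and whether that snapshot includes any ao_ indices.

```python
import json
import urllib.request
from datetime import datetime, timezone


def summarize_snapshots(response, index_prefix="ao_"):
    """Summarize the newest SUCCESS snapshot in a get-snapshot API response.

    `response` is the parsed JSON body of GET /_snapshot/<repository>/_all.
    Returns None when no successful snapshot exists.
    """
    successful = [
        snap for snap in response.get("snapshots", [])
        if snap.get("state") == "SUCCESS"
    ]
    if not successful:
        return None
    newest = max(successful, key=lambda snap: snap["end_time_in_millis"])
    taken = datetime.fromtimestamp(
        newest["end_time_in_millis"] / 1000, tz=timezone.utc
    )
    return {
        "name": newest["snapshot"],
        "taken_utc": taken.isoformat(),
        # True if at least one index in the snapshot starts with the prefix.
        "has_prefix_indices": any(
            index.startswith(index_prefix)
            for index in newest.get("indices", [])
        ),
    }


def fetch_snapshots(base_url, repository):
    """Fetch all snapshots of one repository (both arguments are placeholders)."""
    url = f"{base_url}/_snapshot/{repository}/_all"
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)
```

If the summary shows an old snapshot, or one without ao_ indices, that matches the symptom described above: the scheduled backup has not run since es-utils began waiting for the Kron service.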
Issue #2
nfs-migration.sh - I'm not sure what this is doing or why we need to make a whole copy of "some" of the data. Is this still needed? It doesn't make sense to me, and they might not have enough disk space for it.
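On the disk-space concern, a rough pre-check can be sketched as follows. This is only an illustration. Which directories nfs-migration.sh actually copies is not stated here, so the source and target paths are assumptions you would need to supply, and the 20% headroom factor is an arbitrary safety margin.

```python
import os
import shutil


def directory_size_bytes(path):
    """Sum the sizes of all regular files under `path` (symlinks skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full_path = os.path.join(root, name)
            if not os.path.islink(full_path):
                total += os.path.getsize(full_path)
    return total


def has_room_for_copy(source_dir, target_dir, headroom=1.2):
    """Check whether `target_dir`'s filesystem can hold a copy of `source_dir`.

    Returns (fits, bytes_needed, bytes_free). `headroom` is an assumed
    safety multiplier, not a value taken from the product documentation.
    """
    needed = int(directory_size_bytes(source_dir) * headroom)
    free = shutil.disk_usage(target_dir).free
    return free >= needed, needed, free
```

Calling has_room_for_copy("<source on NFS>", "<target on NFS>") before running the script gives an early warning if the copy is unlikely to fit.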