Atomic Blocks and Durability Controls (MoonBit)
Overview
Golem provides automatic durable execution — all agents are durable by default. These APIs are advanced controls that most agents will never need. Only use them when you have specific requirements around persistence granularity, idempotency, or atomicity.
The high-level APIs are in the @api package (golemcloud/golem_sdk/api). Import it with an alias like @api. The types PersistenceLevel and RetryPolicy are re-exported from this package.
Atomic Operations
Group external, observable side effects (HTTP calls, calls to other agents, file/network I/O) so that on a crash the whole group is replayed together. If the agent fails partway through the block, recovery will re-execute the entire block from the start instead of resuming from the middle — so any external effects performed before the crash will be performed again.
What this is NOT.
with_atomic_operationis not an STM/transaction primitive and not for grouping in-memory state mutations. Golem agents are single-threaded, and in-memory state is automatically rebuilt by oplog replay on recovery, so wrapping plain in-memory updates in an atomic block does nothing useful. The terminology overlaps with Haskell STM, database transactions, andsynchronizedblocks, but the semantics are different: this is purely about how durable, externally-observable effects are re-executed across a crash boundary.It is also NOT how you reduce oplog size or speed up recovery. Despite the description’s mention of “persistence” and oplog-related controls,
with_atomic_operation/persistence-level/idempotency-mode APIs do not shrink the oplog or skip replay. If your concern is that the oplog is growing too large or recovery/replay is becoming slow (long-running agents, heartbeats, polling, recurring tasks), use snapshot-based recovery instead — seegolem-custom-snapshot-moonbit. You cannot opt out of oplog writes for a durable agent.Use it only when you have two or more external side effects that must not be left in a “first one happened, second one didn’t” state across a recovery.
Good use case — two external calls that must replay together. Use with_atomic_operation for automatic lifecycle management:
// Reserve inventory and charge the customer — if we crash between them,
// we want recovery to re-run BOTH calls, not skip the reservation.
fn place_order(item_id : String, qty : Int, customer : String, price : Double) -> Order {
@api.with_atomic_operation(fn() {
let reservation = inventory_api.reserve(item_id, qty)
let charge = payment_api.charge(customer, price)
Order::{ reservation, charge }
})
}For manual control, use mark_begin_operation / mark_end_operation:
fn place_order(item_id : String, qty : Int, customer : String, price : Double) -> Order {
let begin = @api.mark_begin_operation()
let reservation = inventory_api.reserve(item_id, qty)
let charge = payment_api.charge(customer, price)
@api.mark_end_operation(begin)
Order::{ reservation, charge }
}Bad use case — pure in-memory updates that already replay deterministically:
// DON'T do this. Wrapping in-memory mutations adds nothing — the oplog
// already rebuilds `state.balance` and `state.last_tx` deterministically.
@api.with_atomic_operation(fn() {
state.balance = state.balance - amount
state.last_tx = now
})Persistence Level Control
Adjust how the oplog is interpreted for a section of code. Setting the level to PersistNothing does not disable oplog recording — entries are still written, but they are treated only as an observable log and are not used for replay. On recovery, the side effects are not re-executed and not replayed; if the block naively runs the same side effects during replay, recovery will fail.
This is not a knob for application code. Its primary use case is authoring Golem-specific libraries that implement their own custom durability on top of raw side effects. Code inside such a block must:
- Explicitly check whether the agent is in live or replay mode (via
current_durable_execution_state().is_live). - Skip the raw side effects during replay.
- Use the durability APIs (
begin_durable_function/end_durable_function,persist_durable_function_invocation,read_persisted_durable_function_invocation) to record/recover state in a custom way.
Use with_persistence_level for automatic save/restore:
fn do_fast_work() -> Unit {
@api.with_persistence_level(@api.PersistenceLevel::PersistNothing, fn() {
// Oplog entries here are observable only, never used for replay.
// The block MUST check live vs replay mode and use custom durability
// primitives — naively running side effects will break recovery.
do_idempotent_work()
})
}For manual control, use get_oplog_persistence_level / set_oplog_persistence_level:
let original = @api.get_oplog_persistence_level()
@api.set_oplog_persistence_level(@api.PersistenceLevel::PersistNothing)
do_idempotent_work()
@api.set_oplog_persistence_level(original)PersistenceLevel Variants
| Variant | Behavior |
|---|---|
PersistNothing | Oplog entries are still written but only as an observable log; not used for replay. The block must implement custom durability — naive replay will fail |
PersistRemoteSideEffects | Only remote side effects are persisted for replay |
Smart | Default — Golem decides what to persist |
Idempotence Mode
Default:
true. Every outgoing HTTP request — includingPOST,PUT,PATCH, andDELETE— is treated as idempotent. This means status-code-keyed retry policies (seegolem-retry-policies-moonbit) already work out of the box forPOSTrequests. You do not need to wrap aPOSTin@api.with_idempotence_mode(true, ...)to make it retriable on a 5xx — that is the default.
Use @api.with_idempotence_mode(false, ...) only when you need to opt out for a specific
call. The flag controls how WriteRemote host functions are replayed when their previous
attempt’s outcome is unknown after a crash:
true(default): assume the previous attempt succeeded; do not re-invoke on replay. Combined with the host-side retry machinery, the request can be transparently re-sent when a matching retry policy fires.false: do not assume success; the worker traps so a higher-level retry decides what to do. Use this for non-idempotent side effects whose accidental duplication would be more harmful than missing the call entirely.
// Opt OUT of the default — at-most-once semantics for this non-idempotent call.
fn make_payment(amount : Double) -> String {
@api.with_idempotence_mode(false, fn() {
charge_payment(amount)
})
}For manual control:
@api.set_idempotence_mode(false)
let result = charge_payment(amount)
@api.set_idempotence_mode(true)Use @api.get_idempotence_mode() to read the current setting.
Oplog Commit
Wait until the oplog is replicated to a specified number of replicas before continuing:
// Ensure oplog is replicated to 3 replicas before proceeding
@api.oplog_commit(b'\x03')The argument is the desired replica count (Byte type).
Idempotency Key Generation
Generate a durable idempotency key that persists across agent restarts — safe for payment APIs and other exactly-once operations:
let key = @api.generate_idempotency_key()
// key is a @types.Uuid — use it with external APIs for exactly-once processing
// Or get it as a string directly:
let key_str = @api.generate_idempotency_key_string()Retry Policy
Override the default retry policy for a block of code. Use with_retry_policy for scoped control:
fn do_flaky_work() -> Unit {
@api.with_retry_policy(
@api.RetryPolicy::{
max_attempts: 5U,
min_delay: 100UL, // milliseconds
max_delay: 5000UL, // milliseconds
multiplier: 2.0,
max_jitter_factor: Some(0.1),
},
fn() { do_flaky_operation() },
)
}For manual control, use get_retry_policy / set_retry_policy:
let original = @api.get_retry_policy()
@api.set_retry_policy(@api.RetryPolicy::{
max_attempts: 5U,
min_delay: 100UL,
max_delay: 5000UL,
multiplier: 2.0,
max_jitter_factor: Some(0.1),
})
do_flaky_operation()
@api.set_retry_policy(original)Import Path
Add the api package to your moon.pkg imports:
import {
"golemcloud/golem_sdk/api" @api,
}All durability, persistence, idempotency, retry, and oplog APIs are available from @api.
Low-Level Durability API
For advanced use cases (e.g., manually persisting function invocations for replay), the raw durability APIs are available in golemcloud/golem_sdk/interface/golem/durability/durability:
begin_durable_function(function_type)/end_durable_function(function_type, begin_index, forced_commit)current_durable_execution_state()— returnsDurableExecutionState { is_live, persistence_level }persist_durable_function_invocation(function_name, request, response, function_type)read_persisted_durable_function_invocation()
These are rarely needed — prefer the @api wrappers above.