Actual sequences design as of 26-01-09

Historical context

Previous design (before April 2025)

The system previously used multiple separate sequences for different record types:

CRecordIDSequence - For CRecords (Collection Records)
- Started from: 322685000131072
- Purpose: Efficient CRecord caching on DBMS side (most CRecords in same partition)
- Enabled simple iteration over CRecords
OWRecordIDSequence - For ORecords/WRecords (O/W Records)
- Started from: 322680000131072
- Purpose: Sequential IDs for O/W Records
WLogOffsetSequence - For WLog offsets
- Started from: 1
PLogOffsetSequence - For PLog offsets
- Started from: 1

The problem

As documented on March 1, 2025, this design had critical issues:

Record ID sequences could overlap
Only 5,000,000,000 IDs available for OWRecords before colliding with CRecord IDs
OWRecord IDs started at 322680000131072
CRecord IDs started at 322685000131072
Gap between sequences: only 5 billion IDs

The change (April 29, 2025)

Commit de5532b17a255f1eaac84bf72187283502d69bda - PR #3620 addressing issue #3600:

"#3600 one sequence for all records"

The system was refactored to use a single unified sequence:

QNameRecordIDSequence - Single sequence for ALL record IDs
- Replaces both QNameCRecordIDSequence and QNameOWRecordIDSequence
- Starts from: FirstUserRecordID = 200001
- Much more human-readable IDs
- Simpler Command Processor implementation

Decision rationale:

Pros:

Clean for Voedger users
IDs are more human-readable (200001, 200002, ... vs 322685000131072, ...)
Simpler Command Processor
No collision risk

Cons:

CRecords are not cached as efficiently (accepted tradeoff)

Current sequences

Only 2 sequences remain:

QNameRecordIDSequence - All record IDs (starts from 200001)
QNameWLogOffsetSequence - WLog offsets (starts from 1)

Note: PLog offset is managed per partition in appPartition.nextPLogOffset, not as a separate sequence.

Architecture overview

The system uses in-memory per-workspace state with PLog-based recovery. There are three main components:

appPartition - One per partition, tracks PLog offset
workspace - One per WSID, tracks WLog offset and ID generator
IIDGenerator - Simple incrementing ID generator per workspace

Data structures

type appPartition struct {
	workspaces     map[istructs.WSID]*workspace
	nextPLogOffset istructs.Offset
}

type workspace struct {
	NextWLogOffset istructs.Offset
	idGenerator    istructs.IIDGenerator
}

ID generator implementation

type implIIDGenerator struct {
	nextRecordID istructs.RecordID
	onNewID      func(rawID, storageID istructs.RecordID) error
}

func (g *implIIDGenerator) NextID(rawID istructs.RecordID) (storageID istructs.RecordID, err error) {
	storageID = g.nextRecordID
	g.nextRecordID++
	// ...
}

Initialization flow

On first access to a workspace:

func (ap *appPartition) getWorkspace(wsid istructs.WSID) *workspace {
	ws, ok := ap.workspaces[wsid]
	if !ok {
		ws = &workspace{
			NextWLogOffset: istructs.FirstOffset,
			idGenerator:    istructsmem.NewIDGenerator(),
		}
		ap.workspaces[wsid] = ws
	}
	return ws
}

Initial values:

NextWLogOffset = istructs.FirstOffset (which is 1)
nextRecordID = istructs.FirstUserRecordID (which is 200001)

Recovery flow

On partition restart, the entire PLog is scanned:

cb := func(plogOffset istructs.Offset, event istructs.IPLogEvent) (err error) {
	ws := ap.getWorkspace(event.Workspace())

	for rec := range event.CUDs {
		if rec.IsNew() {
			ws.idGenerator.UpdateOnSync(rec.ID())
		}
	}
	// ... handle ODoc IDs ...
	ws.NextWLogOffset = event.WLogOffset() + 1
	ap.nextPLogOffset = plogOffset + 1
	// ...
}

err := cmd.appStructs.Events().ReadPLog(ctx, cmd.cmdMes.PartitionID(), 
	istructs.FirstOffset, istructs.ReadToTheEnd, cb)

The UpdateOnSync method updates the generator's next ID:

func (g *implIIDGenerator) UpdateOnSync(syncID istructs.RecordID) {
	if syncID >= g.nextRecordID {
		g.nextRecordID = syncID + 1
	}
	// ...
}

Command processing flow

1. Get workspace:

func (cmdProc *cmdProc) getWorkspace(_ context.Context, cmd *cmdWorkpiece) (err error) {
	cmd.workspace = cmd.appPartition.getWorkspace(cmd.cmdMes.WSID())
	return nil
}

2. Build raw event with current offsets:

func (cmdProc *cmdProc) getRawEventBuilder(_ context.Context, cmd *cmdWorkpiece) (err error) {
	grebp := istructs.GenericRawEventBuilderParams{
		HandlingPartition: cmd.cmdMes.PartitionID(),
		Workspace:         cmd.cmdMes.WSID(),
		QName:             cmd.cmdQName,
		RegisteredAt:      istructs.UnixMilli(cmdProc.time.Now().UnixMilli()),
		PLogOffset:        cmd.appPartition.nextPLogOffset,
		WLogOffset:        cmd.workspace.NextWLogOffset,
	}
	// ...
}

3. Generate IDs for new records:

func (cud *cudType) regenerateIDsPlan(generator istructs.IIDGenerator) (newIDs newIDsPlanType, err error) {
	plan := make(newIDsPlanType)
	for _, rec := range cud.creates {
		id := rec.ID()
		if !id.IsRaw() {
			generator.UpdateOnSync(id)
			continue
		}

		var storeID istructs.RecordID

		if singleton, ok := rec.typ.(appdef.ISingleton); ok && singleton.Singleton() {
			storeID, err = cud.appCfg.singletons.ID(rec.QName())
		} else {
			storeID, err = generator.NextID(id)  // <-- Simple increment
		}
		// ...
	}
	return plan, nil
}

4. Write to PLog and increment offset:

func (cmdProc *cmdProc) putPLog(_ context.Context, cmd *cmdWorkpiece) (err error) {
	if cmd.pLogEvent, err = cmd.appStructs.Events().PutPlog(cmd.rawEvent, nil, cmd.idGeneratorReporter); err != nil {
		cmd.appPartitionRestartScheduled = true
	} else {
		cmd.appPartition.nextPLogOffset++
	}
	return
}

5. Write to WLog and increment offset:

err = cmd.appStructs.Events().PutWlog(cmd.pLogEvent)
if err != nil {
	cmd.appPartitionRestartScheduled = true
} else {
	cmd.workspace.NextWLogOffset++
}

Key constants

const NullRecordID = RecordID(0)
const MinRawRecordID = RecordID(1)
const MaxRawRecordID = RecordID(0xffff)  // 65535

const MinReservedRecordID = MaxRawRecordID + 1  // 65536
const MaxReservedRecordID = RecordID(200000)

const FirstSingletonID = MinReservedRecordID  // 65536
const MaxSingletonID = FirstSingletonID + 0x1ff  // 66047

const FirstUserRecordID = MaxReservedRecordID + 1  // 200001

Summary

Current design characteristics:

Single sequence for all records (since April 2025)
- Replaced separate CRecordIDSequence and OWRecordIDSequence
- Starts from 200001 (human-readable)
- No collision risk
All workspace state is in memory (map of workspaces per partition)
Simple increment for ID generation (no complex sequencer logic)
Full PLog scan on recovery to rebuild all workspace states
No persistent sequence storage - everything derived from PLog
No batching or caching - direct increment per request
Memory grows with number of active workspaces per partition
Recovery time grows with PLog size

Historical note:

Before April 2025, the system used separate sequences for CRecords (starting at 322685000131072) and OWRecords (starting at 322680000131072), which created a collision risk with only 5 billion IDs available for OWRecords. The unified sequence design eliminated this problem while simplifying the implementation.

Future evolution:

This is fundamentally different from the sequencer design documented in sequences.md, which was intended to solve scalability issues (memory growth, recovery time) but has not been integrated. The current simple design prioritizes correctness and simplicity over performance optimization.

Historical context​

Previous design (before April 2025)​

The problem​

The change (April 29, 2025)​

Current sequences​

Architecture overview​

Data structures​

ID generator implementation​

Initialization flow​

Recovery flow​

Command processing flow​

Key constants​

Summary​