Struct SchbenchConfig

Source

pub struct SchbenchConfig {
    pub message_threads: usize,
    pub worker_threads: usize,
    pub cache_footprint_kib: usize,
    pub operations: usize,
    pub sleep_usec: u64,
    pub skip_locking: bool,
    pub requests_per_sec: usize,
    pub auto_rps: usize,
    pub split_percent: Option<usize>,
    pub pipe_transfer_bytes: usize,
}

Expand description

User-facing config for the Schbench workload. Declarative config for the Schbench workload. Construct via SchbenchConfig::default (schbench’s own defaults) plus the chainable setters, e.g. SchbenchConfig::default().message_threads(2).worker_threads(4). Derives Clone/Debug/PartialEq/Eq/Hash/serde; the builder shape follows WorkloadConfig, but Eq+Hash (which WorkloadConfig and WorkSpec omit because of their transitive f64) are available here since every field is integer/bool – the ktstr f64-free leaf-config convention.

§schbench(8) CLI parity

This port re-expresses schbench’s default (matrix-work) mode natively, so its tunables are config fields and topology rather than CLI flags. The mapping to schbench’s option table (schbench.c:138-187):

schbench flag	ktstr
`-m` message-threads	`message_threads`
`-t` threads	`worker_threads` (workers per message thread; `0` = `ceil(cpuset_cpus / message_threads)`, see below)
`-F` cache_footprint	`cache_footprint_kib`
`-n` operations	`operations`
`-s` sleep_usec	`sleep_usec`
`-L` no-locking	`skip_locking`
`-R` rps	`requests_per_sec`
`-A` auto-rps	`auto_rps`
`--split` (long-only)	`split_percent` (`None` = no split, all-private)
`-p` pipe (also `--pipe`)	`pipe_transfer_bytes` (`0` = off; memory-transfer mode, no matrix work)
`-r` runtime	in-VM: the scenario engine’s run window (the engine runs until `stop`); host-side: the `run_secs` argument to `run_standalone`

§Set by ktstr topology, not a flag

-t default: with worker_threads = 0, ktstr matches schbench’s -t 0-default – it divides the CPU count across the message threads, ceil(cpus / message_threads) per thread (schbench.c:1849-1852), so the total worker count stays near the CPU count. ktstr scopes “cpus” to the allocated guest cpuset (the worker’s sched_getaffinity mask, set by the scenario’s topology / CgroupDef) rather than schbench’s get_nprocs, so the total is ≈ the cpuset’s CPU count. An explicit non-zero worker_threads is workers-per-message-thread in both.
-M (message-cpus) / -W (worker-cpus) thread pinning: ktstr places threads through its affinity / cpuset layer, so there is deliberately no per-thread-pin knob.

§Observability flags -> the metric API

schbench’s -w (warmuptime), -i (intervaltime), -z (zerotime), -j (json), and -J (jobname) shape its streaming stderr/JSON report. ktstr’s numbers flow through the metric API instead – per-phase attribution and the sidecar – so these have no flag equivalent. ktstr-schbench-validate reproduces schbench’s stderr-table shape for a side-by-side comparison.

§Split mode (`--split`)

Some(p) partitions cache_footprint_kib into a per-thread private matrix (p%) and ONE process-global shared matrix (100-p%) that every worker multiplies into concurrently, reproducing schbench’s cross-core shared-working-set cache contention (schbench.c:1390-1404, :1858-1863). ktstr models the shared matrix with AtomicU64 Relaxed accesses. Like schbench’s emitted code, the shared kernel keeps the running sum in a register and STORES it to each shared C cell on every inner (k) iteration – C is write-only in the loop (A and B are loaded each k, C is never reloaded), and that per-k store is what generates the contention. Both gcc and clang keep the per-k store: do_some_math reads m1/m2/m3 as offsets into one base pointer, so neither can prove the m3 store doesn’t alias the next k’s m1/m2 loads. On x86-64 a Relaxed load/store lowers to a plain MOV (no LOCK), so the contention is identical to schbench’s plain shared-memory race – but sound (atomics, no data race), with zero unsafe. None (default) is the legacy all-private single matrix.

§Pipe mode (`-p`)

pipe_transfer_bytes > 0 REPLACES the matrix workload with schbench’s memory-transfer simulation (schbench.c:177, pipe_test). It rides the message-handshake path: the message thread memsets each woken worker’s per-thread page to 1 (schbench.c:980-981) and the worker memsets its own page to 2 before blocking (schbench.c:1003-1004), pipe_transfer_bytes bytes each per handshake cycle (clamped to 1 MiB, PIPE_TRANSFER_BUFFER). do_work and the think-sleep are skipped (schbench.c:1448), so the only per-cycle work is the wakeup handshake + the two memsets; the report is the PER-WORKER memory-transfer throughput (avg worker transfer = the aggregate rate divided by the worker count, schbench.c:1697,1942-1943,1979) alongside the wakeup-latency table, not request latency.

ktstr does NOT compose -p with -R: in pipe mode it always runs the message-handshake waker (so BOTH pipe memsets fire — a full transfer) and never starts the RPS injector. schbench instead COMPOSES them, half-broken: it has no precedence (-R alone picks the waker, schbench.c:1594), so -p -R runs the RPS injector while the worker-side memset still fires unconditionally (schbench.c:1003-1004) but the waker-side memset — which lives only in xlist_wake_all (schbench.c:980-981) — does not, yielding a degenerate half-pipe. ktstr’s full pipe is the more faithful -p behavior; the realistic use is -p without -R. schbench also zeroes warmuptime in pipe mode (schbench.c:296); ktstr has no warmuptime concept, so that is a no-op here.

§Modes not ported

-C (calibrate): a tuning aid that times schbench’s own work loop and forces -L (schbench.c:166, :389). Intentionally out of scope – ktstr measures through the metric path.

Fields§

§message_threads: usize

Number of message threads (schbench.c -m, default 1).

§worker_threads: usize

Worker threads per message thread (schbench.c -t). 0 resolves to ceil(cpuset_cpus / message_threads) – the CPU count of the allocated guest cpuset (the worker’s sched_getaffinity mask, per ruling) divided across the message threads, matching schbench’s 0-default (schbench.c:1849-1852) scoped to the cpuset rather than get_nprocs. See resolve_worker_count and the CLI-parity section above.

§cache_footprint_kib: usize

Per-worker matrix cache footprint in KiB (schbench.c -F, default 256); sets the matrix dimension.

§operations: usize

Matrix multiplications per work cycle (schbench.c -n, default 5).

§sleep_usec: u64

Think-time sleep before the matrix work, microseconds (schbench.c -s, default 100); simulates networking. 0 disables.

§skip_locking: bool

Skip the per-CPU lock around the matrix work (schbench.c -L, default false: locking on).

§requests_per_sec: usize

Fixed request rate, requests/second (schbench.c -R, default 0 = off). 0 selects the default message-handshake mode (each worker is woken by its message thread); non-zero switches to the RPS-injector mode, where a dedicated thread enqueues requests_per_sec requests/second round-robin across the workers (schbench.c run_rps_thread, :1258).

§auto_rps: usize

Auto-RPS target CPU-busy percentage (schbench.c -A, default 0 = off). Non-zero turns on closed-loop rate control: a once-per-second control thread grows/shrinks the live request rate toward this host-busy% target (schbench.c auto_scale_rps, :1180). Setting it seeds the rate to 10 when requests_per_sec is 0 (schbench.c:286), so auto-RPS starts low and climbs.

§split_percent: Option<usize>

Percent of the cache footprint that is PRIVATE per worker thread (schbench.c --split, long-only, 0-100). None = no split: schbench’s legacy all-private single matrix (schbench.c:1405-1408, :1879-1880). Some(p) partitions cache_footprint_kib into a per-thread private matrix (p%) and ONE process-global shared matrix (100-p%) that every worker multiplies into concurrently, reproducing schbench’s cross-core shared-working-set cache contention (schbench.c:1390-1404, :1858-1863). Some(0) = all shared, Some(100) = all private (same matrix sizes as None, but routed through the split branch, matching schbench’s split_specified path). Out-of-range Some(p > 100) panics when the engine consumes it (schbench.c:362-365 exits on the same); the builder also debug-asserts the bound.

§pipe_transfer_bytes: usize

Pipe-mode transfer size in bytes (schbench.c -p/--pipe, default 0 = off, clamped to 1 MiB PIPE_TRANSFER_BUFFER). Non-zero REPLACES the matrix workload with schbench’s memory-transfer simulation: the message thread memsets each woken worker’s per-thread page to 1 and the worker memsets its own page to 2 (schbench.c:980-981/:1003-1004), pipe_transfer_bytes bytes each per cycle, while do_work + the think-sleep are skipped (schbench.c:1448). Reports PER-WORKER memory-transfer throughput (avg worker transfer) rather than request latency.

Struct SchbenchConfig Copy item path

§schbench(8) CLI parity

§Set by ktstr topology, not a flag

§Observability flags -> the metric API

§Split mode (--split)

§Pipe mode (-p)

§Modes not ported

Fields§

Implementations§

impl SchbenchConfig

pub fn message_threads(self, n: usize) -> Self

pub fn worker_threads(self, n: usize) -> Self

pub fn cache_footprint_kib(self, kib: usize) -> Self

pub fn operations(self, n: usize) -> Self

pub fn sleep_usec(self, usec: u64) -> Self

pub fn skip_locking(self, skip: bool) -> Self

pub fn requests_per_sec(self, rps: usize) -> Self

pub fn auto_rps(self, target_pct: usize) -> Self

pub fn split_percent(self, percent: Option<usize>) -> Self

pub fn pipe_transfer_bytes(self, bytes: usize) -> Self

Trait Implementations§

impl Clone for SchbenchConfig

fn clone(&self) -> SchbenchConfig

fn clone_from(&mut self, source: &Self)

impl Debug for SchbenchConfig

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for SchbenchConfig

fn default() -> Self

impl<'de> Deserialize<'de> for SchbenchConfig

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

impl Hash for SchbenchConfig

fn hash<__H: Hasher>(&self, state: &mut __H)

fn hash_slice<H>(data: &[Self], state: &mut H)where H: Hasher, Self: Sized,

impl PartialEq for SchbenchConfig

fn eq(&self, other: &SchbenchConfig) -> bool

fn ne(&self, other: &Rhs) -> bool

impl Serialize for SchbenchConfig

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

impl Eq for SchbenchConfig

impl StructuralPartialEq for SchbenchConfig

Auto Trait Implementations§

impl Freeze for SchbenchConfig

impl RefUnwindSafe for SchbenchConfig

impl Send for SchbenchConfig

impl Sync for SchbenchConfig

impl Unpin for SchbenchConfig

impl UnwindSafe for SchbenchConfig

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<Q, K> Equivalent<K> for Qwhere Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

impl<Q, K> Equivalent<K> for Qwhere Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

impl<Q, K> Equivalent<K> for Qwhere Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

impl<Q, K> Equivalent<K> for Qwhere Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>where F: FnOnce(&Self) -> bool,

impl<T> Pointable for T

const ALIGN: usize

type Init = T

unsafe fn init(init: <T as Pointable>::Init) -> usize

unsafe fn deref<'a>(ptr: usize) -> &'a T

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Struct SchbenchConfig

§Split mode (`--split`)

§Pipe mode (`-p`)

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

fn hash<H: Hasher>(&self, state: &mut H)

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> PolicyExt for T
where T: ?Sized,

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

impl<T> MaybeSend for T
where T: Send,

impl<T> MaybeSend for T
where T: Send,