AluWidth

Enum AluWidth 

Source
pub enum AluWidth {
    Scalar,
    Vec128,
    Vec256,
    Vec512,
    Amx,
    Widest,
}
Expand description

ALU/SIMD execution width for WorkType::AluHot.

Selects the widest data-path the worker exercises per multiply chain. Today every variant executes the same scalar four-stream multiply chain — the width selector is preserved on the wire so a downstream classifier can distinguish runs that requested SIMD from runs that requested scalar even though the dispatch is uniform. Wider variants WILL drive more functional-unit pressure and (for AVX-512 / AMX) draw the package into a frequency-throttled mode the kernel scheduler must observe once SIMD intrinsics land per-arm. The serde wire form is snake_case ("scalar", "vec128", "vec256", "vec512", "amx", "widest").

§Current behaviour

All widths run the same four-stream scalar multiply path; the width selector is preserved on the wire (the WorkType::AluHot / WorkPhase::AluHot config carries width) so a downstream classifier can distinguish runs that requested SIMD from runs that requested scalar even though the dispatch is uniform.

§Default semantics

Scalar is the type-level Rust default (the #[derive(Default)] fallback that serde uses when an AluWidth field is missing on the wire — keeps backward- compat for older capture data). Widest is the workload-level default the super::defaults::ALU_HOT_WIDTH constant resolves at runtime via resolve_alu_width: tests that take WorkType::from_name("AluHot") get the host’s widest available data-path, not the type-level scalar fallback. The asymmetry is deliberate — type-level Default favours “always available everywhere”; workload-level default favours “stress the host as hard as it can run.”

§Resolution rules

Widest is a runtime-resolved sentinel: at worker entry the dispatch arm probes the host CPU via std::is_x86_feature_detected! (x86_64) and picks the widest available variant in the order Amx > Vec512 > Vec256 > Vec128 > Scalar. On aarch64 only Scalar and Vec128 (NEON) are available; Vec256 / Vec512 / Amx are absent and Widest resolves to NEON when present, falling back to Scalar. A configured value that the host cannot run is downgraded to the next-widest available variant with a one-shot tracing::warn! so the test still produces useful telemetry rather than hard-failing — silent downgrade without the warn would mask the host capability gap.

§Frequency throttle on x86_64

On Intel client / server SKUs the AVX-512 license raises the per-core voltage and lowers the all-core turbo for the package; running Vec512 workers under one scheduler while other workers run under another biases the comparison because the throttle is package-wide, not per-task. Tests that A/B-compare schedulers under Vec512 or Amx need the runs serialized on the same package — the framework does not currently coordinate this serialization across worker groups.

Variants§

§

Scalar

64-bit scalar integer multiply chain. Drives the integer pipeline only; no SIMD or AVX licensing involved. Available on every supported architecture.

§

Vec128

128-bit vector integer multiply chain (SSE2 on x86_64, NEON on aarch64). The widest baseline both architectures support; a reasonable default when the test cares about “vectorized ALU” without architecture-specific tuning.

§

Vec256

256-bit vector integer multiply chain (AVX2 on x86_64). Not available on aarch64 — falls back to Vec128 (NEON) at worker entry with a one-shot warn.

§

Vec512

512-bit vector integer multiply chain (AVX-512F on x86_64). Triggers the package-wide frequency throttle described above. Not available on aarch64 — falls back to Vec128 (NEON) at worker entry.

§

Amx

AMX tile multiply chain (x86_64 server SKUs with AMX-INT8 or AMX-BF16). The widest data-path on x86_64; uses XFD gating in the kernel (the first AMX instruction raises a #NM trap that arch/x86/kernel/traps.c::handle_xfd_event handles, calling arch/x86/kernel/fpu/xstate.c::__xfd_enable_feature to allocate the dynamic XSAVE area) so the kernel allocates the dynamic XSAVE area lazily — adds a one-time per-task latency spike on first use.

AMX additionally requires prctl(ARCH_REQ_XCOMP_PERM, XFEATURE_XTILE_DATA) per process before the first AMX instruction; the framework does NOT issue this prctl, so AMX is not yet runnable. resolve_alu_width therefore downgrades AluWidth::Amx to the host’s widest stable-detectable variant; AMX is not currently runnable end-to-end on this framework.

Not available on aarch64 — falls back to Vec128.

§

Widest

Resolve to the widest variant the host supports at worker entry. See the type-level doc for the resolution order. Useful as a default when the test author wants “as much ALU pressure as the host can sustain” without hardcoding an architecture or feature level.

Trait Implementations§

Source§

impl Clone for AluWidth

Source§

fn clone(&self) -> AluWidth

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for AluWidth

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Default for AluWidth

Source§

fn default() -> AluWidth

Returns the “default value” for a type. Read more
Source§

impl<'de> Deserialize<'de> for AluWidth

Source§

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more
Source§

impl Hash for AluWidth

Source§

fn hash<__H: Hasher>(&self, state: &mut __H)

Feeds this value into the given Hasher. Read more
1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher. Read more
Source§

impl PartialEq for AluWidth

Source§

fn eq(&self, other: &AluWidth) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl Serialize for AluWidth

Source§

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>
where __S: Serializer,

Serialize this value into the given Serde serializer. Read more
Source§

impl Copy for AluWidth

Source§

impl Eq for AluWidth

Source§

impl StructuralPartialEq for AluWidth

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
§

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

§

fn equivalent(&self, key: &K) -> bool

Checks if this value is equivalent to the given key. Read more
§

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

§

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.
§

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

§

fn equivalent(&self, key: &K) -> bool

Checks if this value is equivalent to the given key. Read more
§

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

§

fn equivalent(&self, key: &K) -> bool

Checks if this value is equivalent to the given key. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T> Instrument for T

§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided [Span], returning an Instrumented wrapper. Read more
§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
§

impl<T> Pointable for T

§

const ALIGN: usize

The alignment of pointer.
§

type Init = T

The type for initializers.
§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
§

impl<T> PolicyExt for T
where T: ?Sized,

§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] only if self and other return Action::Follow. Read more
§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] if either self or other returns Action::Follow. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

§

fn vzip(self) -> V

§

impl<T> WithSubscriber for T

§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a [WithDispatch] wrapper. Read more
Source§

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

§

impl<T> MaybeSend for T
where T: Send,

§

impl<T> MaybeSend for T
where T: Send,