Appendix I: Dataset Conformance Testing (Informative)

This appendix documents systematic conformance testing performed against a large-scale, multi-modal autonomous driving dataset. The results validated the completeness and expressiveness of the SpatialDDS 1.5 sensing and semantics profiles and directly informed several normative additions to this specification.

Motivation

Sensor-data specifications risk drifting away from real-world workloads when they are designed in isolation. To guard against this, the SpatialDDS 1.5 extension profiles (Appendix D) were validated against a well-known public reference dataset -- the nuScenes autonomous driving corpus -- which exercises three sensor modalities simultaneously (camera, lidar, radar) together with semantic annotations and shared conventions (coordinate frames, quaternion order, timing).

The goal was not to certify a particular dataset but to answer a concrete question: Can every field, enum, and convention in the nuScenes schema be losslessly mapped to SpatialDDS 1.5 IDL without workarounds or out-of-band agreements?

Reference Dataset

nuScenes (Motional / nuTonomy) is a multimodal dataset containing:

Dimension	Value
Scenes	1,000 (20 s each)
Cameras	6 surround-view (FRONT, FRONT_LEFT, FRONT_RIGHT, BACK, BACK_LEFT, BACK_RIGHT)
Lidar	1 x 32-beam spinning (Velodyne HDL-32E), ~34 k points/scan
Radar	5 x Continental ARS 408 (FRONT, FRONT_LEFT, FRONT_RIGHT, BACK_LEFT, BACK_RIGHT)
3D annotations	1.4 M oriented bounding boxes, 23 object classes
Annotation metadata	visibility tokens, attribute tokens, per-box lidar/radar point counts
Coordinate convention	Right-handed; quaternions in (w, x, y, z) order

The dataset was chosen because it stresses sensor diversity (six cameras, five radar units, one lidar), per-detection radar fields rarely found in other corpora (compensated velocity, dynamic property, RCS), and rich annotation metadata (visibility, attributes, evidence counts).

Methodology

A conformance harness was constructed as a Python script that:

Mirrors the SpatialDDS 1.5 IDL as Python data structures (enum values, struct field lists, normative prose flags).
Mirrors the nuScenes schema as synthetic data (camera names, radar point fields, lidar layout, annotation fields).
Runs 27 targeted checks across five modalities, each producing a verdict:

Verdict	Meaning
PASS	nuScenes field maps losslessly to an existing SpatialDDS type or enum value.
GAP	A mapping exists but requires additional prose, an enum value, or a convention note.
MISSING	No SpatialDDS construct exists for the nuScenes field; a new IDL member is needed.

Reports a per-modality scorecard comparing current results against a baseline.

The harness does not require network access, a DDS runtime, or the actual nuScenes database. It operates as a static schema-vs-schema dry run, so it can be reproduced in any CI environment that provides a Python 3 interpreter.
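
The verdict logic described above can be sketched as follows. This is a minimal illustration, not the actual harness: the dictionary layout, the `PROSE_FLAGS` set, the flag names, and the helper names `run_check` and `scorecard` are all assumptions made for the example, and the mirrored members deliberately imitate an incomplete draft so that every verdict appears.

```python
from enum import Enum


class Verdict(Enum):
    PASS = "PASS"        # lossless mapping to an existing type or enum value
    GAP = "GAP"          # mapping exists but needs prose, an enum value, or a note
    MISSING = "MISSING"  # no construct exists; a new IDL member is needed


# Hypothetical mirror of a small IDL slice: struct name -> set of member names.
IDL_MIRROR = {
    "RadDetection": {"velocity_xyz", "has_velocity_xyz", "rcs_dbm2", "has_rcs_dbm2"},
    "VisionFrame": {"is_key_frame"},
    "CamIntrinsics": {"width", "height", "dist", "model"},
}

# Hypothetical flags recording which normative prose the mirrored draft contains.
PROSE_FLAGS = {"rcs_units"}

# Each check lists the members a dataset field needs and, optionally, required prose.
CHECKS = [
    {"id": "R-03", "modality": "Radar", "struct": "RadDetection",
     "members": {"velocity_comp_xyz"}, "prose": None},
    {"id": "R-05", "modality": "Radar", "struct": "RadDetection",
     "members": {"rcs_dbm2", "has_rcs_dbm2"}, "prose": "rcs_units"},
    {"id": "V-02", "modality": "Vision", "struct": "CamIntrinsics",
     "members": {"dist", "model"}, "prose": "prerectified_semantics"},
    {"id": "V-04", "modality": "Vision", "struct": "VisionFrame",
     "members": {"is_key_frame"}, "prose": None},
]


def run_check(idl, prose_flags, check):
    """Return a Verdict for one check against the mirrored IDL."""
    members = idl.get(check["struct"], set())
    if not check["members"] <= members:
        return Verdict.MISSING
    if check["prose"] is not None and check["prose"] not in prose_flags:
        return Verdict.GAP
    return Verdict.PASS


def scorecard(idl, prose_flags, checks):
    """Tally verdicts per modality."""
    card = {}
    for check in checks:
        row = card.setdefault(check["modality"], {v.value: 0 for v in Verdict})
        row[run_check(idl, prose_flags, check).value] += 1
    return card


if __name__ == "__main__":
    for modality, row in scorecard(IDL_MIRROR, PROSE_FLAGS, CHECKS).items():
        print(modality, row)
```

Because both sides of the comparison are plain Python data, the sketch needs nothing beyond the standard library, which matches the dry-run character of the harness.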

Checks Performed

Radar (6 checks)

ID	Check	Description
R-01	Detection-centric profile	`RadDetection` struct exists with per-detection xyz, velocity, RCS, dyn_prop.
R-02	Per-detection velocity	Cartesian `velocity_xyz` (preferred) + scalar `v_r_mps` (fallback), both with `has_*` guards.
R-03	Ego-compensated velocity	`velocity_comp_xyz` field for ego-motion-compensated velocity.
R-04	Dynamic property enum	`RadDynProp` covers all 8 nuScenes values (UNKNOWN through STOPPED).
R-05	Per-detection RCS	`rcs_dbm2` field in dBm^2 (decibels relative to 1 m^2, also written dBsm) with `has_rcs_dbm2` guard.
R-06	Sensor type enum	`RadSensorType` differentiates SHORT_RANGE, LONG_RANGE, IMAGING_4D, etc.
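
To illustrate how the guarded radar fields in the table might be populated, the sketch below maps one nuScenes-style radar point to a `RadDetection`-like dictionary. It rests on several assumptions: the input keys follow the nuScenes radar point fields (`vx`, `vy`, `vx_comp`, `vy_comp`, `rcs`, `dyn_prop`), the guard name `has_velocity_comp_xyz` and the `xyz` key are hypothetical, the vertical velocity is set to zero because the source point carries only planar velocity, and `dyn_prop` is passed through as a raw integer where a real producer would translate it to a `RadDynProp` value.

```python
def radar_point_to_detection(pt):
    """Map one nuScenes-style radar point (a dict) to a RadDetection-like dict."""
    det = {
        "xyz": (pt["x"], pt["y"], pt["z"]),
        "dyn_prop": pt["dyn_prop"],      # raw value; enum translation omitted here
        "has_velocity_xyz": False,
        "has_velocity_comp_xyz": False,  # hypothetical guard name
        "has_rcs_dbm2": False,
    }
    if "vx" in pt and "vy" in pt:
        # Planar velocity only in the source; z component assumed to be zero.
        det["velocity_xyz"] = (pt["vx"], pt["vy"], 0.0)
        det["has_velocity_xyz"] = True
    if "vx_comp" in pt and "vy_comp" in pt:
        det["velocity_comp_xyz"] = (pt["vx_comp"], pt["vy_comp"], 0.0)
        det["has_velocity_comp_xyz"] = True
    if "rcs" in pt:
        det["rcs_dbm2"] = pt["rcs"]
        det["has_rcs_dbm2"] = True
    return det


if __name__ == "__main__":
    point = {"x": 12.0, "y": -3.5, "z": 0.0, "dyn_prop": 0,
             "vx": 4.0, "vy": 0.5, "rcs": 7.5}
    print(radar_point_to_detection(point))
```

Absent inputs leave the guard `False` and omit the value, so a consumer never has to interpret a sentinel number as "not measured".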

Vision (5 checks)

ID	Check	Description
V-01	RigRole coverage	`RigRole` enum includes FRONT, FRONT_LEFT, FRONT_RIGHT, BACK, BACK_LEFT, BACK_RIGHT.
V-02	Pre-rectified images	Normative prose documents `dist = NONE` with `model = PINHOLE` semantics.
V-03	Image dimensions	`CamIntrinsics.width` / `height` are REQUIRED; zero values are malformed.
V-04	Keyframe flag	`VisionFrame.is_key_frame` boolean.
V-05	Quaternion reorder	§2 table maps nuScenes `(w,x,y,z)` to SpatialDDS `(x,y,z,w)`.
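
The quaternion reorder in V-05 is a pure component permutation. A minimal sketch, with hypothetical function names, could look like this:

```python
def quat_wxyz_to_xyzw(q):
    """Reorder a nuScenes (w, x, y, z) quaternion to SpatialDDS (x, y, z, w)."""
    w, x, y, z = q
    return (x, y, z, w)


def quat_xyzw_to_wxyz(q):
    """Inverse reorder: SpatialDDS (x, y, z, w) back to nuScenes (w, x, y, z)."""
    x, y, z, w = q
    return (w, x, y, z)


if __name__ == "__main__":
    identity_nuscenes = (1.0, 0.0, 0.0, 0.0)
    print(quat_wxyz_to_xyzw(identity_nuscenes))
```

No sign change or renormalization is involved; only the position of the scalar component moves, so the round trip is exact.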

Lidar (6 checks)

ID	Check	Description
L-01	BIN_INTERLEAVED encoding	`CloudEncoding` value for raw interleaved binary with normative record layout table.
L-02	Per-point timestamps	`PointLayout.XYZ_I_R_T` and `XYZ_I_R_T_N` with normative prose for the `t` field.
L-03	Metadata guards	`LidarMeta` uses `has_range_limits`, `has_horiz_fov`, `has_vert_fov` guards.
L-04	Timestamp presence flag	`LidarFrame.has_per_point_timestamps` signals per-point timing in the blob.
L-05	t_end computation	Normative guidance for computing `t_end` from `t_start + 1/rate_hz` or `max(point.t)`.
L-06	Ring field	`PointLayout.XYZ_I_R` carries ring as `uint16`.
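
The two `t_end` options named in L-05 can be sketched as below. The function name and signature are hypothetical, and two assumptions are made that this appendix does not state: per-point timestamps are expressed in the same time base as `t_start`, and they take precedence over the rate-based estimate when both are available.

```python
def compute_t_end(t_start, rate_hz=None, point_times=None):
    """Estimate the end time of a lidar sweep, in the same units as t_start."""
    if point_times:
        # Per-point timestamps present: the latest point closes the sweep.
        return max(point_times)
    if rate_hz is not None and rate_hz > 0:
        # Fallback: one full revolution after the sweep started.
        return t_start + 1.0 / rate_hz
    raise ValueError("need per-point timestamps or a positive rate_hz")


if __name__ == "__main__":
    print(compute_t_end(10.0, rate_hz=20.0))
    print(compute_t_end(10.0, rate_hz=20.0, point_times=[10.0, 10.02, 10.049]))
```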

Semantics (5 checks)

ID	Check	Description
S-01	Size convention	Normative: `size[0]` = width (X), `size[1]` = height (Z), `size[2]` = depth (Y). nuScenes `(w,l,h)` -> `(w,h,l)` mapping documented.
S-02	Attributes	`Detection3D.attributes` as `sequence<MetaKV, 8>` with `has_attributes` guard.
S-03	Visibility	`Detection3D.visibility` float [0..1] with `has_visibility` guard.
S-04	Evidence counts	`num_lidar_pts` + `num_radar_pts` with `has_num_pts` guard.
S-05	Quaternion reorder	§2 table covers annotation quaternion conversion.
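
The size reordering in S-01 and the shared evidence-count guard in S-04 can be illustrated together. The sketch assumes a nuScenes-style annotation dictionary with `size`, `num_lidar_pts`, and `num_radar_pts` keys; the function name and the output dictionary layout are hypothetical.

```python
def annotation_to_detection3d(ann):
    """Map a nuScenes-style annotation dict to a Detection3D-like dict."""
    w, l, h = ann["size"]        # nuScenes order: width, length, height
    det = {
        "size": (w, h, l),       # size[0]=width, size[1]=height, size[2]=depth
        "has_num_pts": False,
    }
    if "num_lidar_pts" in ann and "num_radar_pts" in ann:
        # A single guard covers both evidence counts.
        det["num_lidar_pts"] = ann["num_lidar_pts"]
        det["num_radar_pts"] = ann["num_radar_pts"]
        det["has_num_pts"] = True
    return det


if __name__ == "__main__":
    car = {"size": (1.9, 4.5, 1.6), "num_lidar_pts": 120, "num_radar_pts": 3}
    print(annotation_to_detection3d(car))
```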

Common / Core (5 checks)

ID	Check	Description
C-01	Quaternion table	§2 convention table covering GeoPose, ROS 2, nuScenes, Eigen, Unity, Unreal, OpenXR, glTF.
C-02	FQN guidance	`FrameRef { uuid, fqn }` semantics documented; UUID is authoritative.
C-03	Local-frame coverage	§3.3.4 covers local-only deployments.
C-04	has_* pattern consistency	All new optional fields use the `has_*` guard pattern uniformly.
C-05	Sequence bounds	Standard bounds table: SZ_SMALL (256), SZ_MEDIUM (2048), SZ_LARGE (8192), SZ_XL (32768).
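
A consistency check in the spirit of C-04 might be sketched as follows. The function name and the field-to-guard mapping are hypothetical. The mapping is explicit rather than derived from field names because one guard can cover several fields, as `has_num_pts` does for the two evidence counts.

```python
def unguarded_fields(struct_members, optional_guards):
    """Return optional fields that are absent or lack a has_* guard member."""
    problems = []
    for field, guard in sorted(optional_guards.items()):
        if field not in struct_members:
            problems.append(field)
        elif not guard.startswith("has_") or guard not in struct_members:
            problems.append(field)
    return problems


# Hypothetical mirror of the optional Detection3D members named in this appendix.
DETECTION3D_MEMBERS = {
    "attributes", "has_attributes",
    "visibility", "has_visibility",
    "num_lidar_pts", "num_radar_pts", "has_num_pts",
}
DETECTION3D_GUARDS = {
    "attributes": "has_attributes",
    "visibility": "has_visibility",
    "num_lidar_pts": "has_num_pts",
    "num_radar_pts": "has_num_pts",
}

if __name__ == "__main__":
    print(unguarded_fields(DETECTION3D_MEMBERS, DETECTION3D_GUARDS))
```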

Results

All 27 checks pass against the SpatialDDS 1.5 specification as published.

Modality	Checks	Pass
Radar	6	6
Vision	5	5
Lidar	6	6
Semantics	5	5
Common / Core	5	5
Total	27	27

Spec Changes Informed by Testing

The conformance harness was first run against an early draft of SpatialDDS 1.5, which produced 29 gaps. The following normative changes were made in response:

Change	Profile	Origin
Complete radar profile replacement: tensor-based -> detection-centric (`RadDetection`, `RadDetectionSet`, `RadSensorMeta`)	Radar	R-01 through R-06
`RigRole` enum expanded with FRONT, FRONT_LEFT, FRONT_RIGHT, BACK, BACK_LEFT, BACK_RIGHT	Vision	V-01
Normative prose for `dist = NONE` pre-rectified image semantics	Vision	V-02
`CamIntrinsics.width` / `height` made REQUIRED with malformed-sample guidance	Vision	V-03
`VisionFrame.is_key_frame` boolean added	Vision	V-04
`CloudEncoding.BIN_INTERLEAVED` added with normative record layout table	Lidar	L-01
`PointLayout.XYZ_I_R_T` and `XYZ_I_R_T_N` added with per-point timestamp prose	Lidar	L-02, L-04
`has_range_limits`, `has_horiz_fov`, `has_vert_fov` guards added to `LidarMeta`	Lidar	L-03
`t_end` computation guidance for spinning lidars	Lidar	L-05
Size convention normative prose with nuScenes mapping	Semantics	S-01
`Detection3D.attributes`, `.visibility`, `.num_lidar_pts`, `.num_radar_pts` added	Semantics	S-02, S-03, S-04
§2 quaternion convention table with ecosystem mappings	Common	C-01, V-05, S-05
`FrameRef` FQN guidance and local-frame coverage section	Common	C-02, C-03

Reproducing the Test

The conformance harness is a single self-contained Python 3 script (`scripts/nuscenes_harness_v2.py`) with no external dependencies. To run:

python3 scripts/nuscenes_harness_v2.py

The script mirrors the IDL structures from this specification as Python dictionaries and checks them against the nuScenes schema. It produces a plain-text report and a JSON results file. No DDS runtime, network access, or nuScenes database download is required.
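
The exact layout of the two outputs is not specified here. Purely as an illustration, a plain-text report and a JSON document could be produced from a list of check results as shown below; the function name, column widths, and JSON keys are assumptions, not the format the script actually emits.

```python
import json


def render_results(results):
    """Return (plain-text report, JSON string) for a list of check results."""
    lines = [f'{r["id"]:<6}{r["modality"]:<16}{r["verdict"]}' for r in results]
    passed = sum(1 for r in results if r["verdict"] == "PASS")
    lines.append(f"{passed}/{len(results)} checks passed")
    payload = {"total": len(results), "passed": passed, "results": results}
    return "\n".join(lines), json.dumps(payload, indent=2, sort_keys=True)


if __name__ == "__main__":
    sample = [
        {"id": "R-01", "modality": "Radar", "verdict": "PASS"},
        {"id": "V-05", "modality": "Vision", "verdict": "PASS"},
    ]
    text, as_json = render_results(sample)
    print(text)
    print(as_json)
```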

Implementers are encouraged to adapt the harness for additional reference datasets (e.g., Waymo Open, KITTI, Argoverse 2) to validate coverage for sensor configurations and annotation conventions not present in nuScenes.

Limitations

This testing validates schema expressiveness -- whether every nuScenes field has a lossless SpatialDDS mapping. It does not validate:

Wire interoperability -- actual DDS serialization/deserialization round-trips.
Performance -- throughput, latency, or memory footprint under real sensor loads.
Semantic correctness -- whether a particular producer's mapping preserves the intended meaning of each field.
Multi-dataset coverage -- datasets with different sensor configurations (e.g., solid-state lidar, event cameras, ultrasonic sensors) may surface additional gaps.

These areas are appropriate targets for future conformance work.