`ista`#

Datasets from the Department of Engineering Acoustics, TU Berlin, Berlin Germany.

MIRACLE: Microphone Array Impulse Response Dataset for Acoustic Learning.
SRIRACHA: Shoebox Room Impulse Response Archive with Varying Absorption.

class irdl.ista.IstaBaseDataset#

Bases: BaseDataset

Base class for HDF5-based datasets from ISTA (MIRACLE, SRIRACHA).

Both MIRACLE and SRIRACHA share identical HDF5 file structure and can use the same ingestion logic to convert HDF5 to SOFA format.

Attributes:

room_volumefloat: Room volume in cubic meters, used for SOFA metadata.

_ingest(ingest_path: Path) → Sofa#

Convert a MIRACLE/SRIRACHA HDF5 file into a SOFA object.

Both datasets share an identical HDF5 layout, so this single implementation covers both subclasses. The output follows the SingleRoomMIMOSRIR SOFA convention.

Parameters:

ingest_pathpathlib.Path: Path to the HDF5 file.

Returns:

sofar.Sofa: SOFA object in the SingleRoomMIMOSRIR convention.

_source_filename(**dataset_kwargs) → str#

Construct the raw input filename with extension.

Shared implementation for MIRACLE and SRIRACHA datasets.

Parameters:

**dataset_kwargsdict: Must contain ‘scenario’. May contain ‘dataset_split’.

Returns:

str: Filename in format “{scenario}[-{split}].h5”.

class irdl.ista.MiracleDataset#

Bases: IstaBaseDataset

Download the MIRACLE database from DepositOnce.

Attributes:

namestr: Dataset name (“miracle”).
doistr: Digital Object Identifier (“10.14279/depositonce-20837”).
room_volumefloat: Room volume in cubic meters (830).
measurement_datefloat: Release date in POSIX seconds, used as the SOFA MeasurementDate (no per-measurement date is available).

_category = 'room_impulse_responses'#

_download(provider_dir: Path, **dataset_kwargs) → Path#

Download MIRACLE dataset file.

Downloads the full scenario HDF5 file. If a split is requested, the split will be extracted in _process().

Parameters:

provider_dirpathlib.Path: Provider directory (e.g., cache/MIRACLE/provider/).
**dataset_kwargsdict: Must contain ‘scenario’. May contain ‘dataset_split’.

Returns:

pathlib.Path: Path to the downloaded full scenario HDF5 file inside the provider directory.

_extract_split(ingest_path: Path, dataset_split: str, output_path: Path) → Path#

Extract a dataset split from a full MIRACLE HDF5 file.

Reads the full file, indexes the requested quadrant of the source grid, and writes the result to a new HDF5 file.

Parameters:

ingest_pathpathlib.Path: Path to the full HDF5 file in the provider directory.
dataset_splitstr: Split to extract. One of ‘C1’, ‘C2’, ‘C3’, ‘C4’.
output_pathpathlib.Path: Target path in the ingest directory.

Returns:

pathlib.Path: Path to the extracted split HDF5 file.

_process(provider_artifact: Path, ingest_path: Path, **dataset_kwargs) → Path#

Post-process MIRACLE file if needed.

If a dataset_split is requested and the file is the full scenario file, extracts the corresponding quadrant split into the ingest directory. Otherwise promotes the provider file to the ingest stage.

Parameters:

provider_artifactpathlib.Path: Path to the provider file (full scenario HDF5).
ingest_pathpathlib.Path: Path to the HDF5 file in the ingest directory.
**dataset_kwargsdict: Must contain ‘scenario’. May contain ‘dataset_split’.

Returns:

pathlib.Path: Path to the processed file in the ingest directory.

_validate_params(**dataset_kwargs) → None#

Validate MIRACLE-specific parameters.

Parameters:

**dataset_kwargsdict: Must contain ‘scenario’ (one of ‘A1’, ‘A2’, ‘D1’, ‘R2’). May contain ‘dataset_split’ (one of ‘C1’, ‘C2’, ‘C3’, ‘C4’, or None). Scenario ‘D1’ cannot be split. output_format is also passed but unused here.

Raises:

ValueError: If scenario or split is out of range, or ‘D1’ is combined with a split.

doi: str = '10.14279/depositonce-20837'#

Download the MIRACLE database from DepositOnce.

DOI: https://doi.org/10.14279/depositonce-20837

Parameters:

cache_dirstr: Cache directory for downloads. Defaults is the OS user cache directory. This default can be overridden by setting IRDL_CACHE_DIR environment variable.
export_dirstr, optional: Directory for final output. If specified, the data will be exported to <export_dir/MIRACLE/>. Else, it remains in <cache_dir/output/>.
output_formatstr: Output format: ‘pyfar’, ‘numpy’, ‘hdf5’, ‘sofa’, or ‘raw’.
scenariostr: Scenario to download. One of ‘A1’, ‘A2’, ‘D1’, ‘R2’.
dataset_splitstr or None, optional: Artificial dataset split. One of ‘C1’, ‘C2’, ‘C3’, ‘C4’ or None. Dense scenarios (D1) cannot be split.

Returns:

dict or Path: For ‘pyfar’ / ‘numpy’: dict of in-memory objects. For ‘sofa’ / ‘hdf5’ / ‘raw’: Path to file on disk.

measurement_date = 1697068800.0#

name: str = 'miracle'#

room_volume = 830#

class irdl.ista.SrirachaDataset#

Bases: IstaBaseDataset

Download and merge the SRIRACHA database from DepositOnce.

Attributes:

namestr: Dataset name (“sriracha”).
doistr: Digital Object Identifier (“10.14279/depositonce-23943”).
room_volumefloat: Room volume in cubic meters (73.5).
measurement_datefloat: Release date in POSIX seconds, used as the SOFA MeasurementDate (no per-measurement date is available).

_category = 'room_impulse_responses'#

_download(provider_dir: Path, **dataset_kwargs) → Path#

Download SRIRACHA dataset file(s) to the provider directory.

For dense scenarios or explicit splits, downloads a single file. For non-dense full-plane scenarios, downloads all 4 split files and returns the provider directory path.

Parameters:

provider_dirpathlib.Path: Provider directory (e.g., cache/SRIRACHA/provider/).
**dataset_kwargsdict: Must contain ‘scenario’. May contain ‘dataset_split’.

Returns:

pathlib.Path: Path to the downloaded file (inside provider) or the provider directory (for non-dense full-plane scenarios).

_merge_split_files(scenario: str, provider_artifact: Path, ingest_path: Path) → Path#

Merge four quadrant HDF5 files into a full-plane file.

Reads metadata from the first split file in the provider directory, allocates output datasets with the full source-grid shape, copies each split’s measurements into the interleaved grid positions, and deletes the provider split files afterwards.

Parameters:

scenariostr: Scenario name (e.g. ‘SR1’).
provider_artifactPath: Provider directory where split files are downloaded.
ingest_pathPath: Target path in the ingest directory for the merged file.

Returns:

Path: Path to the merged HDF5 file.

_process(provider_artifact: Path, ingest_path: Path, **dataset_kwargs) → Path#

Post-process SRIRACHA file if needed.

For non-dense full-plane scenarios, merges the 4 downloaded split files from the provider directory into a single file in the ingest directory. Otherwise promotes the single file to the ingest stage.

Parameters:

provider_artifactPath: Path to the downloaded file or the provider directory.
ingest_pathpathlib.Path: Path to the HDF5 file in the ingest directory.
**dataset_kwargsdict: Must contain ‘scenario’. May contain ‘dataset_split’.

Returns:

Path: Path to the processed file in the ingest directory.

_validate_params(**dataset_kwargs) → None#

Validate SRIRACHA-specific parameters.

Parameters:

**dataset_kwargsdict: Must contain ‘scenario’ (one of ‘SR1’, ‘SRA1’, ‘SR1-D’, ‘SRA1-D’, ‘SR2’, ‘SRA2’, ‘SR2-D’, ‘SRA2-D’). May contain ‘dataset_split’ (one of ‘C1’, ‘C2’, ‘C3’, ‘C4’, or None). Dense scenarios (ending in ‘-D’) cannot be split. output_format is also passed and used to forbid ‘raw’ for non-dense full-plane scenarios.

Raises:

ValueError: If scenario or split is invalid, a dense scenario is combined with a split, or ‘raw’ is requested for a non-dense full plane.

doi: str = '10.14279/depositonce-23943'#

Download and merge the SRIRACHA database from DepositOnce.

DOI: https://doi.org/10.14279/depositonce-23943

Parameters:

cache_dirstr: Cache directory for downloads. Defaults is the OS user cache directory. This default can be overridden by setting IRDL_CACHE_DIR environment variable.
export_dirstr, optional: Directory for final output. If specified, the data will be exported to <export_dir/SRIRACHA/>. Else, it remains in <cache_dir/output/>.
output_formatstr: Output format: ‘pyfar’, ‘numpy’, ‘hdf5’, ‘sofa’, or ‘raw’.
scenariostr, optional: Scenario to download. One of ‘SR1’, ‘SRA1’, ‘SR1-D’, ‘SRA1-D’, ‘SR2’, ‘SRA2’, ‘SR2-D’, ‘SRA2-D’. Default is ‘SR1-D’.
dataset_splitstr or None, optional: Optional dataset split for full-plane scenarios. One of ‘C1’, ‘C2’, ‘C3’, ‘C4’ or None. Dense scenarios (ending in ‘-D’) do not have splits. Default is None.

Returns:

dict or Path: For ‘pyfar’ / ‘numpy’: dict of in-memory objects. For ‘sofa’ / ‘hdf5’ / ‘raw’: Path to file on disk.

measurement_date = 1755648000.0#

name: str = 'sriracha'#

room_volume = 73.5#

ista#

`ista`#