This is a tracking issue for porting torchaudio extension modules to the torch stable ABI.
Must have:
- `AT_DISPATCH_FLOATING_TYPES_AND_HALF`, `AT_DISPATCH_FLOATING_TYPES`. Prototypes of these macros are implemented in [STABLE ABI] Port forced_align #4079 (see dispatch.h) and moved to upstream, see [STABLE ABI] Add STABLE_DISPATCH_... CPP macros pytorch#163973. The plan is to move these macros to headeronly, see the ghstack starting from Move AT_FORALL_... macros and ScalarTypeToCPPTypeT to headeronly pytorch#164350 and Refactor AT_DISPATCH_CASE_... macros to headeronly pytorch#165695.
  - UPDATE: when Move AT_DISPATCH_V2 helper macros to headeronly and add THO_DISPATCH_V2_TMPL pytorch#165856 lands, we'll use the AT_DISPATCH V2 macros.
- `mutable_data_ptr`/`const_data_ptr` methods and templates. Prototypes are implemented in [STABLE ABI] Port forced_align #4079 (see ops.h) and moved to upstream, see [STABLE ABI] Add mutable_data_ptr() and const_data_ptr() methods to torch::stable::Tensor. pytorch#161891
  - Why we need this: dependency of the accessors below.
- tensor accessors for both CPU and CUDA. Prototypes of these templates are implemented in [STABLE ABI] Port forced_align #4079 (see TensorAccessor.h) and moved to upstream, see [STABLE ABI] Add tensor accessors. pytorch#164123; landing requires ArrayRef.
  - Why we need this: the codebase currently relies on accessors with up to 3 dimensions (see the snippet below). We could technically work around this by relying on raw pointers and enforcing contiguity, but that would obfuscate what would otherwise be simple indexing calls.
    From `audio/src/libtorchaudio/forced_align/cpu/compute.cpp`, lines 40 to 42 at 87ff22e:

    ```cpp
    auto logProbs_a = logProbs.accessor<scalar_t, 3>();
    auto targets_a = targets.accessor<target_t, 2>();
    auto paths_a = paths.accessor<target_t, 2>();
    ```
  - UPDATE: Refactor TensorAccessor for headeronly pytorch#166855 refactors `TensorAccessor` to headeronly.
- `parallel_for`, in progress by the torch stable ABI team: Add stable parallel_for pytorch#161320
  - Alternative is to not use `parallel_for`, at the cost of decreased performance. Acceptable, but not ideal. (A sketch of how these must-have pieces fit together follows this list.)
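
For context, here is a minimal sketch of how the must-have pieces would combine in a stable-ABI port of the forced_align CPU kernel. It assumes the prototyped names from #4079 survive upstreaming (the `AT_DISPATCH_*` macros, `Tensor::accessor<T, N>()` built on `const_data_ptr`/`mutable_data_ptr`, and an eventual stable `parallel_for`); `forced_align_like` and the include path are placeholders, not the final API.

```cpp
// Sketch only, assuming the stable-ABI prototypes from #4079 keep these names
// (dispatch macros, accessor<T, N>(), torch::stable::Tensor). The include path
// and the function name are placeholders.
#include <torch/csrc/stable/tensor.h>

using torch::stable::Tensor;

void forced_align_like(const Tensor& logProbs, const Tensor& targets, Tensor& paths) {
  // Dispatch over floating types without pulling in the full ATen headers.
  AT_DISPATCH_FLOATING_TYPES_AND_HALF(logProbs.scalar_type(), "forced_align_like", [&] {
    // Accessors (backed by const_data_ptr/mutable_data_ptr) instead of raw
    // pointer arithmetic; shapes match the compute.cpp snippet above. The
    // target dtype is fixed to int64_t here for brevity.
    auto logProbs_a = logProbs.accessor<scalar_t, 3>();  // (batch, time, classes)
    auto targets_a = targets.accessor<int64_t, 2>();     // (batch, target_len)
    auto paths_a = paths.accessor<int64_t, 2>();         // (batch, time)

    // A stable parallel_for (pytorch#161320) would replace this loop once it
    // lands; a plain serial loop is the slower fallback mentioned above.
    for (int64_t b = 0; b < logProbs_a.size(0); ++b) {
      // ... per-batch dynamic programming over logProbs_a[b] and targets_a[b],
      //     writing the best path into paths_a[b] ...
    }
  });
}
```

The point of the accessor and dispatch items is that the body of the kernel can stay essentially unchanged from the current compute.cpp; only the Tensor type and headers change.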
Nice to have:
We think we can reasonably work around these without incurring too much technical debt in TorchAudio, so we can treat them as nice-to-have for now (a usage sketch follows the list).
- Tensor operations as stable ABI operations:
  - `item<T>()` template, prototype available in [STABLE ABI] Port forced_align #4079 (see ops.h)
  - `to` function (to CUDA, to CPU, as an alternative to the `cpu` and `cuda` functions below)
  - `cpu` function, prototype available in [STABLE ABI] Port forced_align #4079 (see ops.h), moved to upstream, see [STABLE ABI] Add cpu operation. pytorch#161911
  - `cuda` function, prototype available in [STABLE ABI] Port forced_align #4079 (see ops.h)
  - `copy_` function, done in [STABLE ABI] Add copy_ operation. pytorch#161895
  - `clone` function, done in [STABLE ABI] Add clone method to torch::stable::Tensor pytorch#161896
  - `index` function, requires `Slice`; workaround using other aten methods
  - `new_zeros` function, prototype available in [STABLE ABI] Port forced_align #4079 (see ops.h)
  - `new_empty` with device support, prototypes in [STABLE ABI] Port forced_align #4079 and [STABLE ABI] Add device_type and device_index optional arguments to new_empty. pytorch#161894
  - `tensor` function
  - `max` function
  - `select` function
  - `unsqueeze` function
  - `squeeze` function
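
To make the wish-list above concrete, here is a hedged sketch of the kind of host-side call sites these operations would replace. The free-function spellings follow the prototypes in #4079's ops.h and the linked PyTorch PRs; the exact names and signatures are still in flux, and `best_score_on_host` is a hypothetical helper.

```cpp
// Sketch only: spellings follow the #4079 ops.h prototypes and the linked PRs;
// the final stable-ABI signatures may differ.
#include <torch/csrc/stable/tensor.h>

using torch::stable::Tensor;

// Hypothetical helper showing how the nice-to-have operations compose.
float best_score_on_host(const Tensor& scores_gpu) {
  // cpu (pytorch#161911): bring the result back to the host.
  Tensor scores = cpu(scores_gpu);
  // clone (pytorch#161896): detach from the original storage before any in-place work.
  Tensor work = clone(scores);
  // max + item<T>(): reduce and read back a single scalar. Without index/Slice,
  // reductions plus select are the workaround mentioned above.
  Tensor best = max(work);
  return item<float>(best);
}
```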