Left: Input data for the operating modes (a) refinement, (b) implicit, and (c) mixed. Right: Overview diagram of the neural network. X_N denotes the N-th order Ambisonics mixture and X_1 the first-order Ambisonics mixture. The remaining quantities are the scaled target direction, the output of a max-rE beamformer steered to the target direction, and the estimated separated signal.

