Existing audio language models typically rely on task-specific fine-tuning to accomplish particular audio tasks. In contrast, humans are able to generalize to new audio tasks with only a few examples ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results