arXiv: Conformer: Convolution-augmented Transformer for Speech Recognition