Explain Transformer, GPT vs BERT, and PR metrics | ByteDance