SMS: Associating multiple vision transformer layers for fine-grained image representation