I am using the following environment to perform SFT on the Qwen3.5 MoE model:
transformers=5.2.0
transformer-engine=2.12.0
flash-attn=2.8.3
The following error occurred when transformer-engine imported flash-attn:

From what I found online, this may be caused by a version incompatibility between transformer-engine and flash-attn. Which versions should I change to resolve it, or could there be another cause?
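
In case it helps with diagnosis, here is a minimal sketch for confirming which versions are actually installed in the active environment, since they can differ from what was requested. The helper name `installed_versions` is hypothetical, and the distribution names are assumed to match the package names listed above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(names):
    """Return {distribution name: version string, or None if not installed}."""
    result = {}
    for name in names:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not present in the active environment.
            result[name] = None
    return result


if __name__ == "__main__":
    import platform

    print("python", platform.python_version())
    # Assumed distribution names; adjust if your install uses different ones.
    packages = ["transformers", "transformer-engine", "flash-attn"]
    for name, ver in installed_versions(packages).items():
        print(name, ver if ver is not None else "not installed")
```

Running this inside the same environment used for training shows whether the three packages resolve to the versions listed above.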