CUDA已经是11.4以上了,安装flash-attention库的时候报错。
× python setup.py egg_info did not run successfully.
│ exit code: 1
╰─> [11 lines of output]
Traceback (most recent call last):
File "", line 2, in
File "", line 34, in
File "/mnt/20230808/Qwen-7B-main/flash-attention/setup.py", line 113, in
raise RuntimeError("FlashAttention is only supported on CUDA 11 and above")
RuntimeError: FlashAttention is only supported on CUDA 11 and above
torch.__version__ = 2.0.1+cu117
[end of output]
ModelScope旨在打造下一代开源的模型即服务共享平台,为泛AI开发者提供灵活、易用、低成本的一站式模型服务产品,让模型应用更简单!欢迎加入技术交流群:微信公众号:魔搭ModelScope社区,钉钉群号:44837352