
How do I resolve this Machine Learning PAI error?

How do I resolve this Machine Learning PAI error?
INFO: Found applicable config definition build:dynamic_kernels in file /home/pangjun/BladeDISC_GPU/tf_community/.bazelrc: --define=dynamic_loaded_kernels=true --copt=-DAUTOLOAD_DYNAMIC_KERNELS
Loading:
WARNING: The following configs were expanded more than once: [cuda]. For repeatable flags, repeats are counted twice and may lead to unexpected behavior.
Loading:
Loading: 0 packages loaded
INFO: Build options --action_env, --compilation_mode, --copt, and 2 more have changed, discarding analysis cache.
Analyzing: 2 targets (1 packages loaded, 0 targets configured)
INFO: Analyzed 2 targets (195 packages loaded, 13606 targets configured).
checking cached actions
INFO: Found 1 target and 1 test target...
[0 / 4] [Prepa] BazelWorkspaceStatusAction stable-status.txt
WARNING: /home/pangjun/.cache/bazel/_bazel_pangjun/a92cb0e935d0b101686941713fa06780/external/org_disc_compiler/mlir/disc/BUILD:2133:8: input 'mlir/disc/cutlass' to @org_disc_compiler//mlir/disc:cutlass_header_preprocess is a directory; dependency checking of directories is unsound
[6,483 / 8,118] Compiling llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp; 0s local, remote-cache ... (191 actions, 190 running)
[6,483 / 8,118] Compiling llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp; 1s local, remote-cache ... (192 actions running)
[6,485 / 8,118] Compiling llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp; 2s local, remote-cache ... (191 actions, 190 running)
ERROR: /home/pangjun/.cache/bazel/_bazel_pangjun/a92cb0e935d0b101686941713fa06780/external/org_disc_compiler/mlir/disc/BUILD:2133:8: Executing genrule @org_disc_compiler//mlir/disc:cutlass_header_preprocess failed: (Exit 2): bash failed: error executing command (from target @org_disc_compiler//mlir/disc:cutlass_header_preprocess) /bin/bash -c ... (remaining 1 argument skipped)
cc1plus: fatal error: cuda_runtime.h: No such file or directory
How should I set the environment variables so that the build can find the CUDA runtime in my conda environment?

真的很搞笑 2023-12-24 18:42:42
1 answer
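  • [Screenshot: 7ceec8d13bc2d784b6e3745f33c58a32.png]
    Check the path that is found through nvcc here (in the screenshot), or try adding the conda CUDA path directly at L30. This answer was compiled from the DingTalk group "BladeDISC用户支持群" (BladeDISC user support group).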

    2023-12-25 06:20:41
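
The suggestion above comes down to finding out which directory really contains cuda_runtime.h and then making the compiler see it. The following is a minimal sketch of that check, not BladeDISC's own logic. The helper names are hypothetical, and the two directory layouts it probes (`<prefix>/include` and `<prefix>/targets/<arch>/include`) are assumptions about how the conda CUDA packages place their headers. CPATH is an environment variable that GCC reads for extra include directories.

```python
import os
import shutil
from pathlib import Path


def candidate_cuda_include_dirs(conda_prefix=None, nvcc_path=None):
    """Return directories that may contain cuda_runtime.h, in search order."""
    candidates = []
    if conda_prefix:
        prefix = Path(conda_prefix)
        # Assumed conda layouts: headers directly under include/, or under
        # targets/<arch>/include/.
        candidates.append(prefix / "include")
        candidates.extend(sorted(prefix.glob("targets/*/include")))
    if nvcc_path:
        # nvcc normally lives in <cuda_root>/bin, so the headers are
        # expected relative to <cuda_root>.
        cuda_root = Path(nvcc_path).resolve().parent.parent
        candidates.append(cuda_root / "include")
        candidates.extend(sorted(cuda_root.glob("targets/*/include")))
    return candidates


def find_cuda_runtime_header(conda_prefix=None, nvcc_path=None):
    """Return the first candidate directory holding cuda_runtime.h, or None."""
    for directory in candidate_cuda_include_dirs(conda_prefix, nvcc_path):
        if (directory / "cuda_runtime.h").is_file():
            return directory
    return None


def prepend_to_cpath(include_dir, env=None):
    """Return a copy of env with include_dir placed first in CPATH."""
    env = dict(os.environ if env is None else env)
    existing = env.get("CPATH", "")
    if existing:
        env["CPATH"] = f"{include_dir}{os.pathsep}{existing}"
    else:
        env["CPATH"] = str(include_dir)
    return env


if __name__ == "__main__":
    found = find_cuda_runtime_header(
        conda_prefix=os.environ.get("CONDA_PREFIX"),
        nvcc_path=shutil.which("nvcc"),
    )
    if found is None:
        print("cuda_runtime.h not found in the conda environment or next to nvcc")
    else:
        print(f"export CPATH={prepend_to_cpath(found)['CPATH']}")
```

If the script prints an export line, running it in the shell before the build is one thing to try. Bazel does not necessarily forward shell variables to genrule actions, so the variable may also need to be passed with `--action_env=CPATH`. Whether the BladeDISC build scripts honor this has not been verified here.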

