from modelscope.msdatasets import MsDataset ds = MsDataset.load('modelscope/Youku-AliceMind', subset_name='classification', split='train', use_streaming=True)
2023-06-14 09:24:59,308 - modelscope - INFO - PyTorch version 1.11.0 Found. 2023-06-14 09:24:59,310 - modelscope - INFO - Loading ast index from /home/xxx/.cache/modelscope/ast_indexer 2023-06-14 09:24:59,323 - modelscope - INFO - Loading done! Current index file version is 1.0.3, with md5 xxx (一直卡在这里)
可以尝试先删除一下缓存,即该文件夹下的内容: ~/.cache/modelscope/hub/datasets/modelscope/Youku-AliceMind/master/meta
然后再执行如下代码: from modelscope.msdatasets import MsDataset
ds = MsDataset.load('modelscope/Youku-AliceMind', subset_name='classification', split='train', use_streaming=True)
for line in ds: print(line)
结果示例如下: