Skip to content

Conversation

@JunKnows
Copy link

@JunKnows JunKnows commented Nov 21, 2025

修复两个小问题:

  1. opencl depthwise deconv算子存在计算错误,对照opencl deconv的逻辑检查并且手动验算验证修复;
  2. opencl depthwise 的cl代码没有支持切换FP16,对照opencl deconv的逻辑检查并且手动验算验证修复;
  3. vulkan的barrier设置缺少sType、pNext、srcQueueFamilyIndex、dstQueueFamilyIndex,在Adreno (TM) 830必现卷积结果异常,详见issue Vulkan后端-Adreno (TM) 830必现卷积结果异常(出现Nan或者数值巨大)-- 已提交PR修复 #4015

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant