Implement CPU operator for DCNv3 #324

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

hxaxd wants to merge 1 commit into OpenGVLab:master from hxaxd:master

hxaxd commented Nov 7, 2025 •

edited

Loading

Purpose

Reduce the deployment difficulty of small-scale visual models using DCNv3 on different devices (using the same precompiled package for downgrading)

Work

Provide CPU downgrade by modifying dcnv3.h
Modify setup.py to provide CPU-Only compilation method and enable O2 optimization
Fully implement CPU operators

Effects

Minimize intrusion into the original code and compilation methods as much as possible
Accuracy passes tests (based on the original tests by modifying CUDA interfaces to corresponding CPU interfaces)

Issues

In the scenario of multi-core x86 CPU supporting SIMD, the speed is only 0.27x of the PyTorch CPU version


          Implement CPU operator for DCNv3

6d2a760

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet