跳转至

CPU Instructions & Intrinsics


Overview

Assembly

SIMD

Intel MMX & SSE

ARM NEON

Arm NEON technology is an advanced SIMD (single instruction multiple data) architecture extension for the Arm Cortex-A series and Cortex-R52 processors.

Compiler Options:

  • test ARM NEON
Bash
gcc -dM -E -x c /dev/null | grep -i -E "(SIMD|NEON|ARM)"
  • Raspberry Pi 3 Model B

  • g++ options

    Bash
    -std=c++11 -O3 -march=native -mfpu=neon-vfpv4 -mfloat-abi=softfp -ffast-math
    

  • for the compilation error error: ‘vfmaq_f32’ was not declared in this scope, you might add the option -mfpu=neon-vfpv4 to enable __ARM_FEATURE_FMA in arm_neon.h

Reference Books:

  • NEON Programmer’s Guide
  • ARM® NEON Intrinsics Reference

Converter