Skip to content

GH-47784: [C++] Patch vendored pcg library to enable msvc arm64 intrinsics#47779

Merged
pitrou merged 1 commit into
apache:mainfrom
jgiannuzzi:windows-arm64-cpp
Oct 15, 2025
Merged

GH-47784: [C++] Patch vendored pcg library to enable msvc arm64 intrinsics#47779
pitrou merged 1 commit into
apache:mainfrom
jgiannuzzi:windows-arm64-cpp

Conversation

@jgiannuzzi

@jgiannuzzi jgiannuzzi commented Oct 10, 2025

Copy link
Copy Markdown
Contributor

Rationale for this change

This change enables building Arrow C++ for Windows ARM64 with MSVC when setting ARROW_SIMD_LEVEL to NONE. This is useful to be able to build Arrow with vcpkg and use it as a dependency.

Setting ARROW_SIMD_LEVEL to NONE is done to disable the use of xsimd, which does not yet support msvc arm64 intrinsics, and is non-trivial to fix.

What changes are included in this PR?

A patch to the vendored pcg library, based on imneme/pcg-cpp#99. The upstream pcg library has not been updated in the past 3 years, so this may never get merged.

Are these changes tested?

Yes, the changes have been tested in microsoft/vcpkg#47750 (the same patch for vcpkg, which this change would alleviate) and in https://github.com/jgiannuzzi/ParquetSharp/actions/runs/18354286294 (a full run of the ParquetSharp CI, using this patch to build Arrow with vcpkg).

Are there any user-facing changes?

Not really, unless you consider adding the ability to build Arrow on Windows ARM64 user-facing?

@github-actions

Copy link
Copy Markdown

Thanks for opening a pull request!

If this is not a minor PR. Could you open an issue for this pull request on GitHub? https://github.com/apache/arrow/issues/new/choose

Opening GitHub issues ahead of time contributes to the Openness of the Apache Arrow project.

Then could you also rename the pull request title in the following format?

GH-${GITHUB_ISSUE_ID}: [${COMPONENT}] ${SUMMARY}

or

MINOR: [${COMPONENT}] ${SUMMARY}

See also:

@jgiannuzzi jgiannuzzi changed the title [C++] Patch vendored pcg library to enable msvc arm64 intrinsics GH-47784: [C++] Patch vendored pcg library to enable msvc arm64 intrinsics Oct 10, 2025
@github-actions

Copy link
Copy Markdown

⚠️ GitHub issue #47784 has been automatically assigned in GitHub to PR creator.

@jgiannuzzi jgiannuzzi marked this pull request as ready for review October 10, 2025 17:35
@kou

kou commented Oct 11, 2025

Copy link
Copy Markdown
Member

Hmm. It seems that https://github.com/imneme/pcg-cpp isn't maintained...

Do we have alternative implementation...?

https://github.com/imneme/pcg-cpp was vendored by #8879. @pitrou Do you have any opinion?

@pitrou

pitrou commented Oct 11, 2025

Copy link
Copy Markdown
Member

Setting ARROW_SIMD_LEVEL to NONE is done to disable the use of xsimd, which does not yet support msvc arm64 intrinsics, and is non-trivial to fix.

@JohanMabille @AntoinePrv @serge-sans-paille Do you think xsimd can support ARM64 intrinsics on MSVC?

@pitrou

pitrou commented Oct 11, 2025

Copy link
Copy Markdown
Member

A patch to the vendored pcg library, based on imneme/pcg-cpp#99. The upstream pcg library has not been updated in the past 3 years, so this may never get merged.

What does this patch have to do with xsimd? Am I missing something?

@jgiannuzzi

jgiannuzzi commented Oct 11, 2025

Copy link
Copy Markdown
Contributor Author

What does this patch have to do with xsimd? Am I missing something?

Sorry @pitrou, my description wasn't clear. This patch has nothing to do with xsimd. I just meant to say that this pcg patch is not enough to build arrow with msvc on arm64 - you also need to set ARROW_SIMD_LEVEL to NONE to switch off ARROW_HAVE_NEON, which in turn will disable the use of xsimd, as it doesn't build with msvc for arm64.

@jgiannuzzi

Copy link
Copy Markdown
Contributor Author

@JohanMabille @AntoinePrv @serge-sans-paille Do you think xsimd can support ARM64 intrinsics on MSVC?

I found this PR created in 2021 attempting to solve this: xtensor-stack/xsimd#612.

@pitrou

pitrou commented Oct 12, 2025

Copy link
Copy Markdown
Member

Hmm. It seems that https://github.com/imneme/pcg-cpp isn't maintained...

Do we have alternative implementation...?

There is a fork here by @brt-v: https://github.com/brt-v/pcg-cpp?tab=readme-ov-file#about-this-fork
I'm not sure how reasonable it is to rely on it, though.

@brt-v

brt-v commented Oct 12, 2025

Copy link
Copy Markdown

Hmm. It seems that https://github.com/imneme/pcg-cpp isn't maintained...
Do we have alternative implementation...?

There is a fork here by @brt-v: https://github.com/brt-v/pcg-cpp?tab=readme-ov-file#about-this-fork I'm not sure how reasonable it is to rely on it, though.

I made that fork because of the lack of maintenance of the original implementation, with mostly quality of life improvements. The logic and tests are identical to the original, excepts for some small fixes to get all tests working on Windows. Note that I'm not an rng researcher and have no intention in further developing the algorithm, I just wanted to provide an easy to use library. I understand to you may not want to rely on such a small/unknown repo, but I'm happy to address your concerns and make further improvements where needed.

@JohanMabille

JohanMabille commented Oct 13, 2025

Copy link
Copy Markdown
Contributor

Setting ARROW_SIMD_LEVEL to NONE is done to disable the use of xsimd, which does not yet support msvc arm64 intrinsics, and is non-trivial to fix.

@JohanMabille @AntoinePrv @serge-sans-paille Do you think xsimd can support ARM64 intrinsics on MSVC?

Yes it could, but I cannot give a deadline for that. For the record, an issue for tracking this feature request has already been opened.

@pitrou

pitrou commented Oct 13, 2025

Copy link
Copy Markdown
Member

I understand to you may not want to rely on such a small/unknown repo, but I'm happy to address your concerns and make further improvements where needed.

Well, I think we could at some point. For now, we still support C++11 so we'll have to patch our vendored copy instead.

@kou

kou commented Oct 13, 2025

Copy link
Copy Markdown
Member

we still support C++11

Really? I think that we require C++17:

# This ensures that things like c++17 get passed correctly
if(NOT DEFINED CMAKE_CXX_STANDARD)
set(CMAKE_CXX_STANDARD 17)
elseif(${CMAKE_CXX_STANDARD} VERSION_LESS 17)
message(FATAL_ERROR "Cannot set a CMAKE_CXX_STANDARD smaller than 17")
endif()

@pitrou

pitrou commented Oct 13, 2025

Copy link
Copy Markdown
Member

Really? I think that we require C++17:

Wow. I even did the change myself. Clearly I haven't drunk enough tea this morning...

(I was thinking of C++20 for some reason)

@jgiannuzzi

jgiannuzzi commented Oct 15, 2025

Copy link
Copy Markdown
Contributor Author

#47811 adds a CI job to build the C++ library with msvc arm64. I left it as draft as it depends on this current PR to be merged first. @pitrou what are your latest thoughts on it?

@pitrou pitrou left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please update the README to include the patch that was applied, otherwise LGTM.

@github-actions github-actions Bot added awaiting committer review Awaiting committer review and removed awaiting review Awaiting review labels Oct 15, 2025
@pitrou

pitrou commented Oct 15, 2025

Copy link
Copy Markdown
Member

@github-actions crossbow submit -g cpp win

@pitrou pitrou added the CI: Extra: C++ Run extra C++ CI label Oct 15, 2025
@github-actions

Copy link
Copy Markdown

Revision: d6c980c

Submitted crossbow builds: ursacomputing/crossbow @ actions-c9b3535f37

Task Status
example-cpp-minimal-build-static GitHub Actions
example-cpp-minimal-build-static-system-dependency GitHub Actions
example-cpp-tutorial GitHub Actions
test-build-cpp-fuzz GitHub Actions
test-build-vcpkg-win GitHub Actions
test-conda-cpp GitHub Actions
test-conda-cpp-valgrind GitHub Actions
test-cuda-cpp-ubuntu-22.04-cuda-11.7.1 GitHub Actions
test-debian-12-cpp-amd64 GitHub Actions
test-debian-12-cpp-i386 GitHub Actions
test-fedora-42-cpp GitHub Actions
test-ubuntu-22.04-cpp GitHub Actions
test-ubuntu-22.04-cpp-20 GitHub Actions
test-ubuntu-22.04-cpp-bundled GitHub Actions
test-ubuntu-22.04-cpp-emscripten GitHub Actions
test-ubuntu-22.04-cpp-no-threading GitHub Actions
test-ubuntu-24.04-cpp GitHub Actions
test-ubuntu-24.04-cpp-bundled-offline GitHub Actions
test-ubuntu-24.04-cpp-gcc-13-bundled GitHub Actions
test-ubuntu-24.04-cpp-gcc-14 GitHub Actions
test-ubuntu-24.04-cpp-minimal-with-formats GitHub Actions
test-ubuntu-24.04-cpp-thread-sanitizer GitHub Actions
verify-rc-source-windows GitHub Actions
wheel-windows-cp310-cp310-amd64 GitHub Actions
wheel-windows-cp311-cp311-amd64 GitHub Actions
wheel-windows-cp312-cp312-amd64 GitHub Actions
wheel-windows-cp313-cp313-amd64 GitHub Actions
wheel-windows-cp313-cp313t-amd64 GitHub Actions
wheel-windows-cp314-cp314-amd64 GitHub Actions
wheel-windows-cp314-cp314t-amd64 GitHub Actions

@jgiannuzzi

Copy link
Copy Markdown
Contributor Author

Please update the README to include the patch that was applied, otherwise LGTM.

Done! Could you please review again @pitrou?

Comment thread cpp/src/arrow/vendored/pcg/README.md Outdated
This enables building Arrow C++ for Windows ARM64 with MSVC when setting
ARROW_SIMD_LEVEL to NONE (to disable xsimd usage, which does not support
msvc arm64 intrinsics, and is non-trivial to fix).

This patch is based on imneme/pcg-cpp#99.
The upstream pcg library has not been updated in the past 3 years, so
this may never get merged.
@pitrou pitrou self-requested a review October 15, 2025 14:22
@pitrou

pitrou commented Oct 15, 2025

Copy link
Copy Markdown
Member

CI failures are unrelated, I'll merge. Thank you @jgiannuzzi !

@pitrou pitrou merged commit 541cba8 into apache:main Oct 15, 2025
48 checks passed
@pitrou pitrou removed the awaiting committer review Awaiting committer review label Oct 15, 2025
@jgiannuzzi jgiannuzzi deleted the windows-arm64-cpp branch October 15, 2025 15:19
@jgiannuzzi

Copy link
Copy Markdown
Contributor Author

Thanks @pitrou!

@conbench-apache-arrow

Copy link
Copy Markdown

After merging your PR, Conbench analyzed the 3 benchmarking runs that have been run so far on merge-commit 541cba8.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details. It also includes information about 1 possible false positive for unstable benchmarks that are known to sometimes produce them.

zanmato1984 pushed a commit to zanmato1984/arrow that referenced this pull request Nov 5, 2025
… intrinsics (apache#47779)

### Rationale for this change

This change enables building Arrow C++ for Windows ARM64 with MSVC when setting `ARROW_SIMD_LEVEL` to `NONE`. This is useful to be able to build Arrow with `vcpkg` and use it as a dependency.

Setting `ARROW_SIMD_LEVEL` to `NONE` is done to disable the use of `xsimd`, which does not yet support msvc arm64 intrinsics, and is non-trivial to fix.

### What changes are included in this PR?

A patch to the vendored `pcg` library, based on imneme/pcg-cpp#99. The upstream pcg library has not been updated in the past 3 years, so this may never get merged.

### Are these changes tested?

Yes, the changes have been tested in microsoft/vcpkg#47750 (the same patch for `vcpkg`, which this change would alleviate) and in https://github.com/jgiannuzzi/ParquetSharp/actions/runs/18354286294 (a full run of the ParquetSharp CI, using this patch to build Arrow with `vcpkg`).

### Are there any user-facing changes?

Not really, unless you consider adding the ability to build Arrow on Windows ARM64 user-facing?
* GitHub Issue: apache#47784

Authored-by: Jonathan Giannuzzi <jonathan@giannuzzi.me>
Signed-off-by: Antoine Pitrou <antoine@python.org>
Mottl pushed a commit to Mottl/arrow that referenced this pull request May 26, 2026
… intrinsics (apache#47779)

### Rationale for this change

This change enables building Arrow C++ for Windows ARM64 with MSVC when setting `ARROW_SIMD_LEVEL` to `NONE`. This is useful to be able to build Arrow with `vcpkg` and use it as a dependency.

Setting `ARROW_SIMD_LEVEL` to `NONE` is done to disable the use of `xsimd`, which does not yet support msvc arm64 intrinsics, and is non-trivial to fix.

### What changes are included in this PR?

A patch to the vendored `pcg` library, based on imneme/pcg-cpp#99. The upstream pcg library has not been updated in the past 3 years, so this may never get merged.

### Are these changes tested?

Yes, the changes have been tested in microsoft/vcpkg#47750 (the same patch for `vcpkg`, which this change would alleviate) and in https://github.com/jgiannuzzi/ParquetSharp/actions/runs/18354286294 (a full run of the ParquetSharp CI, using this patch to build Arrow with `vcpkg`).

### Are there any user-facing changes?

Not really, unless you consider adding the ability to build Arrow on Windows ARM64 user-facing?
* GitHub Issue: apache#47784

Authored-by: Jonathan Giannuzzi <jonathan@giannuzzi.me>
Signed-off-by: Antoine Pitrou <antoine@python.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants