Merged
50 commits
cf10b40
GH-44526: [C++][Acero] Fix crash when thread in asof_join is not runn…
mroz45 Nov 11, 2024
5e3ef51
MINOR: [Dev] Remove Rossi from collaborators list (#44694)
zanmato1984 Nov 11, 2024
d7e982c
GH-44607: [C++][Dev] Update bundled Thrift, update mirrors to use CDN…
amoeba Nov 11, 2024
aab7d81
GH-43631: [C++] Add C++ implementation of Async C Data Interface (#44…
zeroshade Nov 11, 2024
d7641dc
GH-44614: [Python][C++] Add version suffix to libarrow_python* librar…
raulcd Nov 13, 2024
ae0f60a
MINOR: [Java] Bump com.puppycrawl.tools:checkstyle from 10.20.0 to 10…
dependabot[bot] Nov 13, 2024
d7bc378
GH-44706: [Release][Archery][Packaging] Add "so_version" variable (#4…
kou Nov 13, 2024
d534e77
GH-44679: [C++][Python] Fix Flight Timestamp precision, revert workar…
EnricoMi Nov 14, 2024
479e350
GH-44716: [Dev][Integration] Add numpy to archery integration deps (#…
amoeba Nov 14, 2024
354a22f
MINOR: [Release] Proactively remove java-jni-dist directory in verifi…
amoeba Nov 14, 2024
65f498b
GH-44479: [CI][Archery] Add missing Flight integration targets (#44691)
kou Nov 14, 2024
26aa75f
GH-44223: [Dev] Use "Gandiva" instead of "C++ - Gandiva" label (#44722)
kou Nov 14, 2024
474c675
MINOR: [Docs] Add Opaque to implementation status matrix (#44723)
lidavidm Nov 14, 2024
29e8ea0
GH-44491: [C++] StatusConstant- cheaply copied const Status (#44493)
bkietz Nov 14, 2024
3b3af5d
GH-43080: [CI][Dev] Enable shellcheck (#44724)
kou Nov 15, 2024
1afae0b
GH-44726: [CI] Update substrait consumer call to use updated producer…
raulcd Nov 15, 2024
3f25672
GH-43808: [C++] skip `-0117` in StrptimeZoneOffset for old glibc (#44…
h-vetinari Nov 15, 2024
736d706
GH-43570: [CI][Dev][Docs] Update references to "docker-compose" with …
amoeba Nov 15, 2024
df40f7a
GH-43951: [CI][Python] Use GitHub Packages for vcpkg cache (#44644)
kou Nov 15, 2024
133e114
GH-44703: [CI][MATLAB][Packaging] Update MATLAB CI and `crossbow` pac…
kevingurney Nov 16, 2024
1df960e
MINOR: [JS] Bump cross-spawn from 7.0.3 to 7.0.5 in /js (#44746)
dependabot[bot] Nov 17, 2024
6427ed4
GH-27919: [CI][C++] Add a nightly job to test offline build (#44721)
kou Nov 17, 2024
e79f65c
MINOR: [Java] Bump `orc-core` to 1.9.5 (#44743)
dongjoon-hyun Nov 17, 2024
23328ff
GH-44754: [C++] Use lowercased `windows.h` to enable cross-platform b…
evetion Nov 17, 2024
64a3768
MINOR: [Docs][Python] Fix a few minor typos (#44756)
IndexSeek Nov 17, 2024
00de992
GH-44744: [C++] Upgrade ORC to 2.0.3 (#44745)
dongjoon-hyun Nov 17, 2024
99bdb80
MINOR: [C++] Fix the simple typo in the test (#44747)
c8ef Nov 18, 2024
4c2aef7
GH-44749: [CI][Dev] Apply ShellCheck lint to ci/scripts/c_glib_test.s…
kou Nov 18, 2024
aa26f28
GH-44651: [Python] Allow from_buffers to work with StringView on Pyth…
raulcd Nov 18, 2024
59decc3
GH-44767: [C++] Fix Float16.To{Little,Big}Endian on big endian machin…
QuLogic Nov 18, 2024
152e878
GH-44714: [C++] Keep field metadata for keys and values when importin…
paleolimbot Nov 18, 2024
4dc0492
GH-44734: [C++][CI] Fix arrow-c-bridge-test timeout with threading di…
zeroshade Nov 18, 2024
ad75248
GH-43410: [Python] Support Arrow PyCapsule stream objects in write_da…
jorisvandenbossche Nov 18, 2024
ea8b1d3
GH-44770: [Java] Update minor protobuf version to avoid CVE-2024-7254…
raulcd Nov 18, 2024
bc36282
GH-44773: [Dev][Doc] Remove obsolete Read the docs configuration (#44…
raulcd Nov 18, 2024
66c2cf0
GH-44742: [Ruby] Fix a bug that empty struct list value can't be buil…
kou Nov 19, 2024
60cc251
GH-44710: [Docs][C++] Add `arrow::ArrayStatistics` to API doc (#44764)
kou Nov 19, 2024
7cccd5f
MINOR: [C#] Bump ZstdSharp.Port from 0.8.1 to 0.8.2 in /csharp (#44781)
dependabot[bot] Nov 19, 2024
4cfe8d2
MINOR: [C#] Bump System.Threading.Tasks.Extensions and System.Runtime…
dependabot[bot] Nov 19, 2024
6134b2e
MINOR: [Java] Bump io.netty:netty-bom from 4.1.114.Final to 4.1.115.F…
dependabot[bot] Nov 19, 2024
9d19450
GH-44700: [C++][Parquet] Remove obsolete parquet_constants generated …
raulcd Nov 19, 2024
1e14b61
MINOR: [C#] Bump System.Memory, System.Buffers and System.Runtime.Com…
dependabot[bot] Nov 19, 2024
7dd66db
MINOR: [Java] Bump parquet.version from 1.14.3 to 1.14.4 in /java (#4…
dependabot[bot] Nov 19, 2024
29d9a23
MINOR: [Java] Bump org.codehaus.mojo:versions-maven-plugin from 2.17.…
dependabot[bot] Nov 19, 2024
8f8ac38
MINOR: [Java] Bump com.google.api.grpc:proto-google-common-protos fro…
dependabot[bot] Nov 19, 2024
1302889
GH-43124: [C++] Initialize offset vector head as 0 after memory alloc…
flashzxi Nov 19, 2024
33e8cbb
GH-44784: [C++][Parquet] Add `arrow::Result` version of `parquet::arr…
kou Nov 20, 2024
501418e
GH-44690: [C++] NumericBuilder::AppendValues append vector prevent fr…
mapleFU Nov 20, 2024
9015a81
GH-44769: [C++][Parquet] Fix read/write of metadata length footer on …
QuLogic Nov 20, 2024
54f8cf9
GH-44728: [Python] Trigger manual Garbage collection before checking …
raulcd Nov 21, 2024
1 change: 0 additions & 1 deletion .asf.yaml
@@ -25,7 +25,6 @@ github:
- js8544
- laurentgo
- vibhatha
- zanmato1984
- ZhangHuiGui

notifications:
2 changes: 1 addition & 1 deletion .env
@@ -97,7 +97,7 @@ VCPKG="943c5ef1c8f6b5e6ced092b242c8299caae2ff01" # 2024.04.26 Release
# use pulled built images in dev/tasks/python-wheels/github.windows.yml.
PYTHON_WHEEL_WINDOWS_IMAGE_REVISION=2024-08-06

# Use conanio/${CONAN_BASE}:{CONAN_VERSION} for "docker-compose run --rm conan".
# Use conanio/${CONAN_BASE}:{CONAN_VERSION} for "docker compose run --rm conan".
# See https://github.com/conan-io/conan-docker-tools#readme and
# https://hub.docker.com/u/conanio for available images.
CONAN_BASE=gcc10
1 change: 0 additions & 1 deletion .github/CODEOWNERS
@@ -46,7 +46,6 @@

# Docs
# /docs/
# .readthedocs.yml
# *.md
# *.rmd
# *.rst
4 changes: 2 additions & 2 deletions .github/ISSUE_TEMPLATE/bug_report.yaml
@@ -38,15 +38,15 @@ body:
- C
- C#
- C++
- C++ - Gandiva
- Continuous Integration
- Developer Tools
- Documentation
- FlightRPC
- Format
- GLib
- Go
- GPU
- Gandiva
- Go
- Integration
- Java
- JavaScript
4 changes: 2 additions & 2 deletions .github/ISSUE_TEMPLATE/feature_request.yaml
@@ -41,15 +41,15 @@ body:
- C
- C#
- C++
- C++ - Gandiva
- Continuous Integration
- Developer Tools
- Documentation
- FlightRPC
- Format
- GLib
- Go
- GPU
- Gandiva
- Go
- Integration
- Java
- JavaScript
4 changes: 2 additions & 2 deletions .github/ISSUE_TEMPLATE/usage_question.yaml
@@ -61,15 +61,15 @@ body:
- C
- C#
- C++
- C++ - Gandiva
- Continuous Integration
- Developer Tools
- Documentation
- FlightRPC
- Format
- GLib
- Go
- GPU
- Gandiva
- Go
- Integration
- Java
- JavaScript
5 changes: 3 additions & 2 deletions .github/workflows/dev_pr/labeler.yml
@@ -53,11 +53,12 @@
- r/R/flight.*
- python/pyarrow/*flight.*

"Component: C++ - Gandiva":
"Component: Gandiva":
- c_glib/gandiva-glib/**/*
- cpp/src/gandiva/**/*
- ruby/red-gandiva/**/*
- java/gandiva/**/*
- python/pyarrow/gandiva.*
- ruby/red-gandiva/**/*

"Component: Parquet":
- c_glib/parquet-glib/**/*
6 changes: 3 additions & 3 deletions .github/workflows/matlab.yml
@@ -75,7 +75,7 @@ jobs:
- name: Install MATLAB
uses: matlab-actions/setup-matlab@v2
with:
release: R2024a
release: R2024b
- name: Install ccache
run: sudo apt-get install ccache
- name: Setup ccache
@@ -123,7 +123,7 @@ jobs:
- name: Install MATLAB
uses: matlab-actions/setup-matlab@v2
with:
release: R2024a
release: R2024b
- name: Install ccache
run: brew install ccache
- name: Setup ccache
@@ -162,7 +162,7 @@ jobs:
- name: Install MATLAB
uses: matlab-actions/setup-matlab@v2
with:
release: R2024a
release: R2024b
- name: Download Timezone Database
shell: bash
run: ci/scripts/download_tz_database.sh
10 changes: 10 additions & 0 deletions .pre-commit-config.yaml
@@ -173,3 +173,13 @@ repos:
'--disable',
'dangling-hyphen,line-too-long',
]
- repo: https://github.com/koalaman/shellcheck-precommit
rev: v0.10.0
hooks:
- id: shellcheck
# TODO: Remove this when we fix all lint failures
files: >-
(
?^ci/scripts/c_glib_build\.sh$|
?^ci/scripts/c_glib_test\.sh$|
)
19 changes: 0 additions & 19 deletions .readthedocs.yml

This file was deleted.

26 changes: 14 additions & 12 deletions c_glib/parquet-glib/arrow-file-reader.cpp
@@ -134,12 +134,13 @@ gparquet_arrow_file_reader_new_arrow(GArrowSeekableInputStream *source, GError *
{
auto arrow_random_access_file = garrow_seekable_input_stream_get_raw(source);
auto arrow_memory_pool = arrow::default_memory_pool();
std::unique_ptr<parquet::arrow::FileReader> parquet_arrow_file_reader;
auto status = parquet::arrow::OpenFile(arrow_random_access_file,
arrow_memory_pool,
&parquet_arrow_file_reader);
if (garrow_error_check(error, status, "[parquet][arrow][file-reader][new-arrow]")) {
return gparquet_arrow_file_reader_new_raw(parquet_arrow_file_reader.release());
auto parquet_arrow_file_reader_result =
parquet::arrow::OpenFile(arrow_random_access_file, arrow_memory_pool);
if (garrow::check(error,
parquet_arrow_file_reader_result,
"[parquet][arrow][file-reader][new-arrow]")) {
return gparquet_arrow_file_reader_new_raw(
parquet_arrow_file_reader_result->release());
} else {
return NULL;
}
@@ -168,12 +169,13 @@ gparquet_arrow_file_reader_new_path(const gchar *path, GError **error)
std::shared_ptr<arrow::io::RandomAccessFile> arrow_random_access_file =
arrow_memory_mapped_file.ValueOrDie();
auto arrow_memory_pool = arrow::default_memory_pool();
std::unique_ptr<parquet::arrow::FileReader> parquet_arrow_file_reader;
auto status = parquet::arrow::OpenFile(arrow_random_access_file,
arrow_memory_pool,
&parquet_arrow_file_reader);
if (garrow::check(error, status, "[parquet][arrow][file-reader][new-path]")) {
return gparquet_arrow_file_reader_new_raw(parquet_arrow_file_reader.release());
auto parquet_arrow_file_reader_result =
parquet::arrow::OpenFile(arrow_random_access_file, arrow_memory_pool);
if (garrow::check(error,
parquet_arrow_file_reader_result,
"[parquet][arrow][file-reader][new-path]")) {
return gparquet_arrow_file_reader_new_raw(
parquet_arrow_file_reader_result->release());
} else {
return NULL;
}
43 changes: 27 additions & 16 deletions ci/docker/python-wheel-manylinux.dockerfile
@@ -69,36 +69,47 @@ RUN /arrow/ci/scripts/install_ccache.sh ${ccache} /usr/local
ARG vcpkg
COPY ci/vcpkg/*.patch \
ci/vcpkg/*linux*.cmake \
ci/vcpkg/vcpkg.json \
arrow/ci/vcpkg/
COPY ci/scripts/install_vcpkg.sh \
arrow/ci/scripts/
ENV VCPKG_ROOT=/opt/vcpkg
ARG build_type=release
ENV CMAKE_BUILD_TYPE=${build_type} \
VCPKG_FORCE_SYSTEM_BINARIES=1 \
VCPKG_OVERLAY_TRIPLETS=/arrow/ci/vcpkg \
PATH="${PATH}:${VCPKG_ROOT}" \
VCPKG_DEFAULT_TRIPLET=${arch_short}-linux-static-${build_type} \
VCPKG_FEATURE_FLAGS="manifests"

RUN arrow/ci/scripts/install_vcpkg.sh ${VCPKG_ROOT} ${vcpkg}
ENV PATH="${PATH}:${VCPKG_ROOT}"

COPY ci/vcpkg/vcpkg.json arrow/ci/vcpkg/
# cannot use the S3 feature here because while aws-sdk-cpp=1.9.160 contains
# ssl related fixes as well as we can patch the vcpkg portfile to support
# arm machines it hits ARROW-15141 where we would need to fall back to 1.8.186
# but we cannot patch those portfiles since vcpkg-tool handles the checkout of
# previous versions => use bundled S3 build
RUN vcpkg install \
VCPKG_FEATURE_FLAGS="manifests" \
VCPKG_FORCE_SYSTEM_BINARIES=1 \
VCPKG_OVERLAY_TRIPLETS=/arrow/ci/vcpkg
# For --mount=type=secret: The GITHUB_TOKEN is the only real secret but we use
# --mount=type=secret for GITHUB_REPOSITORY_OWNER and
# VCPKG_BINARY_SOURCES too because we don't want to store them
# into the built image in order to easily reuse the built image cache.
#
# For vcpkg install: cannot use the S3 feature here because while
# aws-sdk-cpp=1.9.160 contains ssl related fixes as well as we can
# patch the vcpkg portfile to support arm machines it hits ARROW-15141
# where we would need to fall back to 1.8.186 but we cannot patch
# those portfiles since vcpkg-tool handles the checkout of previous
# versions => use bundled S3 build
RUN --mount=type=secret,id=github_repository_owner \
--mount=type=secret,id=github_token \
--mount=type=secret,id=vcpkg_binary_sources \
export GITHUB_REPOSITORY_OWNER=$(cat /run/secrets/github_repository_owner); \
export GITHUB_TOKEN=$(cat /run/secrets/github_token); \
export VCPKG_BINARY_SOURCES=$(cat /run/secrets/vcpkg_binary_sources); \
arrow/ci/scripts/install_vcpkg.sh ${VCPKG_ROOT} ${vcpkg} && \
vcpkg install \
--clean-after-build \
--x-install-root=${VCPKG_ROOT}/installed \
--x-manifest-root=/arrow/ci/vcpkg \
--x-feature=azure \
--x-feature=azure \
--x-feature=flight \
--x-feature=gcs \
--x-feature=json \
--x-feature=parquet \
--x-feature=s3
--x-feature=s3 && \
rm -rf ~/.config/NuGet/

# Make sure auditwheel is up-to-date
RUN pipx upgrade auditwheel
46 changes: 25 additions & 21 deletions ci/scripts/c_glib_build.sh
@@ -23,59 +23,63 @@ source_dir=${1}/c_glib
build_dir=${2}/c_glib
build_root=${2}

: ${ARROW_GLIB_WERROR:=false}
: ${ARROW_GLIB_VAPI:=true}
: ${BUILD_DOCS_C_GLIB:=OFF}
: "${ARROW_GLIB_WERROR:=false}"
: "${ARROW_GLIB_VAPI:=true}"
: "${BUILD_DOCS_C_GLIB:=OFF}"
with_doc=$([ "${BUILD_DOCS_C_GLIB}" == "ON" ] && echo "true" || echo "false")


if [ -n "${MSYSTEM:-}" ]; then
# Fix ARROW_HOME when running under MSYS2
export ARROW_HOME="$(cygpath --unix "${ARROW_HOME}")"
ARROW_HOME="$(cygpath --unix "${ARROW_HOME}")"
export ARROW_HOME
fi

export PATH="${ARROW_HOME}/bin:${PATH}"
PATH="${ARROW_HOME}/bin:${PATH}"

meson_pkg_config_path="${ARROW_HOME}/lib/pkgconfig"

mkdir -p ${build_dir}
mkdir -p "${build_dir}"

if [ -n "${VCPKG_ROOT:-}" -a -n "${VCPKG_TRIPLET:-}" ]; then
if [ -n "${VCPKG_ROOT:-}" ] && [ -n "${VCPKG_TRIPLET:-}" ]; then
vcpkg_install_root="${build_root}/vcpkg_installed"
$VCPKG_ROOT/vcpkg install --x-manifest-root=${source_dir} --x-install-root=${vcpkg_install_root}
export PKG_CONFIG="${vcpkg_install_root}/${VCPKG_TRIPLET}/tools/pkgconf/pkgconf.exe"
"${VCPKG_ROOT}/vcpkg" install \
--x-manifest-root="${source_dir}" \
--x-install-root="${vcpkg_install_root}"
PKG_CONFIG="${vcpkg_install_root}/${VCPKG_TRIPLET}/tools/pkgconf/pkgconf.exe"
export PKG_CONFIG
meson_pkg_config_path="${vcpkg_install_root}/${VCPKG_TRIPLET}/lib/pkgconfig:${meson_pkg_config_path}"
# Configure PATH for libraries required by the gobject-introspection generated binary
cpp_vcpkg_install_root="${build_root}/cpp/vcpkg_installed"
PATH="${cpp_vcpkg_install_root}/${VCPKG_TRIPLET}/debug/bin:${PATH}"
PATH="${cpp_vcpkg_install_root}/${VCPKG_TRIPLET}/bin:${PATH}"
export PATH="${vcpkg_install_root}/${VCPKG_TRIPLET}/bin:${PATH}"
PATH="${vcpkg_install_root}/${VCPKG_TRIPLET}/bin:${PATH}"
fi

if [ -n "${VCToolsInstallDir:-}" -a -n "${MSYSTEM:-}" ]; then
if [ -n "${VCToolsInstallDir:-}" ] && [ -n "${MSYSTEM:-}" ]; then
# Meson finds the gnu link.exe instead of MSVC link.exe when running in MSYS2/git bash,
# so we need to make sure the MSCV link.exe is first in $PATH
export PATH="$(cygpath --unix "${VCToolsInstallDir}")/bin/HostX64/x64:${PATH}"
PATH="$(cygpath --unix "${VCToolsInstallDir}")/bin/HostX64/x64:${PATH}"
fi

# Build with Meson
meson setup \
--backend=ninja \
--prefix=$ARROW_HOME \
--prefix="${ARROW_HOME}" \
--libdir=lib \
--pkg-config-path="${meson_pkg_config_path}" \
-Ddoc=${with_doc} \
-Dvapi=${ARROW_GLIB_VAPI} \
-Dwerror=${ARROW_GLIB_WERROR} \
${build_dir} \
${source_dir}
-Ddoc="${with_doc}" \
-Dvapi="${ARROW_GLIB_VAPI}" \
-Dwerror="${ARROW_GLIB_WERROR}" \
"${build_dir}" \
"${source_dir}"

pushd ${build_dir}
pushd "${build_dir}"
ninja
ninja install
popd

if [ "${BUILD_DOCS_C_GLIB}" == "ON" ]; then
mkdir -p ${build_root}/docs/c_glib
cp -a ${ARROW_HOME}/share/doc/*-glib/ ${build_root}/docs/c_glib/
mkdir -p "${build_root}/docs/c_glib"
cp -a "${ARROW_HOME}"/share/doc/*-glib/ "${build_root}/docs/c_glib/"
fi
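
Reviewer note: two of the rewrites in ci/scripts/c_glib_build.sh appear to follow common ShellCheck advice. One separates `export` from an assignment that uses command substitution. The other replaces `-a` inside `[ ]` with two tests joined by `&&`. The snippet below is a minimal sketch of why the first change matters and what the second looks like. It is not part of the PR. `failing_convert` and `both_set` are hypothetical names, and `failing_convert` only stands in for a command such as `cygpath`.

```shell
# Hypothetical stand-in for a command such as `cygpath`; it always fails.
failing_convert() {
  return 1
}

# Combined form: `export` itself succeeds, so the failure of the
# command substitution is masked.
# shellcheck disable=SC2155
export COMBINED="$(failing_convert)"
combined_status=$?

# Split form, as in the diff: a plain assignment returns the exit status
# of the command substitution, so the failure stays visible.
split_status=0
SPLIT="$(failing_convert)" || split_status=$?
export SPLIT

# Two separate tests joined with `&&` take the place of `-a` inside `[ ]`.
both_set() {
  [ -n "${1:-}" ] && [ -n "${2:-}" ]
}

echo "combined=${combined_status} split=${split_status}"
```

Run as written, this prints `combined=0 split=1`. With the split form, a script running under `set -e` would stop at the failing conversion instead of continuing with an empty value.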
6 changes: 3 additions & 3 deletions ci/scripts/c_glib_test.sh
@@ -22,7 +22,7 @@ set -ex
source_dir=${1}/c_glib
build_dir=${2}/c_glib

: ${ARROW_GLIB_VAPI:=true}
: "${ARROW_GLIB_VAPI:=true}"

export DYLD_LIBRARY_PATH=${ARROW_HOME}/lib:${DYLD_LIBRARY_PATH}
export LD_LIBRARY_PATH=${ARROW_HOME}/lib:${LD_LIBRARY_PATH}
@@ -34,7 +34,7 @@ if [ -z "${ARROW_DEBUG_MEMORY_POOL}" ]; then
export ARROW_DEBUG_MEMORY_POOL=trap
fi

pushd ${source_dir}
pushd "${source_dir}"

ruby test/run-test.rb

@@ -51,7 +51,7 @@ fi

popd

pushd ${build_dir}
pushd "${build_dir}"
example/build
example/extension-type
if [ "${ARROW_GLIB_VAPI}" = "true" ]; then
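
Reviewer note: the changes in ci/scripts/c_glib_test.sh are all quoting fixes, both for the directory arguments and for the `: "${VAR:=default}"` idiom. The snippet below is a rough sketch of the behavior the quoting protects, assuming a path that contains a space. It is not taken from the PR. `work_dir`, `DEMO_VAPI`, and the directory name are made up for the example, and `cd` in a subshell is used in place of `pushd` so the snippet also runs in a plain POSIX shell.

```shell
# Hypothetical scratch directory whose name contains a space.
work_dir="$(mktemp -d)"
source_dir="${work_dir}/c glib"
mkdir -p "${source_dir}"

# Quoted: the whole path reaches `cd` as a single argument.
quoted_dir="$(cd "${source_dir}" && basename "$(pwd)")"

# Unquoted: word splitting breaks the path into two arguments, so `cd` fails.
unquoted_status=0
# shellcheck disable=SC2086
(cd ${source_dir}) > /dev/null 2>&1 || unquoted_status=$?

# Quoted default assignment: sets the variable only when it is unset or empty.
unset DEMO_VAPI
: "${DEMO_VAPI:=true}"

rm -rf "${work_dir}"
echo "quoted_dir='${quoted_dir}' unquoted_status=${unquoted_status} DEMO_VAPI=${DEMO_VAPI}"
```

The quoted `cd` lands in the directory named `c glib`, while the unquoted one exits with a nonzero status. CI paths rarely contain spaces, so the change is mostly defensive, but it lets the two scripts pass the new shellcheck pre-commit hook.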