Closed
88 commits
75b3166
ARROW-4523,ARROW-4524: [JS] Add row proxy generation benchmark, impro…
TheNeuralBit Feb 13, 2019
5ed6fb5
ARROW-4550: [JS] Fix AMD pattern
mbostock Feb 13, 2019
27ba26c
ARROW-4513: [Rust] Implement BitAnd/BitOr for &Bitmap
paddyhoran Feb 13, 2019
d831e2c
ARROW-47: [C++] Preliminary arrow::Scalar object model
wesm Feb 13, 2019
51b5143
ARROW-4539: [Java] Fix child vector count for lists. (#3625)
praveenbingo Feb 13, 2019
69d595a
ARROW-4558: [C++][Flight] Implement gRPC customizations without UB
wesm Feb 13, 2019
2571b03
ARROW-4340: [C++][CI] Build IWYU for LLVM 7 in iwyu docker-compose job
fsaintjacques Feb 14, 2019
49c70cc
ARROW-4204: [Gandiva] add support for decimal subtract
Feb 14, 2019
148213e
ARROW-3162: Flight Python bindings
Feb 14, 2019
b9819e8
ARROW-4563: [Python] Validate decimal128() precision input
pitrou Feb 14, 2019
47ebb1a
ARROW-1896: [C++] Do not allocate memory inside CastKernel. Clean up …
wesm Feb 15, 2019
bf138a8
ARROW-4576: [Python] Fix error during benchmarks
pitrou Feb 15, 2019
40b0c88
ARROW-3669: [Python] Raise error on Numpy byte-swapped array
pitrou Feb 15, 2019
10e8942
ARROW-4529: [C++] Add test for BitUtil::RoundDown
fsaintjacques Feb 15, 2019
09cb71c
ARROW-4474: Use signed integers in FlightInfo payload size fields
lidavidm Feb 15, 2019
bdbf630
ARROW-4582: [Python/C++] Acquire the GIL on Py_INCREF
xhochy Feb 15, 2019
341f1b2
ARROW-4585: [C++] Add protoc dependency to flight_testing
xhochy Feb 15, 2019
524a9dc
ARROW-4577: [C++] Don't set interface link libs on arrow_shared where…
xhochy Feb 15, 2019
e1bc0d4
ARROW-4586: [Rust] Remove arrow/mod.rs as it is not needed
paddyhoran Feb 16, 2019
a374c3c
ARROW-4490: [Rust] Add explicit SIMD vectorization for boolean ops in…
paddyhoran Feb 16, 2019
bbca717
ARROW-4341: [C++] Refactor Primitive builders and BooleanBuilder to u…
bkietz Feb 16, 2019
b74cc65
ARROW-4592: [GLib] Stop configure immediately when GLib isn't available
kou Feb 16, 2019
aa765aa
ARROW-4594: [Ruby] Arrow::StructArray#[] returns Arrow::Struct instea…
kou Feb 16, 2019
0c340b4
ARROW-4593: [Ruby] Arrow::Array#[out_of_range] returns nil
kou Feb 17, 2019
bb78e8b
ARROW-4584: [Python] Add built wheel to manylinux1 dockerignore
xhochy Feb 17, 2019
cfbe0ce
ARROW-4377: [Rust] Implement std::fmt::Debug for PrimitiveArrays
Feb 17, 2019
a7e93a5
ARROW-4598: [CI] Remove needless LLVM_DIR for macOS
kou Feb 17, 2019
3f3da9d
PARQUET-1532: [C++] Fix build error with MinGW
kou Feb 17, 2019
3606aa2
ARROW-4589: [Rust] Projection push down query optimizer rule
andygrove Feb 17, 2019
5e2445b
ARROW-4601: [Python] Add license header to dockerignore
xhochy Feb 17, 2019
811c7dc
ARROW-4464: [Rust] [DataFusion] Add support for LIMIT
Feb 17, 2019
240c469
ARROW-4265: [C++] Automatic conversion between Table and std::vector<…
xhochy Feb 18, 2019
2df33de
ARROW-4571: [Format] Tensor.fbs file has multiple root_type declarations
mrkn Feb 18, 2019
a65798a
ARROW-4196: [Rust] Add explicit SIMD vectorization for arithmetic ops…
paddyhoran Feb 18, 2019
c494f88
ARROW-4540: [Rust] Basic JSON reader
nevi-me Feb 18, 2019
103a231
[Docs] Add more links to README
feluelle Feb 18, 2019
0831222
ARROW-4613: [C++] Set CMAKE_INSTALL_LIBDIR in gtest thirdparty build
xhochy Feb 18, 2019
f19f031
[Python][Docker] Ignore site and manylinux1 for docker builds
xhochy Feb 18, 2019
0c6b2d2
ARROW-4612: [Python] Use cython from PyPI for windows wheels build
xhochy Feb 18, 2019
76979c9
ARROW-4420: [INTEGRATION] Make spark integration test pass and test a…
kszucs Feb 18, 2019
8cf05f9
ARROW-4597: [C++] Targets for system Google Mock shared library are m…
kou Feb 18, 2019
c8007dd
ARROW-4619: [R] Fix the autobrew script
jeroen Feb 18, 2019
9600d8b
ARROW-4543: [C#] Update Flat Buffers code to latest version
eerhardt Feb 18, 2019
d0d810b
ARROW-4546: Update LICENSE.txt with parquet-cpp licenses
fsaintjacques Feb 19, 2019
d532056
ARROW-4616: [C++] Log message in BuildUtils as STATUS
xhochy Feb 19, 2019
5680042
ARROW-4531: [C++] Support slices for SumKernel
fsaintjacques Feb 19, 2019
54fcb06
ARROW-4602: [Rust] [DataFusion] Integrate query optimizer with Execut…
andygrove Feb 19, 2019
aeb40ed
ARROW-4565: [R] Fix decimal record batches with no null values
javierluraschi Feb 19, 2019
24a98e4
ARROW-4599: [C++] Add support for system GFlags
kou Feb 19, 2019
5aaf700
ARROW-4142: [Java] JDBC Array -> Arrow ListVector
Feb 19, 2019
135d481
ARROW-4583: [Plasma] Fix some small bugs reported by code scan tool
Feb 19, 2019
707bac2
ARROW-4624: [C++] Fix building benchmarks
pitrou Feb 19, 2019
6fd5070
ARROW-4347: [CI][Python] Also run Python builds when Java affected.
wesm Feb 19, 2019
bd57703
ARROW-4623: [R] update Rcpp version
romainfrancois Feb 19, 2019
28bc3d1
ARROW-4556: [Rust] Preserve JSON field order when inferring schema
nevi-me Feb 19, 2019
2e61bcf
ARROW-4587: [C++] Fix segfaults around DoPut implementation
Feb 19, 2019
ef28f20
ARROW-4618: [Docker] Makefile to build dependent docker images
kszucs Feb 19, 2019
24f83be
ARROW-4632: [Ruby] Add BigDecimal#to_arrow
mrkn Feb 20, 2019
09cfd46
ARROW-4581: [C++] Do not require googletest_ep or gbenchmark_ep for l…
wesm Feb 20, 2019
6c4118b
ARROW-4562: [C++] Avoid copies when serializing Flight data
pitrou Feb 20, 2019
957fe15
ARROW-4629: [Python] Pandas arrow conversion slowed down by imports
fjetter Feb 20, 2019
9c19bb6
ARROW-694: [C++] Initial parser interface for reading JSON into Recor…
bkietz Feb 20, 2019
3fd18e3
ARROW-4267: [C++/Parquet] Handle duplicate and struct columns in RowG…
xhochy Feb 20, 2019
b228489
ARROW-4610: [Plasma] Avoid Crash in Plasma Java Client
Feb 20, 2019
d3c5b85
ARROW-3532: [Python] Emit warning when looking up for duplicate struc…
pitrou Feb 20, 2019
3173b0d
ARROW-4615: [C++] Add checked_pointer_cast
xhochy Feb 20, 2019
717328f
ARROW-4639: [CI] Switch off GFLAGS_SHARED for osx
Feb 21, 2019
2e4220e
ARROW-4635: [Java] allocateNew to use last capacity
Feb 21, 2019
6711404
ARROW-4559: [Python] Allow Parquet files with special characters in t…
pitrou Feb 21, 2019
a977250
ARROW-3981: [C++] Rename json.h
pitrou Feb 21, 2019
e8cc48b
ARROW-4372: [C++] Embed precompiled bitcode in the gandiva library
kszucs Feb 21, 2019
1442fb6
ARROW-4552: [JS] Add high-level Table and Column convenience methods
trxcllnt Feb 21, 2019
79f9db1
add selectAt() method to Table, Schema, and RecordBatch for selecting…
trxcllnt Feb 13, 2019
b3b4f1f
add Table and Schema assign() impls
trxcllnt Feb 13, 2019
8eabb1c
clean up/speed up: move common argument flattening methods into a uti…
trxcllnt Feb 13, 2019
a4222f8
clean up: eliminate more getters in favor of read-only properties
trxcllnt Feb 13, 2019
9c7ed3d
guard against out-of-bounds selections
trxcllnt Feb 13, 2019
69abf40
add initial RecordBatch.new and select tests
trxcllnt Feb 18, 2019
4656ea5
cleanup/rename Table + Schema + RecordBatch from -> new, cleanup argu…
trxcllnt Feb 18, 2019
854ae66
ensure Int/FloatVector.from return signatures are as specific as poss…
trxcllnt Feb 14, 2019
d561204
ensure Float16Vector.toArray() is zero-copy again, add toFloat32Array…
trxcllnt Feb 14, 2019
77fcd40
add initial BigInt64Array and BigUint64Array support
trxcllnt Feb 14, 2019
760a219
update BN to use BigIntArrays for signed/unsigned 64bit integers if p…
trxcllnt Feb 14, 2019
94d5633
support BigInt in comparitor/indexOf
trxcllnt Feb 14, 2019
6046e66
remove more getters in favor of readonly direct property accesses
trxcllnt Feb 14, 2019
7ac081a
fix lint
trxcllnt Feb 14, 2019
f44e97b
ensure truncated bitmap size isn't larger than it should be
trxcllnt Feb 18, 2019
69ee6f7
cleanup after rebase
trxcllnt Feb 21, 2019
4 changes: 4 additions & 0 deletions .dockerignore
@@ -47,6 +47,7 @@ cpp/thirdparty
# python
python/build
python/dist
python/manylinux1/dist
python/*.egg-info
python/*.egg
python/*.pyc
@@ -90,3 +91,6 @@ js/test/__snapshots__

# Rust
rust/target

# site
site/
2 changes: 2 additions & 0 deletions .travis.yml
@@ -225,6 +225,8 @@ matrix:
- $TRAVIS_BUILD_DIR/ci/travis_script_python.sh 3.6
- name: "[manylinux1] Python"
language: cpp
env:
- PYTHON_VERSIONS="2.7,32 3.6,16"
before_script:
- if [ $ARROW_CI_PYTHON_AFFECTED == "1" ]; then docker pull quay.io/xhochy/arrow_manylinux1_x86_64_base:llvm-7-manylinux1; fi
script:
112 changes: 53 additions & 59 deletions LICENSE.txt
@@ -437,65 +437,6 @@ SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

--------------------------------------------------------------------------------

This project includes code from the Boost project

Boost Software License - Version 1.0 - August 17th, 2003

Permission is hereby granted, free of charge, to any person or organization
obtaining a copy of the software and accompanying documentation covered by
this license (the "Software") to use, reproduce, display, distribute,
execute, and transmit the Software, and to prepare derivative works of the
Software, and to permit third-parties to whom the Software is furnished to
do so, all subject to the following:

The copyright notices in the Software and this entire statement, including
the above license grant, this restriction and the following disclaimer,
must be included in all copies of the Software, in whole or in part, and
all derivative works of the Software, unless such copies or derivative
works are solely in the form of machine-executable object code generated by
a source language processor.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE, TITLE AND NON-INFRINGEMENT. IN NO EVENT
SHALL THE COPYRIGHT HOLDERS OR ANYONE DISTRIBUTING THE SOFTWARE BE LIABLE
FOR ANY DAMAGES OR OTHER LIABILITY, WHETHER IN CONTRACT, TORT OR OTHERWISE,
ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER
DEALINGS IN THE SOFTWARE.

--------------------------------------------------------------------------------

This project includes code from the mapbox/variant project, BSD 3-clause
license

Copyright (c) MapBox
All rights reserved.

Redistribution and use in source and binary forms, with or without modification,
are permitted provided that the following conditions are met:

- Redistributions of source code must retain the above copyright notice, this
list of conditions and the following disclaimer.
- Redistributions in binary form must reproduce the above copyright notice, this
list of conditions and the following disclaimer in the documentation and/or
other materials provided with the distribution.
- Neither the name "MapBox" nor the names of its contributors may be
used to endorse or promote products derived from this software without
specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR
ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
(INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

--------------------------------------------------------------------------------

This project includes code from the FlatBuffers project

Copyright 2014 Google Inc.
@@ -832,3 +773,56 @@ SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR
TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF
THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

--------------------------------------------------------------------------------

This project includes code from Apache Kudu.

* cpp/cmake_modules/CompilerInfo.cmake is based on Kudu's cmake_modules/CompilerInfo.cmake

Copyright: 2016 The Apache Software Foundation.
Home page: https://kudu.apache.org/
License: http://www.apache.org/licenses/LICENSE-2.0

--------------------------------------------------------------------------------

This project includes code from Apache Impala (incubating), formerly
Impala. The Impala code and rights were donated to the ASF as part of the
Incubator process after the initial code imports into Apache Parquet.

Copyright: 2012 Cloudera, Inc.
Copyright: 2016 The Apache Software Foundation.
Home page: http://impala.apache.org/
License: http://www.apache.org/licenses/LICENSE-2.0

--------------------------------------------------------------------------------

This project includes code from Apache Aurora.

* dev/release/{release,changelog,release-candidate} are based on the scripts from
Apache Aurora

Copyright: 2016 The Apache Software Foundation.
Home page: https://aurora.apache.org/
License: http://www.apache.org/licenses/LICENSE-2.0

--------------------------------------------------------------------------------

This project includes code from the Google styleguide.

* cpp/build-support/cpplint.py is based on the scripts from the Google styleguide.

Copyright: 2009 Google Inc. All rights reserved.
Homepage: https://github.com/google/styleguide
License: 3-clause BSD

--------------------------------------------------------------------------------

This project includes code from Snappy.

* cpp/cmake_modules/{SnappyCMakeLists.txt,SnappyConfig.h} are based on code
from Google's Snappy project.

Copyright: 2009 Google Inc. All rights reserved.
Homepage: https://github.com/google/snappy
License: 3-clause BSD
85 changes: 85 additions & 0 deletions Makefile.docker
@@ -0,0 +1,85 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

# build docker compose images:
# $ make -f Makefile.docker cpp
# run the built image:
# $ make -f Makefile.docker run cpp

.PHONY: clean run cpp cpp-alpine go js java rust r

DC := docker-compose

clean:
	$(DC) down -v

run:
	$(DC) run --rm $(filter-out $@,$(MAKECMDGOALS))

go:
	$(DC) build go

js:
	$(DC) build js

java:
	$(DC) build java

rust:
	$(DC) build rust

cpp:
	$(DC) build cpp

cpp-alpine:
	$(DC) build cpp-alpine

cpp-cmake32:
	$(DC) build cpp-cmake32

c_glib: cpp
	$(DC) build c_glib

r: cpp
	$(DC) build r

python: cpp
	$(DC) build python

python-alpine: cpp-alpine
	$(DC) build python-alpine

lint: python
	$(DC) build lint

iwyu: lint

clang-format: lint

docs: python

dask: python
	$(DC) build dask

hdfs: python
	$(DC) build hdfs-integration

spark: python
	$(DC) build spark-integration

pandas-master: python
	$(DC) build --no-cache pandas-master
34 changes: 11 additions & 23 deletions README.md
@@ -17,26 +17,14 @@
under the License.
-->

## Apache Arrow

<table>
<tr>
<td>Build Status</td>
<td>
<a href="https://travis-ci.org/apache/arrow">
<img src="https://travis-ci.org/apache/arrow.svg?branch=master" alt="travis build status" />
</a>
</td>
<td>Code Coverage</td>
<td>
<a href="https://codecov.io/gh/apache/arrow">
<img src="https://codecov.io/gh/apache/arrow/branch/master/graph/badge.svg" alt="codecov.io code coverage" />
</a>
</td>
</tr>
</table>

### Powering In-Memory Analytics
# Apache Arrow

[![Build Status](https://travis-ci.org/apache/arrow.svg?branch=master)](https://travis-ci.org/apache/arrow)
[![Coverage Status](https://codecov.io/gh/apache/arrow/branch/master/graph/badge.svg)](https://codecov.io/gh/apache/arrow?branch=master)
[![License](http://img.shields.io/:license-Apache%202-blue.svg)](https://github.com/apache/arrow/blob/master/LICENSE.txt)
[![Twitter Follow](https://img.shields.io/twitter/follow/apachearrow.svg?style=social&label=Follow)](https://twitter.com/apachearrow)

## Powering In-Memory Analytics

Apache Arrow is a development platform for in-memory analytics. It contains a
set of technologies that enable big data systems to process and move data fast.
@@ -61,7 +49,7 @@ Major components of the project include:
Arrow is an [Apache Software Foundation](https://www.apache.org) project. Learn more at
[arrow.apache.org](https://arrow.apache.org).

### What's in the Arrow libraries?
## What's in the Arrow libraries?

The reference Arrow libraries contain a number of distinct software components:

@@ -79,7 +67,7 @@ The reference Arrow libraries contain a number of distinct software components:
implementations (e.g. sending data from Java to C++)
- Conversions to and from other in-memory data structures

### Getting involved
## Getting involved

Even if you do not plan to contribute to Apache Arrow itself or Arrow
integrations in other projects, we'd be happy to have you involved:
@@ -91,7 +79,7 @@ integrations in other projects, we'd be happy to have you involved:
- [Learn the format][2]
- Contribute code to one of the reference implementations

### How to Contribute
## How to Contribute

We prefer to receive contributions in the form of GitHub pull requests. Please
send pull requests against the [github.com/apache/arrow][4] repository.
4 changes: 1 addition & 3 deletions c_glib/arrow-glib/array-builder.cpp
@@ -136,9 +136,7 @@ garrow_array_builder_append_nulls(GArrowArrayBuilder *builder,

auto arrow_builder =
static_cast<BUILDER>(garrow_array_builder_get_raw(builder));
uint8_t valid_bytes[n];
memset(valid_bytes, 0, sizeof(uint8_t) * n);
auto status = arrow_builder->AppendNulls(valid_bytes, n);
auto status = arrow_builder->AppendNulls(n);
return garrow_error_check(error, status, context);
}
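
In the hunk above, the three lines that build a zeroed `valid_bytes` buffer and pass it to `AppendNulls(valid_bytes, n)` are the removed ones, and the single `AppendNulls(n)` call is their replacement. The sketch below illustrates why the two call shapes can be interchanged when every slot is null. `ToyBuilder`, `NullsViaValidityBytes`, and `NullsViaCount` are hypothetical names made up for illustration and are not Arrow's or Arrow GLib's implementation. The sketch also uses `std::vector` where the removed code used a variable-length array, which is a compiler extension rather than standard C++.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a builder; NOT the real arrow::ArrayBuilder.
class ToyBuilder {
 public:
  // Old call shape: the caller supplies one validity byte per slot (0 = null).
  void AppendNulls(const uint8_t* valid_bytes, int64_t length) {
    for (int64_t i = 0; i < length; ++i) {
      const bool is_valid = valid_bytes[i] != 0;
      validity_.push_back(is_valid);
      if (!is_valid) ++null_count_;
    }
  }

  // New call shape: the builder marks `length` slots as null by itself.
  void AppendNulls(int64_t length) {
    validity_.insert(validity_.end(), static_cast<std::size_t>(length), false);
    null_count_ += length;
  }

  int64_t length() const { return static_cast<int64_t>(validity_.size()); }
  int64_t null_count() const { return null_count_; }

 private:
  std::vector<bool> validity_;
  int64_t null_count_ = 0;
};

// Mirrors the removed lines: allocate an all-zero validity buffer, then append.
int64_t NullsViaValidityBytes(int64_t n) {
  assert(n >= 0);
  std::vector<uint8_t> valid_bytes(static_cast<std::size_t>(n), 0);
  ToyBuilder builder;
  builder.AppendNulls(valid_bytes.data(), n);
  return builder.null_count();
}

// Mirrors the added line: no temporary buffer is needed at the call site.
int64_t NullsViaCount(int64_t n) {
  assert(n >= 0);
  ToyBuilder builder;
  builder.AppendNulls(n);
  return builder.null_count();
}
```

Under these assumptions both helpers report the same null count for any non-negative `n`; the count-only form simply avoids allocating and zeroing a scratch buffer whose size depends on caller input.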

5 changes: 4 additions & 1 deletion c_glib/configure.ac
@@ -99,7 +99,10 @@ fi
AC_SUBST(GARROW_CFLAGS)
AC_SUBST(GARROW_CXXFLAGS)

AM_PATH_GLIB_2_0([2.32.4], [], [], [gobject gio])
AM_PATH_GLIB_2_0([2.32.4],
[],
[AC_MSG_ERROR(GLib isn't available)],
[gobject gio])

GOBJECT_INTROSPECTION_REQUIRE([1.32.1])
GTK_DOC_CHECK([1.18-2])
1 change: 1 addition & 0 deletions ci/appveyor-cpp-build.bat
@@ -44,6 +44,7 @@ if "%JOB%" == "Static_Crt_Build" (
cmake --build . --config Debug || exit /B
ctest --output-on-failure -j2 || exit /B
popd
rmdir /S /Q cpp\build-debug

mkdir cpp\build-release
pushd cpp\build-release