Releases
v2.0.0
2.0.0 (2025-04-17)
⚠ BREAKING CHANGES
make dataset
and name
params mandatory in udf
(#1619 )
Locational endpoints support is not available in BigFrames 2.0.
change default LLM model to gemini-2.0-flash-001, drop PaLM2TextGenerator and PaLM2TextEmbeddingGenerator (#1558 )
change default ingress setting for remote_function
to internal-only (#1544 )
make remote_function
params keyword only (#1537 )
make remote_function
default service account explicit (#1537 )
set allow_large_results=False
by default (#1541 )
Features
Add on
parameter in dataframe.rolling()
and dataframe.groupby.rolling()
(#1556 ) (45c9d9f )
Add component to manage temporary tables (#1559 ) (0a4e245 )
Add Series.to_pandas_batches() method (#1592 ) (09ce979 )
Add support for creating a Matrix Factorization model (#1330 ) (b5297f9 )
Allow input_types
, output_type
, and dataset
to be used positionally in remote_function
(#1560 ) (bcac8c6 )
Allow pandas.cut 'labels' parameter to accept a list of string (#1549 ) (af842b1 )
Change default ingress setting for remote_function
to internal-only (#1544 ) (c848a80 )
Detect duplicate column/index names in read_gbq before send query. (#1615 ) (40d6960 )
Drop support for locational endpoints (#1542 ) (4bf2e43 )
Enable time range rolling for DataFrame, DataFrameGroupBy and SeriesGroupBy (#1605 ) (b4b7073 )
Improve local data validation (#1598 ) (815e471 )
Make remote_function
default service account explicit (#1537 ) (9eb9089 )
Set allow_large_results=False
by default (#1541 ) (e9fb712 )
Support bigquery connection in managed function (#1554 ) (f6f697a )
Support bq connection path format (#1550 ) (e7eb918 )
Support gemini-2.0-X models (#1558 ) (3104fab )
Support inlining small list, struct, json data (#1589 ) (2ce891f )
Support time range rolling on Series. (#1590 ) (6e98a2c )
Use session temp tables for all ephemeral storage (#1569 ) (9711b83 )
Use validated local storage for data uploads (#1612 ) (aee4159 )
Warn the deprecated max_download_size
, random_state
and sampling_method
parameters in (DataFrame|Series).to_pandas()
(#1573 ) (b9623da )
Bug Fixes
to_pandas_batches()
respects page_size
and max_results
again (#1572 ) (27c5905 )
Ensure page_size
works correctly in to_pandas_batches
when max_results
is not set (#1588 ) (570cff3 )
Include role and service account in IAM exception (#1564 ) (8c50755 )
Make dataset
and name
params mandatory in udf
(#1619 ) (637e860 )
Pandas.cut returns labels index for numeric breaks when labels=False (#1548 ) (b2375de )
Prevent KeyError
in bpd.concat
with empty DF and struct/array types DF (#1568 ) (b4da1cf )
Read_csv supports for tilde local paths and includes index for bigquery_stream write engine (#1580 ) (352e8e4 )
Use dictionaries to avoid problematic google.iam namespace (#1611 ) (b03e44f )
Performance Improvements
Dependencies
Documentation
Add details for bigquery_connection
in [@bpd](https://github.com/bpd).udf
docstring (#1609 ) (ef63772 )
Add explain forecast snippet to multiple time series tutorial (#1586 ) (40c55a0 )
Add message to remove default model for version 3.0 (#1563 ) (910be2b )
Add samples for ArimaPlus time_series_id_col
feature (#1577 ) (1e4cd9c )
Add warning for bigframes 2.0 (#1557 ) (3f0eaa1 )
Deprecate default model in TextEmbedddingGenerator
, GeminiTextGenerator
, and other bigframes.ml.llm
classes (#1570 ) (89ab33e )
Include all licenses for vendored packages in the root LICENSE file (#1626 ) (8116ed0 )
Remove gemini-1.5 deprecation warning for GeminiTextGenerator
(#1562 ) (0cc6784 )
Use restructured text to allow publishing to PyPI (#1565 ) (d1e9ec2 )
Miscellaneous Chores
You can’t perform that action at this time.