v2.0.0

release-please released this 17 Apr 19:46
881e4f0

2.0.0 (2025-04-17)

⚠ BREAKING CHANGES

  • make dataset and name params mandatory in udf (#1619)
  • Locational endpoints support is not available in BigFrames 2.0.
  • change default LLM model to gemini-2.0-flash-001, drop PaLM2TextGenerator and PaLM2TextEmbeddingGenerator (#1558)
  • change default ingress setting for remote_function to internal-only (#1544)
  • make remote_function params keyword only (#1537)
  • make remote_function default service account explicit (#1537)
  • set allow_large_results=False by default (#1541)
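
To make the keyword-only change concrete, here is a minimal plain-Python sketch of how such a signature behaves. `remote_function_stub` and its `ingress` parameter are hypothetical stand-ins, not the real bigframes signature; the sketch only assumes what the notes state, namely that `input_types`, `output_type`, and `dataset` may be passed positionally (#1560), that other parameters must be passed by keyword, and that the ingress default is now internal-only.

```python
# Hypothetical stand-in used only to illustrate call styles.
# This is NOT the bigframes API; the parameter name `ingress` is invented.
def remote_function_stub(input_types=None, output_type=None, dataset=None, *,
                         ingress="internal-only"):
    """Return the resolved settings so the call style can be inspected."""
    return {
        "input_types": input_types,
        "output_type": output_type,
        "dataset": dataset,
        "ingress": ingress,
    }


# Accepted: the first three arguments are positional, the rest use keywords.
ok = remote_function_stub([int], int, "my_dataset")

# Accepted: overriding the default requires naming the parameter.
overridden = remote_function_stub([int], int, "my_dataset", ingress="all")

# Rejected: a fourth positional argument raises TypeError.
try:
    remote_function_stub([int], int, "my_dataset", "all")
    positional_rejected = False
except TypeError:
    positional_rejected = True
```

Code written against 1.x that passed later arguments positionally would fail in this way, so a migration would likely involve naming those arguments explicitly.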

Features

  • Add on parameter in dataframe.rolling() and dataframe.groupby.rolling() (#1556) (45c9d9f)
  • Add component to manage temporary tables (#1559) (0a4e245)
  • Add Series.to_pandas_batches() method (#1592) (09ce979)
  • Add support for creating a Matrix Factorization model (#1330) (b5297f9)
  • Allow input_types, output_type, and dataset to be used positionally in remote_function (#1560) (bcac8c6)
  • Allow pandas.cut 'labels' parameter to accept a list of strings (#1549) (af842b1)
  • Change default ingress setting for remote_function to internal-only (#1544) (c848a80)
  • Detect duplicate column/index names in read_gbq before sending the query (#1615) (40d6960)
  • Drop support for locational endpoints (#1542) (4bf2e43)
  • Enable time range rolling for DataFrame, DataFrameGroupBy and SeriesGroupBy (#1605) (b4b7073)
  • Improve local data validation (#1598) (815e471)
  • Make remote_function default service account explicit (#1537) (9eb9089)
  • Set allow_large_results=False by default (#1541) (e9fb712)
  • Support bigquery connection in managed function (#1554) (f6f697a)
  • Support bq connection path format (#1550) (e7eb918)
  • Support gemini-2.0-X models (#1558) (3104fab)
  • Support inlining small list, struct, json data (#1589) (2ce891f)
  • Support time range rolling on Series. (#1590) (6e98a2c)
  • Use session temp tables for all ephemeral storage (#1569) (9711b83)
  • Use validated local storage for data uploads (#1612) (aee4159)
  • Warn about the deprecated max_download_size, random_state and sampling_method parameters in (DataFrame|Series).to_pandas() (#1573) (b9623da)
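
For readers unfamiliar with time range rolling, the following is a rough pure-Python sketch of the idea: each output aggregates the rows whose timestamps fall within a fixed duration ending at the current row. It assumes pandas-style semantics (a window that is open on the left and closed on the right); whether BigFrames matches this in every detail is an assumption, and `rolling_time_sum` is a hypothetical helper, not a bigframes function.

```python
from datetime import datetime, timedelta


def rolling_time_sum(rows, window):
    """Sum values whose timestamp lies in (t - window, t] for each row t.

    `rows` is a list of (timestamp, value) pairs sorted by timestamp,
    and `window` is a positive timedelta.
    """
    out = []
    start = 0      # index of the oldest row still inside the window
    running = 0    # sum of values currently inside the window
    for ts, value in rows:
        running += value
        # Drop rows that have fallen out of the left-open window.
        while rows[start][0] <= ts - window:
            running -= rows[start][1]
            start += 1
        out.append(running)
    return out


rows = [
    (datetime(2025, 4, 17, 0, 0, 0), 1),
    (datetime(2025, 4, 17, 0, 0, 1), 2),
    (datetime(2025, 4, 17, 0, 0, 3), 4),
    (datetime(2025, 4, 17, 0, 0, 4), 8),
]
sums = rolling_time_sum(rows, timedelta(seconds=2))
print(sums)  # [1, 3, 4, 12]
```

Unlike a fixed row-count window, the number of rows contributing to each result varies with the spacing of the timestamps, which is why the third result covers only a single row.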

Bug Fixes

  • to_pandas_batches() respects page_size and max_results again (#1572) (27c5905)
  • Ensure page_size works correctly in to_pandas_batches when max_results is not set (#1588) (570cff3)
  • Include role and service account in IAM exception (#1564) (8c50755)
  • Make dataset and name params mandatory in udf (#1619) (637e860)
  • Pandas.cut returns labels index for numeric breaks when labels=False (#1548) (b2375de)
  • Prevent KeyError in bpd.concat with empty DF and struct/array types DF (#1568) (b4da1cf)
  • read_csv supports tilde local paths and includes the index for the bigquery_stream write engine (#1580) (352e8e4)
  • Use dictionaries to avoid problematic google.iam namespace (#1611) (b03e44f)

Performance Improvements

  • Directly read gbq table for simple plans (#1607) (6ad38e8)

Dependencies

Documentation

  • Add details for bigquery_connection in @bpd.udf docstring (#1609) (ef63772)
  • Add explain forecast snippet to multiple time series tutorial (#1586) (40c55a0)
  • Add message to remove default model for version 3.0 (#1563) (910be2b)
  • Add samples for ArimaPlus time_series_id_col feature (#1577) (1e4cd9c)
  • Add warning for bigframes 2.0 (#1557) (3f0eaa1)
  • Deprecate default model in TextEmbeddingGenerator, GeminiTextGenerator, and other bigframes.ml.llm classes (#1570) (89ab33e)
  • Include all licenses for vendored packages in the root LICENSE file (#1626) (8116ed0)
  • Remove gemini-1.5 deprecation warning for GeminiTextGenerator (#1562) (0cc6784)
  • Use restructured text to allow publishing to PyPI (#1565) (d1e9ec2)

Miscellaneous Chores

  • Make remote_function params keyword only (#1537) (9eb9089)