Closed
Description
The idea of this ticket is make my plans for DataFusion visible, largely for my own personal organizational needs, but also to:
- Try some different ways to communicate / coordinate int he community
- Help provide an interesting summary of what is happening in DataFusion this week
I request anyone else who has plans like this for DataFusion could try something like this (maybe tickets are the best place 🤔 )
Plans for this week personally:
- Epic: Statistics improvements #8227 (specifically, Consolidate statistics aggregation #8229 as a step towards Introduce a way to represent constrained statistics / bounds on values in Statistics #8078)
- Edits to paper (improve language and get it under the page limit) Write DataFusion paper for (SIGMOD / VLDB / ICDE) #6782 with @ozankabak @Dandandan @yjshen and @JayjeetAtGithub @sunchao and @viirya
- sqlparser release: Release version
0.40.0
datafusion-sqlparser-rs#1030
My plans for next week
- FIFO
JoinHashMap
forHashJoin
#8130, in support of memory explosion in HashJoin outputs:HashJoinStream
memory tracking insufficient #7848 with @korowa
Major projects I intend to help review and coordinate
- Make function packages more modular (make all BuiltInScalarFunctions ScalarUDFs): [EPIC] Unify Function Interface (remove
BuiltInScalarFunction
) #8045 with @2010YOUY01 (help Refactor: UnifyExpr::ScalarFunction
andExpr::ScalarUDF
, introduce unresolved functions by name #8258) -
ARRAY
function hardening: [Epic] General ticket for the concept of the practical implementation ofARRAY
#6980 / https://github.com/apache/arrow-datafusion/issues/7988with @jayzhan211 @Veeupup @edmondop and @xudong963 - Review any new ticket / bug reports
Metadata
Metadata
Assignees
Labels
No labels