-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Open
Labels
EPICA larger project, actively underway, with sub tasksA larger project, actively underway, with sub tasksenhancementNew feature or requestNew feature or request
Description
This is a follow on to #3058 as we have made significant progress since @kmitchener originally posted that
“Write cool software and tell people about it” – Paul Dix @pauldix (Founder and CTO of InfluxData)
Call to action:
The DataFusion community has invested a lot in the cool software; This ticket tracks tasks to improve the "tell people about it"
User Guide
- Getting started guide for new users (who want to use DataFusion in their project) #7014
- Automate production of SQL and DataFrame references for SQL functions / Expressions #3092
- Write a case study about using DataFusion #7015
- Document SQL operators (like
@>and~) #7153 - Document streaming usecase (like
UNBOUNDEDtables) #9016 - Make it easier to install / get started with DataFusion #7297
- Document DML statements #7319
- Hide boilerplate in documentation examples #14557
- Add a "Gentle Introduction to Arrow / Record Batches" #11336
- Improvements to user guide for DataFrames #11388
- Run / Test all examples in Documentation #14435
Rust Docs (docs.rs)
- Update SQL functions rustdocs with example usage
- Automate production of SQL and DataFrame references for SQL functions / Expressions #3092
- Include the latest rustdocs (e.g.
cargo docoutput on https://arrow.apache.org/datafusion/) #5981
Examples
- Review current examples and see how they can be improved
- Include examples reading from object stores (S3, Azure, GCS)
- Async User Defined Functions (UDF) #6518
- Add example for writing a
FileFormat#11079 - Add a section to the documentation explaining that PGO can help up substantially (25%) and maybe offer some tips for users to use it? #9561
- Add example for writing an
AnalyzerRule#10855 - Add example for writing an SQL analysis pass #10871
- [EPIC] Improve examples to make them easier to navigate #11172
- Provide an example of using a remote catalog #13714
Conference Papers
Blog posts
- Write a blog post fast Vectorized grouping for high cardinality #6988
- Blog post with DataFusion Jun - Sep 2023 #6780
- Blog post about user defined window functions #6781
Other
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
EPICA larger project, actively underway, with sub tasksA larger project, actively underway, with sub tasksenhancementNew feature or requestNew feature or request