Open
Description
Summary
This issue tracks priorities and discussions around DataFrame improvements based on issues and feedback.
Microsoft.Data.Analysis Open Issue Query: https://github.com/dotnet/machinelearning/issues?q=is%3Aopen+is%3Aissue+label%3AMicrosoft.Data.Analysis
Work Items
- DataFrame - Add KeyType Column Support #6499
- DataFrame - Support loading / saving to SQL #6498
- DataFrame - Drop multiple columns #6497
- DataFrame - Rename columns #6496
- Fix reported Merge operation bugs #6493
- Fix reported Join operation bugs #6494
- Fix reported Group operation bugs #6495
Related Issues
Create DataFrames
Data Formatting
Data Sources
- DataFrame LoadSql Method #5662
- Add parquet support for importing and exporting data to/from DataFrame. #5972
- DataFrame enhancements #6088
- DataFrame.LoadCsv should support URLs #5905
- Dataframe load data from excel and query using sql #5646
Other
Reshape DataFrames
- Pivot / Transpose
- Create columns
- Reset indices
- Rename columns
Filter / Sort DataFrames
Combine DataFrames
- Merge
- Join
- Concat
Group DataFrames
- Grouping
- Aggregate Functions
Summarize DataFrames
- Descriptive statistics
- PrimitiveDataFrameColumnComputations produce wrong result for Min/Max functions #5759
Handle Missing Data
- Drop missing data
- Impute / Fill missing data
DataTypes
Array / Vector / VBuffer
- DataFrame - add support for vbuffer #5872
- DataFrame needs to support vector columns. #5721
- DataFrame should support Array data #5746
- Support "Sparse" and "Dense" vector typed columns in the DataFrame #5690
DateTime
- Cannot run Transform on DataFrame with DateTime type #6213
- Add missing implementation for datetime relevant arrow types #6201
Other
- Let DataFrame support UInt32 columns to be compatible with MapValueToKey #5898
- Improve error experience when LoadCsv throws #5656
Misc
- LINQ / EFCore?
- DateTime operations
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment