-
Notifications
You must be signed in to change notification settings - Fork 921
Issues: rapidsai/cudf
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Add new nvtext::normalize_characters API
2 - In Progress
Currently a work in progress
improvement
Improvement / enhancement to an existing function
libcudf
Affects libcudf (C++/CUDA) code.
non-breaking
Non-breaking change
strings
strings issues (C++ and Python)
#17818
opened Jan 24, 2025 by
davidwendt
•
Draft
3 tasks done
New nvtext::wordpiece_tokenizer APIs
2 - In Progress
Currently a work in progress
CMake
CMake build issue
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
non-breaking
Non-breaking change
pylibcudf
Issues specific to the pylibcudf package
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
#17600
opened Dec 16, 2024 by
davidwendt
•
Draft
3 tasks done
[FEA] Make line terminator sequence handling in regular expression engine a configurable option
feature request
New feature or request
strings
strings issues (C++ and Python)
wontfix
This will not be worked on
#15746
opened May 14, 2024 by
NVnavkumar
[FEA] Improve performance of strings matching in libcudf
0 - Backlog
In queue waiting for assignment
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
strings
strings issues (C++ and Python)
#15611
opened Apr 29, 2024 by
GregoryKimball
[FEA] Optionally support titlecase for capitalize
0 - Backlog
In queue waiting for assignment
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
Spark
Functionality that helps Spark RAPIDS
strings
strings issues (C++ and Python)
#14144
opened Sep 20, 2023 by
revans2
[FEA] Better scaling for simple regular expressions on long strings
0 - Backlog
In queue waiting for assignment
feature request
New feature or request
Performance
Performance related issue
Spark
Functionality that helps Spark RAPIDS
strings
strings issues (C++ and Python)
#14087
opened Sep 12, 2023 by
revans2
[FEA] Benchmark Jaccard with alternative distributions
0 - Backlog
In queue waiting for assignment
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
Performance
Performance related issue
strings
strings issues (C++ and Python)
#13726
opened Jul 19, 2023 by
vyasr
[FEA] Story - Improve performance with long strings
2 - In Progress
Currently a work in progress
libcudf
Affects libcudf (C++/CUDA) code.
Performance
Performance related issue
strings
strings issues (C++ and Python)
[FEA] Series.str.contains support for case
0 - Backlog
In queue waiting for assignment
feature request
New feature or request
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
[BUG] Overflow potentially corrupting hashes in hash_vocab implementation
0 - Backlog
In queue waiting for assignment
bug
Something isn't working
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
tests
Unit testing for project
[BUG] OOM when invoking normalize_characters on a relatively small dataframe
0 - Backlog
In queue waiting for assignment
bug
Something isn't working
libcudf
Affects libcudf (C++/CUDA) code.
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
[FEA] Ability to control the amount of temporary memory used for regex expressions
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
strings
strings issues (C++ and Python)
#10852
opened May 13, 2022 by
jlowe
[FEA] Add version of extract_re that takes an index
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
strings
strings issues (C++ and Python)
#9855
opened Dec 7, 2021 by
andygrove
[FEA] Byte Pair Encoding Tokenizer
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
strings
strings issues (C++ and Python)
[FEA] Initial support for string UDFs via Numba
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
numba
Numba issue
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
[FEA] Support an “extractall” method
feature request
New feature or request
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
#7908
opened Apr 8, 2021 by
Nicholas-7
[FEA] Support string concatenation of a Series and something array-like into a Series
feature request
New feature or request
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
[FEA] Jaro-Winkler algorithm for cudf.core.column.string.StringMethods.edit_distance
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
#6503
opened Oct 12, 2020 by
paulhendricks
[BUG] Incorrect precision of floating values are being written to csv
bug
Something isn't working
cuIO
cuIO issue
libcudf
Affects libcudf (C++/CUDA) code.
strings
strings issues (C++ and Python)
[FEA] Support packing to a max input sequence length with cudf-subword tokenizer
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
Python
Affects Python cuDF API.
strings
strings issues (C++ and Python)
#6089
opened Aug 25, 2020 by
VibhuJawa
[FEA] Support additional pandas features in strings wrap implementation
feature request
New feature or request
strings
strings issues (C++ and Python)
#4348
opened Mar 6, 2020 by
galipremsagar
[FEA] String timestamp parsing to match Spark casting string to timestamp
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
Spark
Functionality that helps Spark RAPIDS
strings
strings issues (C++ and Python)
#3320
opened Nov 7, 2019 by
rwlee
ProTip!
Follow long discussions with comments:>50.