Skip to content

Issues: rapidsai/cudf

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Add new nvtext::normalize_characters API 2 - In Progress Currently a work in progress improvement Improvement / enhancement to an existing function libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change strings strings issues (C++ and Python)
#17818 opened Jan 24, 2025 by davidwendt Draft
3 tasks done
New nvtext::wordpiece_tokenizer APIs 2 - In Progress Currently a work in progress CMake CMake build issue feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change pylibcudf Issues specific to the pylibcudf package Python Affects Python cuDF API. strings strings issues (C++ and Python)
#17600 opened Dec 16, 2024 by davidwendt Draft
3 tasks done
[FEA] Make line terminator sequence handling in regular expression engine a configurable option feature request New feature or request strings strings issues (C++ and Python) wontfix This will not be worked on
#15746 opened May 14, 2024 by NVnavkumar
[FEA] Improve performance of strings matching in libcudf 0 - Backlog In queue waiting for assignment feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. strings strings issues (C++ and Python)
#15611 opened Apr 29, 2024 by GregoryKimball
[FEA] Optionally support titlecase for capitalize 0 - Backlog In queue waiting for assignment feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. Spark Functionality that helps Spark RAPIDS strings strings issues (C++ and Python)
#14144 opened Sep 20, 2023 by revans2
[FEA] Better scaling for simple regular expressions on long strings 0 - Backlog In queue waiting for assignment feature request New feature or request Performance Performance related issue Spark Functionality that helps Spark RAPIDS strings strings issues (C++ and Python)
#14087 opened Sep 12, 2023 by revans2
[FEA] Benchmark Jaccard with alternative distributions 0 - Backlog In queue waiting for assignment feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue strings strings issues (C++ and Python)
#13726 opened Jul 19, 2023 by vyasr
[FEA] Story - Improve performance with long strings 2 - In Progress Currently a work in progress libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue strings strings issues (C++ and Python)
#13048 opened Apr 3, 2023 by GregoryKimball Language model acceleration
[FEA] Series.str.contains support for case 0 - Backlog In queue waiting for assignment feature request New feature or request Python Affects Python cuDF API. strings strings issues (C++ and Python)
#12515 opened Jan 10, 2023 by mattf Pandas API Alignment and Coverage
[BUG] Overflow potentially corrupting hashes in hash_vocab implementation 0 - Backlog In queue waiting for assignment bug Something isn't working Python Affects Python cuDF API. strings strings issues (C++ and Python) tests Unit testing for project
#12403 opened Dec 16, 2022 by vyasr Language model acceleration
[BUG] OOM when invoking normalize_characters on a relatively small dataframe 0 - Backlog In queue waiting for assignment bug Something isn't working libcudf Affects libcudf (C++/CUDA) code. Python Affects Python cuDF API. strings strings issues (C++ and Python)
[FEA] Ability to control the amount of temporary memory used for regex expressions feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. strings strings issues (C++ and Python)
#10852 opened May 13, 2022 by jlowe
[FEA] Add version of extract_re that takes an index feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. strings strings issues (C++ and Python)
#9855 opened Dec 7, 2021 by andygrove
[FEA] Byte Pair Encoding Tokenizer feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. strings strings issues (C++ and Python)
#9657 opened Nov 11, 2021 by VibhuJawa Language model acceleration
[FEA] Initial support for string UDFs via Numba feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. numba Numba issue Python Affects Python cuDF API. strings strings issues (C++ and Python)
#9639 opened Nov 9, 2021 by brandon-b-miller UDF Enhancements
[FEA] Support an “extractall” method feature request New feature or request Python Affects Python cuDF API. strings strings issues (C++ and Python)
#7908 opened Apr 8, 2021 by Nicholas-7
[FEA] Jaro-Winkler algorithm for cudf.core.column.string.StringMethods.edit_distance feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. Python Affects Python cuDF API. strings strings issues (C++ and Python)
#6503 opened Oct 12, 2020 by paulhendricks
[BUG] Incorrect precision of floating values are being written to csv bug Something isn't working cuIO cuIO issue libcudf Affects libcudf (C++/CUDA) code. strings strings issues (C++ and Python)
#6418 opened Oct 4, 2020 by galipremsagar CSV continuous improvement
[FEA] Support packing to a max input sequence length with cudf-subword tokenizer feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. Python Affects Python cuDF API. strings strings issues (C++ and Python)
#6089 opened Aug 25, 2020 by VibhuJawa
[FEA] Support additional pandas features in strings wrap implementation feature request New feature or request strings strings issues (C++ and Python)
#4348 opened Mar 6, 2020 by galipremsagar
[FEA] String timestamp parsing to match Spark casting string to timestamp feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. Spark Functionality that helps Spark RAPIDS strings strings issues (C++ and Python)
#3320 opened Nov 7, 2019 by rwlee
ProTip! Follow long discussions with comments:>50.