BTREE indexes to be case sensitive and GIN are case insensitive #335

wakibi · 2020-09-21T06:05:06Z

By their nature, BTREE indexes are case sensitive and that should be included in our documentation. Removing the case sensitivity on BTREEs removes the performance gains they were used in the first place. Users need made be aware of this

stale · 2021-02-17T17:00:30Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale · 2021-04-26T04:51:04Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

k8hughes · 2021-04-26T08:34:51Z

I think we still need this.

k8hughes · 2021-04-28T08:53:43Z

To be able to work on this we'd need to know the collumns that are most searched by for it to be efficent.

stale · 2021-06-27T12:08:50Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

wakibi · 2021-07-09T09:34:59Z

There is a blocker here, some of the fields to be indexed are rather too long and the DB can not index them. Suggestions on how to handle this are much welcome @akmiller01 @dean-breed @bill-anderson

bill-anderson · 2021-07-09T09:44:31Z

You don't want BTREE indexes on titles and descriptions, do you? In my old money (of my particular old currency) we had cross-reference indexes for free text fields. Is this GiST in new money?

wakibi · 2021-07-09T10:09:42Z

That's right @bill-anderson. GiST and GIN are to be used for those extremely long text columns, but they too have a max string length (which is about 8000 characters) they can be applied to. Unfortunately, some of our data exceeds that length!

bill-anderson · 2021-07-09T11:01:54Z

So what I think we should do is move this into an ETL discussion to limit all free-text fields to, say, 5000 characters. I don't think that will affect the integrity of the data. But we should do a bit of research on the long culprits.

dean-breed · 2022-04-11T09:27:55Z

@wakibi Would it be possible to get a list of reporting organisations and columns which would be truncated?

dean-breed · 2022-04-21T10:30:53Z

@wakibi Can I get this list?

edwinmp · 2022-04-21T15:37:28Z

@dean-breed sorry Chris is on leave ... let me stick this on Slack where he can't miss it.

wakibi · 2022-04-25T11:12:19Z

@dean-breed Find attached. I have added all those with more than 7500 (my reasoning is those approaching that figure could easily soon hit the 8000 ceiling as well).
long_columns.csv

dean-breed · 2022-04-25T11:28:04Z

@wakibi - I could do something on this myself but to avoid mistake likelihood. Could you output the reporting org reference also?

wakibi · 2022-04-25T12:06:19Z

@wakibi - I could do something on this myself but to avoid mistake likelihood. Could you output the reporting org reference also?

long_columns.csv

Attached. I had forgotten to format the first one.

dean-breed · 2022-04-26T08:14:00Z

Hi Chris @wakibi, is it possible to output this list as part of the ETL process? I could then include the information in the data quality file?

wakibi · 2022-05-03T07:13:21Z

Hi @dean-breed, do you want it sent as an email attachment?

dean-breed · 2022-05-03T07:57:40Z

That would be great if it could. Or just stored in a repo somewhere. Either works @wakibi

wakibi added the Database label Sep 21, 2020

stale bot added the wontfix This will not be worked on label Feb 17, 2021

stale bot closed this as completed Feb 24, 2021

edwinmp reopened this Feb 24, 2021

stale bot removed the wontfix This will not be worked on label Feb 24, 2021

stale bot added the wontfix This will not be worked on label Apr 26, 2021

stale bot removed the wontfix This will not be worked on label Apr 26, 2021

edwinmp added pinned and removed pinned labels Apr 26, 2021

stale bot added the wontfix This will not be worked on label Jun 27, 2021

stale bot closed this as completed Jul 4, 2021

edwinmp reopened this Jul 4, 2021

edwinmp added pinned and removed wontfix This will not be worked on labels Jul 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BTREE indexes to be case sensitive and GIN are case insensitive #335

BTREE indexes to be case sensitive and GIN are case insensitive #335

wakibi commented Sep 21, 2020

stale bot commented Feb 17, 2021

stale bot commented Apr 26, 2021

k8hughes commented Apr 26, 2021

k8hughes commented Apr 28, 2021

stale bot commented Jun 27, 2021

wakibi commented Jul 9, 2021

bill-anderson commented Jul 9, 2021

wakibi commented Jul 9, 2021 •

edited

Loading

bill-anderson commented Jul 9, 2021

dean-breed commented Apr 11, 2022

dean-breed commented Apr 21, 2022

edwinmp commented Apr 21, 2022

wakibi commented Apr 25, 2022 •

edited

Loading

dean-breed commented Apr 25, 2022

wakibi commented Apr 25, 2022

dean-breed commented Apr 26, 2022

wakibi commented May 3, 2022

dean-breed commented May 3, 2022

BTREE indexes to be case sensitive and GIN are case insensitive #335

BTREE indexes to be case sensitive and GIN are case insensitive #335

Comments

wakibi commented Sep 21, 2020

stale bot commented Feb 17, 2021

stale bot commented Apr 26, 2021

k8hughes commented Apr 26, 2021

k8hughes commented Apr 28, 2021

stale bot commented Jun 27, 2021

wakibi commented Jul 9, 2021

bill-anderson commented Jul 9, 2021

wakibi commented Jul 9, 2021 • edited Loading

bill-anderson commented Jul 9, 2021

dean-breed commented Apr 11, 2022

dean-breed commented Apr 21, 2022

edwinmp commented Apr 21, 2022

wakibi commented Apr 25, 2022 • edited Loading

dean-breed commented Apr 25, 2022

wakibi commented Apr 25, 2022

dean-breed commented Apr 26, 2022

wakibi commented May 3, 2022

dean-breed commented May 3, 2022

wakibi commented Jul 9, 2021 •

edited

Loading

wakibi commented Apr 25, 2022 •

edited

Loading