-
Notifications
You must be signed in to change notification settings - Fork 148
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
resource.infer(stats=True)
changes value of dialect.delimiter
depending on resource data
#1671
Comments
* Adiciona chave primária em coluna única chave_* para cada resource de acordo com schemas. * Adiciona campos de chaves aos schemas das resources * Correções para validação do datapackage.json * refactor: remove dead code * fix: build_package() should not be called in scripts/build.py * feat: move chave_* columns to first position * hardcode delimiter to prevent bug in resource.infer see frictionlessdata/frictionless-py#1671 * reset confict files --------- Co-authored-by: Francisco Júnior <fjunior.alves.oliveira@gmail.com>
Hi, thanks for the report and reproducible example. In your example, I get indeed I am not very familiar with the frictionless codebase, so sorry if I'm off base, I am guessing a bit. My understanding is that when it is not explicitly set, frictionless makes a guess about the delimiter being So do I understand correctly that the issue is not that it changes any behaviour, but that it explicitely sets the delimiter in the resource? If yes, why is it an issue? |
I did not notice that In general1 I would expect
For my particular use case the inference behaviour is problematic because I have several descriptors in which I do not explicitly set Footnotes
|
Ok thanks, I understand what bothers you, and I agree that using I also understand why you specifically asked about In any case I'll try to find the time to understand the exact behaviour of the delimiter detection. |
Knowing that this is also the behaviour in Maybe in the future making the delimiter detection more robust (which I'm not sure is high priority TBH). |
I did not find this information in the documentation, but by extrapolating this behavior:
I assumed that
resource.infer(stats=True)
would not changedialect.delimiter
(even if it was not explicitly set).However, this is not the case, and data like:
is causing
resource.infer(stats=True)
to setdialect.delimiter
to|
.There is a reproducible example here.
The text was updated successfully, but these errors were encountered: