Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added ADO.NET importing/exporting functionality to DataFrame #5975

Merged

Conversation

andrei-faber
Copy link
Contributor

See #5972
This universal integration with ADO.NET allows importing and exporting data to a large number of SQL-compatible databases.
In addition to this, this PR adds methods to load data from any IEnumerable collections and to export data to System.Data.DataTable

@dnfadmin
Copy link

dnfadmin commented Oct 14, 2021

CLA assistant check
All CLA requirements met.

@andrei-faber
Copy link
Contributor Author

It seems that build fails because I added a reference to the System.Data.SQLite NuGet package (it's only used for integration tests), and the build doesn't use public NuGet repos; I wonder if it's possible to add System.Data.SQLite to the private repo?

@michaelgsharp
Copy link
Member

@andrei-faber I believe that should be just fine. Let me talk to some people on my end and I'll get back to you.

@michaelgsharp
Copy link
Member

/azp run

@azure-pipelines
Copy link

Azure Pipelines successfully started running 2 pipeline(s).

@michaelgsharp
Copy link
Member

@andrei-faber The NuGet package has been added and I have requeued the pipelines.

@@ -1,6 +1,7 @@
<Project Sdk="Microsoft.NET.Sdk">
<PropertyGroup>
<NoWarn>$(NoWarn);MSML_ParameterLocalVarName;MSML_PrivateFieldName;MSML_ExtendBaseTestClass;MSML_GeneralName</NoWarn>
<TargetFramework>netcoreapp3.1</TargetFramework>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We don't need to specifically set the target framework here. It happens based on command line parameters and lets us set multiple different ones.

At some point I am going to change this to be multi-targeting, but at least for now we can just remove this.

@@ -34,4 +35,8 @@
<EmbeddedResource Include="../../src/Microsoft.Data.Analysis/Strings.resx" LogicalName="Microsoft.Data.Analysis.Strings.resources" />
</ItemGroup>

<ItemGroup>
<PackageReference Include="System.Data.SQLite" Version="1.0.115" />
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like the version that I was able to get added is 1.0.113.7. Do you need 1.0.115 specifically? Or can we use 1.0.113.7?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@michaelgsharp it should work as well, but it seems that 1.0.113.7 is no longer available on nuget.org so I can't test it on my side.

@andrei-faber
Copy link
Contributor Author

andrei-faber commented Oct 21, 2021

@michaelgsharp some strange version conflict in there. Maybe it's better to use libraries with the same version number for System.Data.SQLite.Core in Microsoft.ML.Tests.csproj and System.Data.SQLite in Microsoft.Data.Analysis.Tests.csproj
For instance use System.Data.SQLite 1.0.112.2 (if it's possible to get it)

@michaelgsharp
Copy link
Member

@andrei-faber So part of the issue is that we are still building for .net core 2, which the sql package doesn't support. I am in the process of removing that, should be done by next week (have a couple of things do to first), that should resolve this issue. So we should be able to get this in next week.

I did get the version stuff figured out, I didn't get the package moved correctly.

@andrei-faber
Copy link
Contributor Author

@michaelgsharp great, thanks. Is there anything I can do to help?

@codecov
Copy link

codecov bot commented May 6, 2023

Codecov Report

Merging #5975 (cf05e79) into main (ff3b1b9) will increase coverage by 0.01%.
The diff coverage is 86.36%.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #5975      +/-   ##
==========================================
+ Coverage   68.65%   68.67%   +0.01%     
==========================================
  Files        1202     1203       +1     
  Lines      250769   250974     +205     
  Branches    26190    26209      +19     
==========================================
+ Hits       172166   172344     +178     
- Misses      71785    71805      +20     
- Partials     6818     6825       +7     
Flag Coverage Δ
Debug 68.67% <86.36%> (+0.01%) ⬆️
production 63.16% <79.31%> (+0.01%) ⬆️
test 88.87% <100.00%> (+0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
src/Microsoft.Data.Analysis/Extensions.cs 73.33% <73.33%> (ø)
src/Microsoft.Data.Analysis/DataFrame.IO.cs 80.76% <80.00%> (+0.88%) ⬆️
...Microsoft.Data.Analysis.Tests/DataFrame.IOTests.cs 98.97% <100.00%> (+0.07%) ⬆️

... and 5 files with indirect coverage changes

@michaelgsharp michaelgsharp merged commit 3d705bf into dotnet:main May 9, 2023
@ghost ghost locked as resolved and limited conversation to collaborators Jun 8, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants