Skip to content

Blob search using the glob(Unix style pathname pattern) syntax #40269

Open
@martinResearch

Description

@martinResearch

I would like to be able to efficiently list all the blobs in a container that match a Unix style pathname pattern.
as implemented in the python glob module

Describe the solution you'd like
I would like to be able to list blobs using for example

client.glob("dataset/sequence_*/image*_color.png")]

or

client.glob("dataset/**/image*_color.png", recursive=True)] 

We could list all the blobs in dataset and do the filtering locally, but that is very slow when the number of blobs is large and we are interested in selecting only a small subset. Ideally this feature would be implemented on the server side to avoid having to retrieve large lists of files locally, but I guess this would be out of scope for the python SDK.
I implemented my own solution based on the glob python package but would like this feature to be part of the SDK instead.

Metadata

Metadata

Assignees

No one assigned

    Labels

    ClientThis issue points to a problem in the data-plane of the library.Service AttentionWorkflow: This issue is responsible by Azure service team.StorageStorage Service (Queues, Blobs, Files)customer-reportedIssues that are reported by GitHub users external to the Azure organization.feature-requestThis issue requires a new behavior in the product in order be resolved.needs-team-attentionWorkflow: This issue needs attention from Azure service team or SDK team

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions