tap-instagram
is a Singer tap for the Instagram Graph API built
with the Meltano Tap SDK for Singer Taps.
-
Developer TODO:
Update the below as needed to correctly describe the install procedure. For instance, if you do not have a PyPi repo, or if you want users to directly install from your git repo, you can modify this step as appropriate.
pipx install tap-instagram
The Instagram tap replicates the following data:
Those sources are spread across a few additional streams since they support varying query parameters, time periods, amounts of historical data, etc. The following section outlines some important information, but defer to the API docs linked above for more detail.
- Users: IG User objects representing the Instagram Business or Creator Accounts from the
ig_user_ids
config parameter.- Replication Method: Full Table
- Media: IG Media objects representing media published by a given IG User.
- Replication Method: Full Table
- Parent Stream: Users
- Limitations: Return a max of 10k of the most recently created media objects for that user. Does not include stories, which are in the "stories" stream.
- Stories: IG Media objects representing stories published by a given IG User in the last 24 hours.
- Replication Method: Full Table
- Parent Stream: Users
- Limitations: Stories are only available for 24hrs, do not include Live Video stories or reshared stories.
- Media Children: IG Media objects corresponding to images or videos in an album.
- Replication Method: Full Table
- Parent Stream: Media
- User Insights 28 Day: User Insights stream containing impressions and reach for a 28 day period.
- Replication Method: Full Table
- Parent Stream: Users
- User Insights Audience: User Insights stream containing audience metrics audience_city, audience_country,
audience_gender_age, audience_locale for a lifetime period.
- Replication Method: Full Table
- Parent Stream: Users
- User Insights Daily: User Insights stream containing all daily metrics except follower_count, which has less
historical data - email_contacts, get_directions_clicks, impressions, phone_call_clicks, profile_views, reach,
text_message_clicks, website_clicks.
- Replication Method: Full Table
- Parent Stream: Users
- User Insights Followers: User Insights stream containing follower_count on a daily time period.
- Replication Method: Full Table
- Parent Stream: Users
- Limitations: Only returns data for the last 30 days.
- User Insights Online Followers: User Insights stream containing online_followers on a lifetime time period.
- Replication Method: Full Table
- Parent Stream: Users
- Limitations: Only returns data for the last 30 days.
- User Insights Weekly: User Insights stream containing impressions and reach for a weekly period.
- Replication Method: Full Table
- Parent Stream: Users
- Media Insights: Media Insights stream returning the supported metrics for photo, video, and album Media objects.
- Replication Method: Full Table
- Parent Stream: Media
- Limitation: Does not return insights for media published before account was changed from personal to business.
- Story Insights: Media Insights stream returning the supported metrics for story Media objects.
- Replication Method: Full Table
- Parent Stream: Stories
- Limitation: Does not return insights for stories published before account was changed from personal to business.
The tap accepts the following config options:
ig_user_ids: List[str]
(required): List of user IDs of Instagram Business Accounts or Instagram Creator Accounts. One way to find the user ID is by navigating to
Facebook Business Manager Settings > Accounts > Instagram accounts > [Your Account]
and you will see the user ID in the URL - https://business.facebook.com/instagram-account-v2s/{user_id}?business_id={business_id}
-
access_token: str
(required): A long-lived user access token, which can be obtained by following these instructions. Ensure the access token has the following permissions:instagram_basic
instagram_manage_insights
pages_show_list
pages_read_engagement
NOTE: You will need to create a Facebook App if you do not have one already to be able to generate an access token
A full list of supported settings and capabilities for this tap is available by running:
tap-instagram --about
You can easily run tap-instagram
by itself or in a pipeline using Meltano.
tap-instagram --version
tap-instagram --help
tap-instagram --config CONFIG --discover > ./catalog.json
-
Developer TODO:
As a first step, scan the entire project for the text "TODO:
" and complete any recommended steps, deleting the "TODO" references once completed.
pipx install poetry
poetry install
Create tests within the tap_instagram/tests
subfolder and
then run:
poetry run pytest
You can also test the tap-instagram
CLI interface directly using poetry run
:
poetry run tap-instagram --help
Testing with Meltano
Note: This tap will work in any Singer environment and does not require Meltano. Examples here are for convenience and to streamline end-to-end orchestration scenarios.
Your project comes with a custom meltano.yml
project file already created. Open the meltano.yml
and follow any "TODO" items listed in
the file.
Next, install Meltano (if you haven't already) and any needed plugins:
# Install meltano
pipx install meltano
# Initialize meltano within this directory
cd tap-instagram
meltano install
Now you can test and orchestrate using Meltano:
# Test invocation:
meltano invoke tap-instagram --version
# OR run a test `elt` pipeline:
meltano elt tap-instagram target-jsonl
See the dev guide for more instructions on how to use the SDK to develop your own taps and targets.