A frontend for this project has been created: eurofuels-front
eurofuels is an improved version of eurocombustibles, being this the backend part.
Retrieve the fuel prices data from the EU Weekly Oil Bulletin, store them on an AWS S3 bucket and display it on a webpage.
For retrieving the data, an AWS Lambda function will be used, which will store the data (as a JSON file) on an AWS S3 bucket. The lambda will run periodically triggered by an AWS EventBridge (by CloudWatch) event.
- Create an S3 Bucket on AWS
- Change Permissions Policy for the following one (to grant public read permission)
{
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Principal": "*",
"Action": "s3:GetObject",
"Resource": "arn:aws:s3:::your-bucket-name/*"
}
]
}
- Give public access: disable the Block public access option
- Change Object Ownership to Bucket Owner Preferred
- Manage ACL (Access Control List): enable read to everyone
- Add CORS policy (under bucket's permission tab). Without it, data cannot be accessed. It can be in JSON or in XML
[
{
"AllowedHeaders": ["Authorization"],
"AllowedMethods": ["GET"],
"AllowedOrigins": ["*"],
"MaxAgeSeconds": 3000
}
]
<CORSConfiguration>
<CORSRule>
<AllowedOrigin>*</AllowedOrigin>
<AllowedMethod>GET</AllowedMethod>
<MaxAgeSeconds>3000</MaxAgeSeconds>
<AllowedHeader>Authorization</AllowedHeader>
</CORSRule>
</CORSConfiguration>
- The AWS S3 Bucket is now set up. Optionally upload the JSON file.
To get the public URL of the file, go to the file and on the Properties tab there it is under Object URL.
The Lambda function is written in Python and makes use of the following libraries:
pandas
numpy
(pandas dependency)pytz
(pandas dependency)openpyxl
(pandas dependency)
These libraries are not installed by default on AWS Python environment, so to set them up a Lambda Layer will be created for each one. These layers will then be applied to the AWS Lambda Function in order to import the libraries.
Follow the following guides to create these layers:
On AWS console go to User Settings → Security credentials and create new Access Keys. A Key_ID
and a Secret_Key
are generated, save them to be used on the script.
The code in charge of retrieving the data from its source and saving it as a JSON file on the S3 Bucket.
Create a Lambda Function with the following params:
- Runtime:
Python 3.9
- Architecture:
x86_64
On the code editor insert the lambda code and click Deploy button to deploy changes.
Set the following environment variables:
ACCESS_KEY
: the previously generatedKey_ID
SECRET_KEY
: the previously generatedSecret_Key
BUCKET_NAME
: the name given to the bucket when it was createdFILE_NAME
: the name of the JSON file to be saved on the bucket
Now apply the four layers previously created (on the code tab at the bottom).
Everything should be ready now, go to test tab and test that it works okay. Go to the S3 Bucket and check that the last time modified was updated.
Here an event will be created that will trigger the execution of the Lambda Function.
- Go to CloudWatch → Rules → Events and Create Rule
- Give name, description and select Schedule as Event Source
- It will redirect to EventBridge service, where the Schedule Pattern will be Recurring Schedule → Cron-based
- Enter the desired cron expression, for example
0 30 8 ? * WED *
(every Wednesday at 08:30) - In the Select targets section choose Lambda function as the target type and in the Function dropdown, select the Lambda function.
- Select to not retry if fail.
Now the Lambda Function will run on the schedule.
Here you can find a useful CRON expression generator.