This project uses Terraform to build an architecture that consumes, cleans, and stores data from the Punk API. In addition, it provides a machine learning model that can be accessed remotely, via an AWS Lambda, to predict the IBU (International Bitterness Units) of a beer.
To build this project, you must have Terraform installed and an AWS account.
Once both are configured, you can run the project. First of all, clone the repository and open it:
git clone https://github.com/joaorobson/aws_beer_classification.git
cd aws_beer_classification
Create a Python virtual environment and install the dependencies:
python3.9 -m venv env
source env/bin/activate
pip install -r notebooks/requirements.txt
Before building the main architecture, you need to create a Lambda function responsible for loading a pre-trained model from an S3 bucket and making predictions remotely. This step uses container images, given the size limitations AWS imposes on .zip deployment packages.
This can be done with the following steps:
- Set some environment variables:
export AWS_REGION=us-west-2
export BUCKET_NAME="beers-linear-regressor"
export IMAGE_NAME="ibu_prediction_image"
export IMAGE_TAG="latest"
- Create the ECR repository to store the generated image:
terraform apply -target=aws_ecr_repository.ibu_prediction_repository
- Set the REGISTRY_ID and IMAGE_URI environment variables:
export REGISTRY_ID=$(aws ecr \
describe-repositories \
--query 'repositories[?repositoryName == `'$IMAGE_NAME'`].registryId' \
--output text)
export IMAGE_URI=${REGISTRY_ID}.dkr.ecr.${AWS_REGION}.amazonaws.com/${IMAGE_NAME}
- Authenticate the Docker client to the ECR registry using your AWS account ID:
aws ecr get-login-password --region $AWS_REGION | docker login --username AWS --password-stdin [aws_account_id].dkr.ecr.$AWS_REGION.amazonaws.com
- Build and push the Docker image:
cd code/model/
docker build -t $IMAGE_URI .
docker push $IMAGE_URI:$IMAGE_TAG
NOTE: Currently, the Lambda function will not work properly, because it depends on a model version stored in the S3 bucket. To make it work, follow the commands in the next sections.
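As a rough illustration, the handler packaged in the container image could look like the sketch below. The function names, the input fields, and the coefficient values are all assumptions for illustration; in the real handler, the regression parameters would come from the pre-trained model loaded from the S3 bucket rather than being hard-coded:

```python
import json


def predict_ibu(features, coefficients, intercept):
    """Apply a linear regression: IBU = intercept + sum(coef * feature)."""
    return intercept + sum(c * f for c, f in zip(coefficients, features))


def handler(event, context):
    # In the real Lambda, the parameters below would be read from a model
    # artifact downloaded from the S3 bucket; these values are placeholders.
    coefficients = [2.5, 10.0]
    intercept = 5.0
    # Input field names are hypothetical, not the project's exact schema.
    features = [event["abv"], event["ebc"]]
    ibu = predict_ibu(features, coefficients, intercept)
    return {"statusCode": 200, "body": json.dumps({"ibu": ibu})}
```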
After that, to build the architecture in AWS, run the following from the project's root directory:
terraform apply
This command creates all the resources used by the project. The behavior is fairly simple: every 5 minutes, a new beer record is retrieved and stored in two S3 buckets, one with the raw data and another with a cleaned version. The cleaned data bucket can then be used to train a machine learning model locally, as exemplified by this notebook.
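For illustration, the cleaning step that produces the second bucket's contents could be sketched as below. The exact fields the project keeps are not documented here, so the selection of `name`, `abv`, `ebc`, and `ibu` is an assumption:

```python
def clean_beer_record(raw):
    """Reduce a raw Punk API record to the fields needed for training.
    The field selection here is illustrative, not the project's exact schema."""
    return {
        "name": raw.get("name"),
        "abv": raw.get("abv"),
        "ebc": raw.get("ebc"),
        "ibu": raw.get("ibu"),
    }
```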
Now it is possible to train a model using the data collected and stored by the architecture. To do that, run the notebook located here:
./env/bin/jupyter notebook
After running it, you can make predictions via the Lambda created earlier, either from the notebook itself or via the CLI:
cd notebooks
./invoke_predict_ibu.sh
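The script above wraps a CLI invocation of the Lambda; the same request can be issued from Python with boto3. The function name and the payload fields below are assumptions for illustration:

```python
import json


def build_payload(abv, ebc):
    """Build the JSON payload the prediction Lambda is assumed to expect."""
    return json.dumps({"abv": abv, "ebc": ebc}).encode("utf-8")


def invoke_prediction(abv, ebc, function_name="predict_ibu"):
    """Invoke the prediction Lambda; requires boto3 and valid AWS
    credentials. The function name here is hypothetical."""
    import boto3

    client = boto3.client("lambda")
    response = client.invoke(
        FunctionName=function_name,
        Payload=build_payload(abv, ebc),
    )
    return json.loads(response["Payload"].read())
```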
- Terraform Docs - AWS Provider
- The most minimal AWS Lambda + Python + Terraform setup
- DEPLOYING AWS LAMBDA FUNCTIONS WITH TERRAFORM
- Building Lambda Functions with Terraform
- Building a serverless, containerized machine learning model API using AWS Lambda & API Gateway and Terraform
- Amazon ECR - Pushing a Docker image
- Setting Crawler Configuration Options