Machine-Learning

23 Oct '24

Embedded Embeddings Database: Building a low cost serverless RAG solution

Retrieval-Augmented Generation (RAG) solutions are an impressive way to talk to one’s data. One of the challenges of RAG solutions is the associated cost, often driven by the vector database. In a previous blog article I presented how to tackle this issue by using Athena with Locality Sensitive Hashing (LSH) as a knowledge database. One the of the main limitations with Athena is the latency and the low number of concurrent queries. In this new blog article, I present a new low-cost serverless solution that makes use of an embedded vector database, SQLite, to achieve a low cost while maintaining high concurrency.

Read Blog

09 Aug '24

Building a low cost serverless Retrieval-Augmented Generation (RAG) solution

Written by Franck Awounang Nekdem

Large language models (LLMs) can generate complex text and solve numerous tasks such as question-answering, information extraction, and text summarization. However, they may suffer from issues such as information gaps or hallucinations. In this blog article, we will explore how to mitigate these issues using Retrieval Augmented Generation (RAG) and build a low-cost solution in the process.

Read Blog

18 Jun '24

An unsung hero of Amazon SageMaker: Local Mode

Written by Franck Awounang Nekdem

Amazon SageMaker offers a highly customizable platform for machine learning at scale. Job execution within Amazon SageMaker can take some time to set up, which can be inconvenient or even time consuming during development and debugging phases. Running training and processing jobs locally can greatly increase the speed of development and debugging before running them at scale on AWS.

Read Blog

03 Apr '24

Hyperparameter Tuning with Ray 2.x and AWS Sagemaker

Written by Chrishon Nilanthan

Finding the right set of hyperparameters when training machine learning models is a resource-consuming and costly task. In this post, we try to simplify this by exploring hyperparameter tuning for reinforcement learning models with Ray 2.x on AWS Sagemaker.

Read Blog

01 Mar '24

Reinforcement learning with Ray 2.x on Amazon SageMaker

Written by Franck Awounang Nekdem

A few years ago Amazon SageMaker introduced direct support for reinforcement learning (RL) through integration of RL-frameworks, including Ray. However, support has not been kept up to date and the supported versions are no longer what you might call current.

Read Blog

16 Feb '24

Understanding Iterations in Ray RLlib

Written by Maurice Borgmeier

Recently I’ve been engaged in my first reinforcement learning project using Ray’s RLlib and Sagemaker. I had dabbled in machine learning before, but one of the nice things about this project is that it allows me to dive deep into something unfamiliar. Naturally, that results in some mistakes being made. Today I want to share a bit about my experience in trying to improve the iteration time for the IMPALA algorithm in Ray’s RLlib.

Read Blog

Articles tagged with "machine-learning"

Embedded Embeddings Database: Building a low cost serverless RAG solution

Building a low cost serverless Retrieval-Augmented Generation (RAG) solution

An unsung hero of Amazon SageMaker: Local Mode

Hyperparameter Tuning with Ray 2.x and AWS Sagemaker

Reinforcement learning with Ray 2.x on Amazon SageMaker

Understanding Iterations in Ray RLlib