Blog

Building AI Agents with Google’s ADK

Study Notes from Kaggle’s 5-Day AI Agents Intensive Course

Exploring Anthropic’s Memory Tool

Adding persistent memory to AI agents with the Anthropic Python SDK

Making Sense of Memory in AI Agents

Study notes on agent memory management: How agents remember, recall, and (struggle to) forget information.

The Evolution from RAG to Agentic RAG to Agent Memory

The journey from one-shot retrieval to persistent agent memory

Virtual context management with MemGPT and Letta

Review of the paper ‘MemGPT: Towards LLMs as Operating Systems’ and the Letta framework

Building an AI agent from scratch in Python

How to implement a single AI agent with an LLM API and no frameworks.

First impressions from testing 4 Coding Agents with Jupyter Notebooks

How well do Claude Code and Gemini CLI, with and without Cursor, and Gemini from within Google Colab handle Jupyter Notebook workflows for teaching and experimentation?

37 Things I Learned About Information Retrieval in Two Years at a Vector Database Company

Reflections on what I’ve learned about information retrieval in the last two years working at Weaviate

NeoBERT: A Next-Generation BERT

Who wrote this?

2024 in Review: What I Got Right, Where I Was Wrong, and Bolder Predictions for 2025

What I got right (and wrong) about trends in 2024 and daring to make bolder predictions for the year ahead

The Challenges of Retrieving and Evaluating Relevant Context for RAG

A case study with a grade 1 text understanding exercise for how to measure context relevance in your retrieval-augmented generation system using Ragas, TruLens, and DeepEval

Shifting Tides: The Competitive Edge of Open Source LLMs over Closed Source LLMs

Why I think smaller open source foundation models have already begun replacing proprietary models from providers such as OpenAI in Generative AI applications

Intro to DSPy: Goodbye Prompting, Hello Programming!

How the DSPy framework solves the fragility problem in LLM-based applications by replacing prompting with programming and compiling

Advanced Retrieval-Augmented Generation: From Theory to LlamaIndex Implementation

How to address limitations of naive RAG pipelines by implementing targeted advanced RAG techniques in Python

2023 in Review: Recapping the Post-ChatGPT Era and What to Expect for 2024

How the LLMOps landscape has evolved and why we haven’t seen many Generative AI applications in the wild yet — but maybe in 2024.

Evaluating RAG Applications with RAGAs

A framework with metrics and LLM-generated data to evaluate the performance of your Retrieval-Augmented Generation pipeline

Improving Retrieval Performance in RAG Pipelines with Hybrid Search

How to find more relevant search results by combining traditional keyword-based search with modern vector search

Recreating Amazon’s New Generative AI Feature: Product Review Summaries

How to generate summaries from data in your Weaviate vector database with an OpenAI LLM in Python using a concept called “Generative Feedback Loops”

Retrieval-Augmented Generation (RAG): From Theory to LangChain Implementation

From the theory of the original academic paper to its Python implementation with OpenAI, Weaviate, and LangChain

Recreating Andrej Karpathy’s Weekend Project — a Movie Search Engine

Building a movie recommender system with OpenAI embeddings and a vector database

A Guide on 12 Tuning Strategies for Production-Ready RAG Applications

How to improve the performance of your Retrieval-Augmented Generation (RAG) pipeline with these “hyperparameters” and tuning strategies

Why OpenAI’s API Is More Expensive for Non-English Languages

Beyond words: How byte pair encoding and Unicode encoding factor into pricing disparities

Easily Estimate Your OpenAI API Costs with Tiktoken

Count your tokens and avoid going bankrupt from using the OpenAI API

Getting Started with Weaviate: A Beginner’s Guide to Search with Vector Databases

How to use vector databases for semantic search, question answering, and generative search in Python with OpenAI and Weaviate

Explaining Vector Databases in 3 Levels of Difficulty

From noob to expert: Demystifying vector databases across different backgrounds

Matplotlib Tips to Instantly Improve Your Data Visualizations — According to “Storytelling with Data”

Recreating lessons learned from Cole Nussbaumer Knaflic’s book in Python using Matplotlib

Boosting PyTorch Inference on CPU: From Post-Training Quantization to Multithreading

How to reduce inference time on CPU with clever model selection, post-training quantization with ONNX Runtime or OpenVINO, and multithreading with ThreadPoolExecutor

10 Exciting Project Ideas Using Large Language Models (LLMs) for Your Portfolio

Learn how to build apps and showcase your skills with large language models (LLMs). Get started today!

PyTorch Image Classification Tutorial for Beginners

Fine-tuning pre-trained Deep Learning models in Python

Getting Started with LangChain: A Beginner’s Guide to Building LLM-Powered Applications

A LangChain tutorial to build anything with large language models in Python

Cutout, Mixup, and Cutmix: Implementing Modern Image Augmentations in PyTorch

Data augmentation techniques for Computer Vision implemented in Python

Stationarity in Time Series — A Comprehensive Guide

How to check if a time series is stationary and what you can do if it is non-stationary in Python

How to Save and Load Your Neural Networks in Python

A complete guide to saving and loading checkpoints and entire Deep Learning models in PyTorch and TensorFlow/Keras

Audio Classification with Deep Learning in Python

Fine-tuning image models with PyTorch and torchaudio to tackle domain shift and class imbalance in audio data

Data Augmentation Techniques for Audio Data in Python

How to augment audio in waveform (time domain) and as spectrograms (frequency domain) with librosa, numpy, and PyTorch

2 Simple Steps To Reduce the Memory Usage of Your Pandas Dataframe

How to fit a large dataset into your RAM in Python

A Simple Approach to Hierarchical Time Series Forecasting with Machine Learning

How to “boost” your cyclical sales data forecast with LightGBM and Python

Beginner’s Guide to the Must-Know LightGBM Hyperparameters

The most important LightGBM parameters, what they do, and how to tune them

Building a Recommender System using Machine Learning

“Candidate rerank” approach with co-visitation matrix and GBDT ranker model in Python

Intermediate Deep Learning with Transfer Learning

A practical guide for fine-tuning Deep Learning models for computer vision and natural language processing

Pandas vs. Polars: A Syntax and Speed Comparison

Understanding the major differences between the Python libraries Pandas and Polars for Data Science

Will We Be Using ChatGPT Instead of Google To Get a Christmas Cookie Recipe Next Year?

Will ChatGPT replace search engines? A walkthrough with the use case of looking up a sugar cookie recipe

A Visual Guide to Learning Rate Schedulers in PyTorch

LR decay and annealing strategies for Deep Learning in Python

Kaggle Days Paris 2022

Discussing Data Science with Kagglers while eating macarons

How to Create a PDF Report for Your Data Analysis in Python

Automate PDF generation with the FPDF library as part of your data analysis

How to Create a GIF from Matplotlib Plots in Python

A data visualization technique for 2-dimensional time series data using imageio

A Collection of Must-Know Techniques for Working with Time Series Data in Python

How to manipulate and visualize time series data in datetime format with ease

How to Easily Customize SHAP Plots in Python

Adjust the colors and figure size and add titles and labels to SHAP plots

Everything You Need to Know About the Binary Search Algorithm

Master the Binary Search algorithm in 8 minutes

A Beginner’s Guide to Prompt Design for Text-to-Image Generative Models

Learn these prompt engineering tricks before you waste your free trial credits

Intermediate Data Analysis Techniques for Text Data

How to perform Exploratory Data Analysis on text data for Natural Language Processing

AI-Generated Art: How to Get Started with Generating Your Own Images

A non-technical comparison of DALL·E2, Midjourney, and Stable Diffusion

Fundamental Data Analysis Techniques for Text Data

EDA for NLP: From counts, lengths, and term frequencies to why you don’t need word clouds

Time Series Problems Simply Explained as Fast Food Combo Meals

The difference between univariate vs. multivariate, single-step vs. multistep, and sliding vs. expanding window time series problems

Visualizing Part-of-Speech Tags with NLTK and SpaCy

Customizing displaCy’s entity visualizer

Interpreting ACF and PACF Plots for Time Series Forecasting

How to determine the order of AR and MA models

How to Handle Large Datasets in Python

A Comparison of CSV, Pickle, Parquet, Feather, and HDF5

How to Merge Pandas DataFrames

How to Avoid Losing Valuable Data Points (incl. Cheat Sheet)

Why Your Data Visualizations Should Be Colorblind-Friendly

Especially if You Are Trying to Convince Men

5 Ideas to Create New Features from Polygons

How to Get the Area and Other Features From a WKT String with Shapely

Essential Techniques to Style Pandas DataFrames

How to Effectively Communicate Data with Tables (including Cheat Sheet)