Leonie Monigatti
  • Blog
Categories
All (87)
AWS Machine Learning Blog (1)
Docker Blog (1)
DZ Bank Innovation Blog (4)
Paper review (3)
Stackoverflow Blog (1)
Towards Data Science Archive (53)
Weaviate Blog (10)
Weights & Biases Blog (6)

Blog

Agent Memory: Filesystem vs Database

A discussion on the trade-offs of different storage solutions for agent memory
Jan 19, 2026

Building AI Agents with Google’s ADK

Study Notes from Kaggle’s 5-Day AI Agents Intensive Course
Nov 26, 2025

Exploring Anthropic’s Memory Tool

Adding persistent memory to AI agents with the Anthropic Python SDK
Nov 25, 2025

Making Sense of Memory in AI Agents

Study notes on agent memory management: How agents remember, recall, and (struggle to) forget information.
Nov 20, 2025

The Evolution from RAG to Agentic RAG to Agent Memory

The journey from one-shot retrieval to persistent agent memory
Nov 3, 2025

Virtual context management with MemGPT and Letta

Review of the paper ‘MemGPT: Towards LLMs as Operating Systems’ and the Letta framework
Paper review
Oct 17, 2025

Building an AI agent from scratch in Python

How to implement a single AI agent with an LLM API and no frameworks.
Sep 30, 2025

First impressions from testing 4 Coding Agents with Jupyter Notebooks

How well do Claude Code and Gemini CLI with and without Cursor and Gemini from within Google Colab handle Jupyter Notebook workflows for teaching and experimentation?
Jul 28, 2025

37 Things I Learned About Information Retrieval in Two Years at a Vector Database Company

Reflections on what I’ve learned about information retrieval in the last two years working at Weaviate
Jul 3, 2025

NeoBERT: A Next-Generation BERT

Paper review
Jun 25, 2025

Who wrote this?

And why?
Jun 24, 2025

An Overview of Late Interaction Retrieval Models: ColBERT, ColPali, and ColQwen

Weaviate Blog
Apr 9, 2025

What Are Agentic Workflows? Patterns, Use Cases, Examples, and More

Weaviate Blog
Mar 6, 2025

2024 in Review: What I Got Right, Where I Was Wrong, and Bolder Predictions for 2025

What I got right (and wrong) about trends in 2024 and daring to make bolder predictions for the year ahead
Towards Data Science Archive
Dec 17, 2024

ハイブリッド検索とは?

Weaviate Blog
Dec 11, 2024

What is Agentic RAG

Weaviate Blog
Nov 5, 2024

A Gentle Introduction to Vector Databases

Weaviate Blog
Aug 1, 2024

OpenAI’s Matryoshka Embeddings in Weaviate

Weaviate Blog
Jun 18, 2024

The Challenges of Retrieving and Evaluating Relevant Context for RAG

A case study with a grade 1 text understanding exercise for how to measure context relevance in your retrieval-augmented generation system using Ragas, TruLens, and DeepEval
Towards Data Science Archive
Jun 10, 2024

Step-by-Step Guide to Choosing the Best Embedding Model for Your Application

Weaviate Blog
Jun 4, 2024

Evaluation Metrics for Search and Recommendation Systems

Weaviate Blog
May 28, 2024

Building a Local RAG System for Privacy Preservation with Ollama and Weaviate

Weaviate Blog
May 21, 2024

Shifting Tides: The Competitive Edge of Open Source LLMs over Closed Source LLMs

Why I think smaller open source foundation models have already begun replacing proprietary models by providers, such as OpenAI, in Generative AI applications
Towards Data Science Archive
Apr 29, 2024

Intro to DSPy: Goodbye Prompting, Hello Programming!

How the DSPy framework solves the fragility problem in LLM-based applications by replacing prompting with programming and compiling
Towards Data Science Archive
Paper review
Feb 27, 2024

Advanced Retrieval-Augmented Generation: From Theory to LlamaIndex Implementation

How to address limitations of naive RAG pipelines by implementing targeted advanced RAG techniques in Python
Towards Data Science Archive
Feb 19, 2024

Using Weaviate with Non-English Languages

Weaviate Blog
Jan 30, 2024

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace

AWS Machine Learning Blog
Jan 24, 2024

What Is MLOps? Machine Learning Operations Explained

Weights & Biases Blog
Jan 11, 2024

2023 in Review: Recapping the Post-ChatGPT Era and What to Expect for 2024

How the LLMOps landscape has evolved and why we haven’t seen many Generative AI applications in the wild yet – but maybe in 2024.
Towards Data Science Archive
Dec 18, 2023

Evaluating RAG Applications with RAGAs

A framework with metrics and LLM-generated data to evaluate the performance of your Retrieval-Augmented Generation pipeline
Towards Data Science Archive
Dec 13, 2023

Improving Retrieval Performance in RAG Pipelines with Hybrid Search

How to find more relevant search results by combining traditional keyword-based search with modern vector search
Towards Data Science Archive
Nov 28, 2023

Recreating Amazon’s New Generative AI Feature: Product Review Summaries

How to generate summaries from data in your Weaviate vector database with an OpenAI LLM in Python using a concept called “Generative Feedback Loops”
Towards Data Science Archive
Nov 21, 2023

Retrieval-Augmented Generation (RAG): From Theory to LangChain Implementation

From the theory of the original academic paper to its Python implementation with OpenAI, Weaviate, and LangChain
Towards Data Science Archive
Nov 14, 2023

Recreating Andrej Karpathy’s Weekend Project — a Movie Search Engine

Building a movie recommender system with OpenAI embeddings and a vector database
Towards Data Science Archive
Nov 7, 2023

A Guide on 12 Tuning Strategies for Production-Ready RAG Applications

How to improve the performance of your Retrieval-Augmented Generation (RAG) pipeline with these “hyperparameters” and tuning strategies
Towards Data Science Archive
Oct 6, 2023

From prototype to production: Vector databases in generative AI applications

What exactly is a vector database? And how does it relate to generative AI?
Stackoverflow Blog
Sep 23, 2023

How to Get Started with the Weaviate Vector Database on Docker

Docker Blog
Sep 19, 2023

Why OpenAI’s API Is More Expensive for Non-English Languages

Beyond words: How byte pair encoding and Unicode encoding factor into pricing disparities
Towards Data Science Archive
Aug 16, 2023

Easily Estimate Your OpenAI API Costs with Tiktoken

Count your tokens and avoid going bankrupt from using the OpenAI API
Towards Data Science Archive
Aug 1, 2023

Getting Started with Weaviate: A Beginner’s Guide to Search with Vector Databases

How to use vector databases for semantic search, question answering, and generative search in Python with OpenAI and Weaviate
Towards Data Science Archive
Jul 18, 2023

Explaining Vector Databases in 3 Levels of Difficulty

From noob to expert: Demystifying vector databases across different backgrounds
Towards Data Science Archive
Jul 4, 2023

Matplotlib Tips to Instantly Improve Your Data Visualizations — According to “Storytelling with Data”

Recreating lessons learned from Cole Nussbaumer Knaflic’s book in Python using Matplotlib
Towards Data Science Archive
Jun 20, 2023

Boosting PyTorch Inference on CPU: From Post-Training Quantization to Multithreading

How to reduce inference time on CPU with clever model selection, post-training quantization with ONNX Runtime or OpenVINO, and multithreading with ThreadPoolExecutor
Towards Data Science Archive
Jun 13, 2023

10 Exciting Project Ideas Using Large Language Models (LLMs) for Your Portfolio

Learn how to build apps and showcase your skills with large language models (LLMs). Get started today!
Towards Data Science Archive
May 15, 2023

PyTorch Image Classification Tutorial for Beginners

Fine-tuning pre-trained Deep Learning models in Python
Towards Data Science Archive
May 9, 2023

Getting Started with LangChain: A Beginner’s Guide to Building LLM-Powered Applications

A LangChain tutorial to build anything with large language models in Python
Towards Data Science Archive
Apr 25, 2023

Understanding LLMOps: Large Language Model Operations

Weights & Biases Blog
Apr 16, 2023

Cutout, Mixup, and Cutmix: Implementing Modern Image Augmentations in PyTorch

Data augmentation techniques for Computer Vision implemented in Python
Towards Data Science Archive
Apr 14, 2023

Stationarity in Time Series — A Comprehensive Guide

How to check if a time series is stationary and what you can do if it is non-stationary in Python
Towards Data Science Archive
Apr 11, 2023

How to Save and Load Your Neural Networks in Python

A complete guide to saving and loading checkpoints and entire Deep Learning models in PyTorch and TensorFlow/Keras
Towards Data Science Archive
Apr 5, 2023

Audio Classification with Deep Learning in Python

Fine-tuning image models to tackle domain shift and class imbalance with PyTorch and torchaudio in audio data
Towards Data Science Archive
Apr 4, 2023

Data Augmentation Techniques for Audio Data in Python

How to augment audio in waveform (time domain) and as spectrograms (frequency domain) with librosa, numpy, and PyTorch
Towards Data Science Archive
Mar 28, 2023

2 Simple Steps To Reduce the Memory Usage of Your Pandas Dataframe

How to fit a large dataset into your RAM in Python
Towards Data Science Archive
Mar 21, 2023

A Simple Approach to Hierarchical Time Series Forecasting with Machine Learning

How to “boost” your cyclical sales data forecast with LightGBM and Python
Towards Data Science Archive
Mar 14, 2023

Beginner’s Guide to the Must-Know LightGBM Hyperparameters

The most important LightGBM parameters, what they do, and how to tune them
Towards Data Science Archive
Mar 7, 2023

Building a Recommender System using Machine Learning

“Candidate rerank” approach with co-visitation matrix and GBDT ranker model in Python
Towards Data Science Archive
Mar 1, 2023

Intermediate Deep Learning with Transfer Learning

A practical guide for fine-tuning Deep Learning models for computer vision and natural language processing
Towards Data Science Archive
Feb 22, 2023

Intro to MLOps: Data and Model Versioning

Weights & Biases Blog
Jan 17, 2023

Pandas vs. Polars: A Syntax and Speed Comparison

Understanding the major differences between the Python libraries Pandas and Polars for Data Science
Towards Data Science Archive
Jan 11, 2023

Will We Be Using ChatGPT Instead of Google To Get a Christmas Cookie Recipe Next Year?

Will ChatGPT replace search engines? A walkthrough with the use case of looking up a sugar cookie recipe
Towards Data Science Archive
Dec 22, 2022

Intro to MLOps: Hyperparameter Tuning

Weights & Biases Blog
Dec 20, 2022

A Visual Guide to Learning Rate Schedulers in PyTorch

LR decay and annealing strategies for Deep Learning in Python
Towards Data Science Archive
Dec 6, 2022

Kaggle Days Paris 2022

Discussing Data Science with Kagglers while eating macarons
Towards Data Science Archive
Nov 22, 2022

Intro to MLOps: Machine Learning Experiment Tracking

Weights & Biases Blog
Nov 22, 2022

How to Create a PDF Report for Your Data Analysis in Python

Automate PDF generation with the FPDF library as part of your data analysis
Towards Data Science Archive
Oct 25, 2022

How to Create a GIF from Matplotlib Plots in Python

A data visualization technique for 2-dimensional time series data using imageio
Towards Data Science Archive
Oct 18, 2022

A Collection of Must-Know Techniques for Working with Time Series Data in Python

How to manipulate and visualize time series data in datetime format with ease
Towards Data Science Archive
Oct 12, 2022

How to Easily Customize SHAP Plots in Python

Adjust the colors and figure size and add titles and labels to SHAP plots
Towards Data Science Archive
Oct 4, 2022

Everything You Need to Know About the Binary Search Algorithm

Master the Binary Search algorithm in 8 minutes
Towards Data Science Archive
Sep 27, 2022

A gentle introduction to time series analysis & forecasting

Fundamental concepts around time series analysis and time series forecasting, including everything from classical approaches to modern machine learning models with examples in Python
Weights & Biases Blog
Sep 25, 2022

A Beginner’s Guide to Prompt Design for Text-to-Image Generative Models

Learn these prompt engineering tricks before you waste your free trial credits
Towards Data Science Archive
Sep 20, 2022

Intermediate Data Analysis Techniques for Text Data

How to perform Exploratory Data Analysis on text data for Natural Language Processing
Towards Data Science Archive
Sep 13, 2022

AI-Generated Art: How to Get Started with Generating Your Own Images

A non-technical comparison of DALL·E2, Midjourney, and Stable Diffusion
Towards Data Science Archive
Sep 7, 2022

Fundamental Data Analysis Techniques for Text Data

EDA for NLP: From counts, lengths, and term frequencies to why you don’t need word clouds
Towards Data Science Archive
Aug 31, 2022

Time Series Problems Simply Explained as Fast Food Combo Meals

The difference between univariate vs. multivariate, single-step vs. multistep, and sliding vs. expanding window time series problems
Towards Data Science Archive
Aug 23, 2022

99 Lessons on Data Analysis from Placing Top 5 in 5 Kaggle Analytics Challenges

(Grand)Masterclass: How to approach (and win) a Kaggle Analytics Competition
Towards Data Science Archive
Aug 16, 2022

Visualizing Part-of-Speech Tags with NLTK and SpaCy

Customizing displaCy’s entity visualizer
Towards Data Science Archive
Aug 9, 2022

Interpreting ACF and PACF Plots for Time Series Forecasting

How to determine the order of AR and MA models
Towards Data Science Archive
Aug 2, 2022

How to Handle Large Datasets in Python

A Comparison of CSV, Pickle, Parquet, Feather, and HDF5
Towards Data Science Archive
Jul 26, 2022

How to Merge Pandas DataFrames

How to Avoid Losing Valuable Data Points (incl. Cheat Sheet)
Towards Data Science Archive
Jul 20, 2022

Why Your Data Visualizations Should Be Colorblind-Friendly

Especially if You Are Trying to Convince Men
Towards Data Science Archive
Jul 12, 2022

5 Ideas to Create New Features from Polygons

How to Get the Area and Other Features From a WKT String with Shapely
Towards Data Science Archive
Jul 6, 2022

Essential Techniques to Style Pandas DataFrames

How to Effectively Communicate Data with Tables (including Cheat Sheet)
Towards Data Science Archive
Jun 27, 2022

Warum Expertensysteme nicht als Teilgebiet von Künstlicher Intelligenz wahrgenommen werden

DZ Bank Innovation Blog
Oct 18, 2017

Innovationstag 2017: die Genossenschaftliche FinanzGruppe im Aufbruch

DZ Bank Innovation Blog
Oct 11, 2017

Trendscouts nehmen Machine Learning unter die Lupe: Kundensegmentierung (Teil 2)

DZ Bank Innovation Blog
Aug 23, 2017

Trendscouts nehmen Machine Learning unter die Lupe: Kundensegmentierung (Teil 1)

DZ Bank Innovation Blog
Aug 21, 2017
No matching items
    Back to top
    • Hi, I am Leonie, a machine learning engineer and technical writer. I help developers build vector-based AI solutions. My writing focuses on machine learning and AI engineering.
    • Copyright 2025, Leonie Monigatti
    • Imprint

    • Privacy Policy

    Cookie Consent