Thomas Kalnik's Technical Blog

Latest Posts

Unified Image Generation and Editing with Flux Kontext: Revolutionizing YouTube Thumbnail Workflows

June 3, 2025

An in-depth exploration of Black Forest Labs' Flux Kontext model and its breakthrough unified approach to image generation and editing, with specific focus on solving character consistency challenges in YouTube thumbnail creation.

Building Intelligent Memory Systems for Multi-Agent AI Architectures

May 20, 2025

Exploring the design and implementation of sophisticated memory systems for agentic AI, including discrete memory records, semantic intent understanding, and graph-based relationship modeling.

GraphRAG Assisted Ideation with a YouTube Knowledge Graph

March 23, 2025

I've been building a system that helps creators generate better video ideas using Retrieval-Augmented Generation (RAG) built on PostgreSQL's pgvector extension and a Neo4j knowledge graph.

Scaling the Summit: Distributed Inference with Meta-Llama-3.1-405B using vLLM

October 13, 2024

This post details the technical approach, configuration, and key insights from deploying Meta-Llama-3.1-405B, one of the largest language models available at the time of writing, using distributed inference with vLLM.

Fine-Tuning Llama 3.1 8B with Direct Preference Optimization: A Distributed Training Approach

September 27, 2024

As part of our deep learning research initiatives, I recently conducted a distributed Direct Preference Optimization (DPO) fine-tuning of the Meta Llama 3.1 8B model.

Building a Multi-Cloud AI Image Generation Service with Flux

August 23, 2024

In this post, I'll share my experience designing and implementing a production image generation system across multiple cloud platforms, with a focus on the technical concepts that could be valuable for similar projects.

Fine-tuning SDXL for Specialized Thumbnail Generation: A Technical Deep Dive

June 4, 2024

I recently undertook a project to fine-tune Stability AI's SDXL model for creating custom thumbnails in a specific visual style.