analytix consulting blog

Vector search vs. model retraining: a comparison of audio similarity search methods

Vector databases are getting a lot of attention now that enterprises are applying LLMs together with RAG techniques. In many use cases, these databases sit behind frameworks like LangChain, which abstract away both document ingestion and the retrieval of relevant documents, so there is no need to interact with the vector database directly. I wanted to take a deeper dive into a vector database product to see how to set one up and work with it, this time without yet another RAG use case.
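As a rough illustration of the retrieval step that such a database performs, here is a minimal brute-force sketch in plain Python. It assumes cosine similarity as the distance measure and uses made-up three-dimensional embeddings; the names `cosine_similarity`, `top_k`, and the `clip_*` keys are hypothetical, and a real vector database would use an approximate index rather than scanning every vector.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query, index, k=2):
    """Return the k (name, score) pairs most similar to the query."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in index.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings; real ones would come from an embedding model.
index = {
    "clip_a": [1.0, 0.0, 0.0],
    "clip_b": [0.9, 0.1, 0.0],
    "clip_c": [0.0, 1.0, 0.0],
}

results = top_k([1.0, 0.0, 0.0], index, k=2)
print(results)
```

The query vector is compared against every stored vector and the best matches are returned by name, which is the same contract a vector database exposes at a much larger scale.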

Whisper-based video text extraction tool

Whisper is a speech-to-text model built by OpenAI that can easily be run on local hardware, even at the largest model sizes. I will present a Streamlit-based tool for extracting text snippets from video files with the Whisper model, without having to rely on the command line.

NVIDIA DeepStream for highly performant video stream analytics

If, like me, you have experimented with applying deep learning models to video streams for object detection or scene classification, you have probably run into approaches where a Python script extracts each frame from a video file in a loop and feeds the frames to a model one at a time. In this blog post I will present a much more performant approach based on NVIDIA's DeepStream toolkit.

Running a Self-Hosted LLM for Email Address Classification

Recently I was asked to come up with a solution for classifying a list of tens of thousands of email addresses based on their details. Although the original request was a bit different, in this blog post I will use the example of classifying each address as coming from a personal email provider (like Gmail, Yahoo, etc.) or from a business or other institution. I will demonstrate how to use a self-hosted Zephyr-7B model with the LitServe package to perform this task very efficiently.
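To make the task itself concrete, here is a naive rule-based baseline in plain Python. This is not the Zephyr-7B approach the post describes; it is only a sketch of the input and output of the classification, and the `PERSONAL_DOMAINS` set, the label strings, and the `classify_email` helper are all assumptions made for illustration.

```python
# Assumed, deliberately tiny list of personal email providers.
PERSONAL_DOMAINS = {"gmail.com", "yahoo.com"}


def classify_email(address):
    """Label an address as 'personal', 'business_or_institution', or 'invalid'."""
    # Split on the last "@" so the domain is everything after it.
    local, sep, domain = address.strip().rpartition("@")
    if not sep or not local or not domain:
        return "invalid"
    if domain.lower() in PERSONAL_DOMAINS:
        return "personal"
    return "business_or_institution"


for example in ["jane@gmail.com", "info@Example.org", "not-an-address"]:
    print(example, "->", classify_email(example))
```

A lookup like this only works for providers that are already on the list, which is one reason a more flexible classifier can be attractive for long, messy address lists.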

Installing Nextcloud Plugin on TrueNAS CORE With Letsencrypt Support in 2024

Nowadays it feels like the TrueNAS CORE community is shrinking every month. I have been a user of this solution for more than 8 years now, but some aspects of the system, like plugins, have never matured to the point where they are really usable. I have been using the Nextcloud plugin for years (after upgrading from ownCloud), but updating the plugin, either through the designated plugin architecture or from the Nextcloud management screens, is quite often painful and tends to break the jail when there are major upgrades to key components like PHP.