Vector Vanguard: Tracking the Pulse of Vector Tech 08/2024
Welcome to "Vector Vanguard: Tracking the Pulse of Vector Tech 08/2024" - a source for the latest developments in vector databases, vector indexes, RAG (Retrieval-Augmented Generation), similarity search, and related technologies that caught my attention in the last...
Vector Vanguard: Tracking the Pulse of Vector Tech 07/2024
Welcome to "Vector Vanguard: Tracking the Pulse of Vector Tech 07/2024" - a source for the latest developments in vector databases, vector indexes, RAG (Retrieval-Augmented Generation), similarity search, and related technologies that caught my attention in the last...
Oracle AI Vector – Semantic Search
With the advent of Large Language Models (LLMs), vector databases are becoming increasingly popular. Vector databases and similar approaches, such as those for geodata, have existed for a long time. Since Oracle Database 23ai, Oracle has offered vector...
Vector Indexes in Vector Databases: Semantic Search Performance
Vector indexes are crucial for semantic search performance because they make querying efficient. In this article, I will delve into various types of vector indexes, their workings, pros and cons, and recommendations for their use. The article also provides a practical...
Similarity search in vector databases: a comprehensive guide
Similarity search in vector databases has emerged as a pivotal technique enabling efficient retrieval of information by comparing complex data points within high-dimensional spaces. The ability to find similar items efficiently is crucial for applications ranging from...
Vector Database – What, Why, and How
In today's data-driven world, vector databases are available to handle complex, high-dimensional data. This article describes vector databases, including use cases, and gives an example using the PostgreSQL extension pgvector. What is a vector database? A vector...
How to Be Useful: Unpacking Arnold Schwarzenegger’s Secrets to Success
Did you know that the man who conquered bodybuilding, Hollywood, and the political arena believes that his multifaceted success boils down to just seven principles? Yes, Arnold Schwarzenegger, in his book "Be Useful: Seven Tools for Life," distills the essence of his...
Data visualization with Flourish
Flourish is a data visualization and storytelling platform that helps data enthusiasts understand and communicate complex data. With a wide range of customizable templates and interactive features, Flourish makes it easy to create beautiful and engaging visualizations...
Predictions about data for 2023 and beyond
End of the year: it’s time for predictions. Let’s have a look at some predictions regarding data. There are many predictions for Machine Learning, Deep Learning, and AI - explainability, professionalisation, and...
Data Vault and Star Schema with PlantUML: Entity Relationship Diagram as Code
Entity Relationship Diagram as code means developers use the same tools for creating the diagrams - or documentation in general - as for coding. Documentation includes more than just source code and some comments. If the documentation is textual and not binary,...
Materialization examples of Data Engineering with dbt
dbt offers several materialization options to create ETL/ELT processes. The article shows and compares various approaches to using dbt for ETL/ELT. A previous post contains an introduction to dbt: Data Engineering with dbt – first steps using PostgreSQL and...
Data Engineering with dbt – first steps using PostgreSQL and Oracle
dbt is a Data Engineering tool supporting version control with CI/CD for transformations and materialization. The approach with dbt differs from tools like SSIS, Data Factory, and Informatica. The developer models the target tables/views and the transformations. dbt uses...