Data Blog by Andreas Buckenhofer

Similarity search in vector databases: a comprehensive guide

Similarity search in vector databases has emerged as a pivotal technique enabling efficient retrieval of information by comparing complex data points within high-dimensional spaces. The ability to find similar items efficiently is crucial for applications ranging from...

Vector Database – What, Why, and How

In today's data-driven world, vector databases are available to handle complex, high-dimensional data. This article describes vector databases, including use cases and an example with the PostgreSQL extension pgvector. What is a vector database? A vector...

Data visualization with Flourish

Flourish is a data visualization and storytelling platform that helps data enthusiasts understand and communicate complex data. With a wide range of customizable templates and interactive features, Flourish makes it easy to create beautiful and engaging visualizations...

Predictions about data for 2023 and beyond

End of the year: it’s time for predictions. Let’s have a look at some predictions regarding data. There are many predictions for Machine Learning, Deep Learning, and AI - explainability, professionalisation, and...

Materialization examples of Data Engineering with dbt

dbt offers several materialization options for building ETL/ELT processes. The article shows and compares various approaches to using dbt for ETL/ELT. A previous post contains an introduction to dbt: Data Engineering with dbt – first steps using PostgreSQL and...

PostgreSQL application_name

The PostgreSQL application_name parameter can be set in the connection string. The view pg_stat_activity then shows the application_name, which helps to identify sessions. The article shows how to set application_name and how to benefit from it. It is highly recommended to set the...

PostgreSQL columnar extension cstore_fdw

The PostgreSQL columnar extension cstore_fdw is a storage extension that is suited to OLAP-/DWH-style queries and data-intensive applications. Columnar analytical databases have unique characteristics compared to row-oriented data access. Many commercial products exist:...

PostgreSQL partitioning guide

PostgreSQL partitioning is a powerful feature when dealing with huge tables. Partitioning allows breaking a table into smaller chunks, aka partitions. Logically, there appears to be only one table when accessing the data, but physically there are several partitions....

