Relationshiped by Pratim Bhosale: Issue #1
Your spot to dive deep into Database internals and learn about everything happening in the database world.
Every issue of Relationshiped will contain recent news articles about the leading and upcoming databases, the most impactful videos and blogs covering database internals, trending packages and tools that people are using and sometimes deep dives into data structures.
Some News 🚀
Open Source Vector Database Startup QdrantRaises $28M - NEWS
Qdrant, a Berlin-based open-source vector database company, raised $28 million in Series A funding to advance AI through their vector search engine and database for unstructured data. They introduced a new binary quantization technology, attracting significant users like Deloitte, Accenture, and Elon Musk's xAI. This funding reflects a growing investor interest in vector databases, paralleled by substantial investments in similar enterprises.
Nile raised 11.6M to build Serverless Postgres - NEWS
Founded by senior engineers of Confluent and backed by some big names from the Tech Industry like the COO of Notion Akshay Kothari, CEO of Confluent Jay Kreps Nile a serverless Postgres claims to help developers build world-class SaaS applications. Nile virtualizes individual tenants into Postgres to enable native tenant data isolation, performance isolation between tenants, per-tenant backups, and tenant placement on shared or dedicated compute globally for latency or compliance.
A Cautionary Tale of TRUNCATE and CASCADE: Unraveling Linear's Data Loss Incident - NEWS
On Wednesday, January 24, Linear experienced a temporary data loss incident from 04:47 to 09:56 UTC. On January 24, 2024, Linear faced a significant data loss incident, all due to a command that went further than intended. TRUNCATE TABLE <new_table> CASCADE
;
This operation was meant to clear out a new table and any related test data. However, the CASCADE option led to the deletion of more data than planned, affecting tables linked by foreign keys. This event highlights the risks associated with using powerful database commands and suggests a need for better warnings or safety features in database tools to prevent such widespread data loss, pointing out that even experienced teams can be caught off guard by the extensive effects of seemingly straightforward commands.
An excellent incident covering blog by the Engineering team of Linear.
Good reads 📖
Are we at peak vector database? - OPINION
There is a wave of vector databases coming up recently. Thanks to the AI wave. But where does this stop? Are vector databases really the future? How are databases encompassing the AI/Chat/RAG/ features in them?
Vart: A persistent data structure for snapshot isolation - DEEP DIVE
This blog post delves into the intricacies of transaction isolation and introduces VART—a persistent data structure designed for snapshot isolation. Furthermore, it offers a glimpse into the evolution of SurrealKV, a persistent in-memory, transactional Key-Value store with versioning support.
SIEVE: An Efficient Turn-Key Eviction Algorithm for Web Caches - PAPER
The paper introduces SIEVE, a new caching eviction algorithm that is simpler than Least Recently Used (LRU) and more efficient and scalable for web cache workloads. Unlike traditional algorithms that often compromise throughput and simplicity for efficiency, SIEVE achieves superior performance with minimal code changes in production cache libraries. Notably, SIEVE outperforms existing state-of-the-art algorithms in terms of lower miss ratios and scalability, as it requires no locking for cache hits and has been proven effective across various cache traces from multiple sources.
Binge watch 🎥
The Compression Algorithm that powers all Time Series Databases
In this video, Arpit takes an in-depth look into a very popular compression/encoding algorithm called Delta Encoding which almost all time-series databases use to store the data. Apart from looking into what it is and how it works, it also goes through the implementation of it in Golang and benchmark the impact.
CMU Advanced Database Systems 2024
My favourite lecture series to learn all things database.
Associate Prof at CMU Andy Pavlo’s live lectures on Database systems.
The course is a comprehensive study of the internals of modern database management systems. It will cover the core concepts and fundamentals of the components that are used in large-scale analytical systems (OLAP)
Check it out🥞
pg-schema-diff - LIBRARY
A pretty cool library from Stripe that computes the diff(erences) between Postgres database schemas and generates the SQL required to get your database schema from point A to B with minimal downtime & locks. This enables you to take your database and migrate it to any desired schema defined in plain DDL.
SurrealML- PACKAGE
SurrealML is a feature that allows you to store trained machine learning models in a special format called 'surml'. This enables you to run these models in either Python or Rust, and even upload them to a SurrealDB node to run the models on the server
Slashbase - IDE
Slashbase is an open-source modern database IDE for your dev/data workflows. Use Slashbase to connect to any of your databases, browse data and schema, write, run and save queries, and create charts. Supports MySQL, PostgreSQL and MongoDB
A LETTER FROM THE EDITOR
While the Tech world is blessed with Hacker News, Reddit, Tech Crunch and other platforms like Twitter to get their updates, there was no single source to be on top of everything happening in the Database World until now.
According to a report by Gartner, global spending on database management systems is expected to increase at a 16.8% annual rate from 2022 to 2027, reaching approximately $203.6 billion by 2027.
I’m Pratim Bhosale, a developer advocate and database enthusiast.
Starting today, Relationshiped will bring you gems from the world of databases.
News, Trends, Resources, interviews and everything that is leaving an impact on this industry.
Stay connected,
Pratim Bhosale