The Data Quarry

Back

GitHub activity

github activities

Talks & podcasts

Practical AI Podcast - Sep 2024
Seems like we are hearing a lot about GraphRAG these days, but there are lots of questions: what is it, is it hype, what is practical? One of our all time favorite podcast friends, Prashanth Rao, joins us to dig into this topic beyond the hype. Prashanth gives us a bit of background and practical use cases for GraphRAG and graph data.
AICamp San Francisco - Jun 2024
In this talk, we will explore some of the ways in which knowledge graphs are used in conjunctions with LLMs and graph databases to power RAG systems that provide insights from structured or unstructured data. We will also highlight some practical methods on how to build RAG systems that utilize the power of graphs.
FOSDEM 2024 - Lightning Talks - Feb 2024
This talk presents Kùzu: a new open-sourced graph database management system (GDBMS) that is designed for graph data science (GDS) eco-system, specifically in Python. GDS applications require a series of data processing steps, such as extracting data from tabular sources into a graph of nodes and relationships, cleaning and transforming the graph, extracting node features, and finally moving data into a GDS package, such as NetworkX and PyTorch Geometric for graph analytics. These steps can be performed easily and efficiently by GDBMSs, which provide high-level graph-based data models and query languages to developers. Kùzu is a GDBMS designed to serve as an essential storage system for GDS developers.
Practical AI Podcast - Jul 2023
There’s so much talk (and hype) these days about vector databases. We thought it would be timely and practical to have someone on the show that has been hands on with the various options and actually tried to build applications leveraging vector search. Prashanth Rao is a real practitioner that has spent and huge amount of time exploring the expanding set of vector database offerings. After introducing vector database and giving us a mental model of how they fit in with other datastores, Prashanth digs into the trade offs as related to indices, hosting options, embedding vs. query optimization, and more.
Engineer your Career (EYC) Podcast - Jun 2020
In this episode, we talk to Prashanth Rao a Data Scientist and Software Developer in Vancouver, Canada. In our conversation with Prashanth, we talk about what it was like for him to transition from mechanical engineering to data science as well as hear some great approaches to being a better lifelong learner.

Projects

Pydantic Benchmarks
Benchmarks for newer versions of Pydantic v2, written in Rust 🦀
Rust in pieces
Translating Python 🐍 projects to Rust 🦀 via simple bite-sized pieces
DBHub FastAPI
Boilerplate to quickly set up a FastAPI layer on top of numerous NoSQL, graph and vector databases
KúzuDB study
Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset
LanceDB study
Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search