GitHub activity • The Data Quarry


Talks & podcasts

Practical AI Podcast - Sep 2024

It seems like we are hearing a lot about GraphRAG these days, but there are lots of questions: what is it, is it hype, and what is practical? One of our all-time favorite podcast friends, Prashanth Rao, joins us to dig into this topic beyond the hype. Prashanth gives us a bit of background and practical use cases for GraphRAG and graph data.

AICamp San Francisco - Jun 2024

In this talk, we will explore some of the ways in which knowledge graphs are used in conjunction with LLMs and graph databases to power RAG systems that provide insights from structured or unstructured data. We will also highlight some practical methods for building RAG systems that utilize the power of graphs.

FOSDEM 2024 - Lightning Talks - Feb 2024

This talk presents Kùzu: a new open-source graph database management system (GDBMS) that is designed for the graph data science (GDS) ecosystem, specifically in Python. GDS applications require a series of data processing steps, such as extracting data from tabular sources into a graph of nodes and relationships, cleaning and transforming the graph, extracting node features, and finally moving data into a GDS package, such as NetworkX or PyTorch Geometric, for graph analytics. These steps can be performed easily and efficiently by GDBMSs, which provide high-level graph-based data models and query languages to developers. Kùzu is a GDBMS designed to serve as an essential storage system for GDS developers.

Practical AI Podcast - Jul 2023

There’s so much talk (and hype) these days about vector databases. We thought it would be timely and practical to have someone on the show who has been hands-on with the various options and actually tried to build applications leveraging vector search. Prashanth Rao is a real practitioner who has spent a huge amount of time exploring the expanding set of vector database offerings. After introducing vector databases and giving us a mental model of how they fit in with other datastores, Prashanth digs into the trade-offs related to indices, hosting options, embedding vs. query optimization, and more.

Engineer your Career (EYC) Podcast - Jun 2020

In this episode, we talk to Prashanth Rao, a Data Scientist and Software Developer in Vancouver, Canada. In our conversation with Prashanth, we talk about what it was like for him to transition from mechanical engineering to data science, and hear some great approaches to being a better lifelong learner.

Projects

Pydantic Benchmarks

Benchmarks for newer versions of Pydantic v2, written in Rust 🦀

Rust in pieces

Translating Python 🐍 projects to Rust 🦀 via simple bite-sized pieces

DBHub FastAPI

Boilerplate to quickly set up a FastAPI layer on top of numerous NoSQL, graph and vector databases

KùzuDB study

Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset

LanceDB study

Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search