Blog

Understanding Support Patterns

Understanding Support Patterns #

Finding patterns in support cases helps uncover structural issues and bottlenecks in the data platform.

Just after BigQuery released the ML.GENERATE_TEXT function (mid 2024) and made Gemini accessible through BigQuery, I was curious to test this new feature. And what better use case than understanding support patterns and issues with support on the fly.

Thus, I tried these features by analyzing a sample of our internal support tickets.
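As a rough illustration of the call shape involved, the snippet below builds an ML.GENERATE_TEXT query for summarizing tickets. The project, dataset, model, and table names are placeholders, not the ones used in the post:

```python
# Sketch of a BigQuery ML.GENERATE_TEXT call for summarizing support tickets.
# All project, dataset, model, and table names below are placeholders.
def build_summary_query(model: str, table: str) -> str:
    """Return a BigQuery SQL string asking the remote Gemini model to
    summarize each support ticket."""
    return f"""
    SELECT ml_generate_text_llm_result AS summary
    FROM ML.GENERATE_TEXT(
      MODEL `{model}`,
      (
        SELECT CONCAT('Summarize this support ticket: ', ticket_text) AS prompt
        FROM `{table}`
      ),
      STRUCT(0.2 AS temperature, TRUE AS flatten_json_output)
    )
    """

query = build_summary_query("my-project.llm.gemini_model",
                            "my-project.support.tickets")
```

The inner subquery must expose a `prompt` column; `flatten_json_output` unnests the model response into the `ml_generate_text_llm_result` column.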

...

BigQuery Storage Optimization

BigQuery Storage Optimization #

Over time, data easily accumulates. Purging data that is no longer needed (bad data) can save cost and also reduce the carbon footprint of any data warehouse.

In this post, I describe a simple method to identify unused, and therefore potentially obsolete, data at the table level in BigQuery. This method is easy to reproduce and may also help you reduce your BigQuery storage cost.
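The post walks through the full method; as a hedged sketch of the kind of metadata query involved, `INFORMATION_SCHEMA.TABLE_STORAGE` exposes per-table byte counts that can be ranked (project and region below are placeholders):

```python
# Illustrative only: a metadata query of the kind used to rank tables by size.
# The project id and region qualifier are placeholders, not from the post.
def build_table_size_query(project: str, region: str = "region-us") -> str:
    """Return a SQL string ranking tables by logical bytes via
    BigQuery's INFORMATION_SCHEMA.TABLE_STORAGE view."""
    return f"""
    SELECT table_schema, table_name,
           total_logical_bytes / POW(1024, 3) AS logical_gib
    FROM `{project}`.`{region}`.INFORMATION_SCHEMA.TABLE_STORAGE
    ORDER BY total_logical_bytes DESC
    LIMIT 100
    """
```

Joining such size data with last-access information is what turns "large" into "large and unused".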

...

Ingestion with dbt & DuckDB

Streamlined Data Ingestion with dbt and DuckDB #

Efficient file processing is crucial in data engineering. I recently ran a small experiment exploring the integration of dbt (data build tool) with DuckDB (via dbt-duckdb), enhanced by an Excel plugin. This combination appears to be a simple yet powerful framework for local and remote file processing and ingestion tasks.
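To give a sense of how little setup is involved, a minimal dbt-duckdb profile might look like this (profile name, path, and plugin choice are illustrative assumptions, not the exact setup from the experiment):

```yaml
# profiles.yml -- minimal sketch; names and paths are illustrative
ingestion:
  target: dev
  outputs:
    dev:
      type: duckdb
      path: local.duckdb
      plugins:
        - module: excel
```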

Why Combine dbt and DuckDB? #

By leveraging dbt and DuckDB together, we can ensure:

...

GCS Storage Optimization

GCS Storage Optimization #

With data volumes continuously growing, optimizing Google Cloud Storage usage can lead to significant cost savings. To tackle this challenge, I developed a Python utility that helps summarize and analyze the stored data, making it easier to identify large files and folders on GCS.

While identifying the total storage cost of a bucket is relatively straightforward using the GCP billing report, identifying large files and folders within buckets can be a tedious task. This utility helps to quickly identify large blobs (files) and folders.
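The core of such a utility is the aggregation step. A minimal sketch, assuming the `(blob_name, size_bytes)` pairs have already been listed (in practice via `bucket.list_blobs()` from google-cloud-storage):

```python
from collections import defaultdict

# Sketch of the aggregation step: given (blob_name, size_bytes) pairs,
# sum sizes per top-level "folder" prefix to spot the heavy hitters.
def folder_sizes(blobs, depth=1):
    """Aggregate blob sizes by the first `depth` path components,
    largest first."""
    totals = defaultdict(int)
    for name, size in blobs:
        prefix = "/".join(name.split("/")[:depth])
        totals[prefix] += size
    return dict(sorted(totals.items(), key=lambda kv: -kv[1]))

blobs = [
    ("raw/events/2024/01.parquet", 500),
    ("raw/events/2024/02.parquet", 700),
    ("exports/report.csv", 100),
]
print(folder_sizes(blobs))  # {'raw': 1200, 'exports': 100}
```

Increasing `depth` drills further into the folder hierarchy once a heavy top-level prefix is found.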

...

Economic Cooperative Systems

Economic Cooperative Systems #

I am reflecting on the potential of AI and what has to be built.

I am taking a procurement process as the baseline for a cooperative system, seen from the perspective of the buying model:

  1. Identifying Needs: The whole process starts with a need. This need can be identified by a model itself recognizing that it might not be best suited to solve the problem on its own.
  2. Supplier Research and Selection: Reach out to a marketplace / exchange and find potential suppliers via something like a Request for Proposal/Quotation (RFP/RFQ). The offered prices, together with each vendor's self-assessed confidence in its response, are received as bids on the exchange.
  3. Approval / PO Issuance: The buying model can now pick its supplier based on its configured trade-off between price and quality. (There might be more than one quality metric.)
  4. Task Assignment: Assign the task to the selected model.
  5. Receipt and Inspection: The buying model can now assess the quality of the response, or might leave this assessment to the user.
  6. Payment: A payment can be disputed if the response falls below an objective quality threshold. (There might be quality assurance / audit models within the system to ensure orderly conduct of business.) If the objective measures are met, payment is made. Feedback can be given that informs future decisions, blacklisting of models, etc.
  7. Record Keeping and Audit: To accumulate experience, there must be a record of each transaction with all its parameters. This allows for further analysis and auditing.
  8. Performance Review and Relationship Management: The supplier's performance is reviewed, and feedback is provided. This step also involves maintaining and managing the relationship with the supplier for future transactions.
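Step 3 is the decision point of the loop. A toy sketch of supplier selection under a configured price/quality trade-off (all names, weights, and the scoring formula are illustrative assumptions, not a specification):

```python
from dataclasses import dataclass

# Toy sketch of step 3 (supplier selection): score each bid by a configured
# blend of cheapness and vendor confidence. Everything here is illustrative.
@dataclass
class Bid:
    vendor: str
    price: float        # offered price from the exchange
    confidence: float   # vendor's self-assessed confidence, 0..1

def select_supplier(bids, quality_weight=0.5):
    """Pick the bid with the best blend of low price and high confidence."""
    max_price = max(b.price for b in bids)

    def score(b):
        cheapness = 1 - b.price / max_price
        return (1 - quality_weight) * cheapness + quality_weight * b.confidence

    return max(bids, key=score)

bids = [
    Bid("model-a", price=10.0, confidence=0.60),
    Bid("model-b", price=4.0, confidence=0.50),
    Bid("model-c", price=9.0, confidence=0.95),
]
print(select_supplier(bids).vendor)  # model-b
```

Raising `quality_weight` shifts the pick toward high-confidence vendors; a real system would fold in the audit and feedback records from steps 6 and 7.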

It is easy to derive the vendor perspective from the above.

...
Copyright (c) 2025 Nico Hein