Skip to Content
DestinationsBigQueryOverview

BigQuery

Source + Destination — Vendo streams your Shopify e-commerce data directly to Google BigQuery, giving you a powerful data warehouse for advanced analytics, custom reporting, and machine learning. Own your data and run unlimited SQL queries against your complete customer dataset.

Key Benefits

  • Raw Data Access — Complete, unsampled Shopify data in your BigQuery project
  • Real-time Streaming — Client-side events streamed directly to BigQuery
  • Historical Sync — Full backfill of orders, customers, products, and more
  • SQL Analytics — Run complex queries for custom analysis
  • Data Ownership — Your data in your Google Cloud project

Data Tables

Vendo creates and maintains these tables in your BigQuery dataset:

Table NameDescriptionUpdate Frequency
ordersComplete order dataReal-time + backfill
customersCustomer profilesEvery few hours
productsProduct catalogDaily
eventsClient-side tracking eventsReal-time streaming
abandoned_checkoutsAbandoned cart dataHourly
fulfillmentsOrder fulfillment dataReal-time
inventory_itemsInventory levelsDaily
errorsIntegration error logsReal-time

If ad platform integrations are connected, ad data is also written to BigQuery:

TableDescription
export_ad_dataDaily ad metrics (impressions, clicks, spend) per ad
change_historyAd account change events (Google Ads, Meta Ads)

Dataset Structure

Data is organized in a dataset named vendo_{shop_name} in your GCP project.

Platform Details

SettingValue
Dataset Namingvendo_{shop_name}
Sync MethodServer-side via BigQuery API + real-time client streaming
IdentityShopify Customer ID
Historical BackfillFull order/customer history

Identity & Deduplication

  • Identity — Shopify Customer ID
  • Deduplication — Records are keyed by source IDs within each table

What to Expect After Setup

  1. Dataset Createdvendo_{shop_name} dataset appears in your BigQuery project
  2. Tables Created — All schema tables are created automatically
  3. Historical Backfill — Past data loads within hours
  4. Real-time Events — Client-side events stream immediately

Data Freshness

Data TypeLatency
Client-side eventsReal-time (seconds)
OrdersNear real-time (minutes)
CustomersEvery few hours
ProductsDaily
Abandoned cartsHourly

Verify Setup

  1. Confirm the vendo_{shop_name} dataset exists in BigQuery
  2. Check that new orders or events are appearing in the relevant tables
  3. Review the errors table for any failed syncs

Estimated Storage Costs

BigQuery pricing (approximate):

MetricCost
Storage~$0.02/GB/month
Queries~$5/TB scanned (first 1TB free/month)
Store SizeEstimated Data Volume
Small store< 1 GB/month
Medium store1–10 GB/month
Large store10–100 GB/month

Compatible Sources

BigQuery accepts table exports from all Vendo sources:

SourceWhat Vendo Exports
ShopifyOrders, customers, products, events, abandoned checkouts, fulfillments
StripePayments, subscriptions, customers, invoices
Google AdsAd performance metrics, geo data, change history
Meta AdsAd performance metrics, geo data, change history
TikTok AdsAd performance metrics, geo data
Snap AdsAd performance metrics, geo data
Microsoft AdsAd performance metrics, geo data
LinkedIn AdsAd performance metrics, geo data
X AdsAd performance metrics, geo data
MixpanelEvent and user data
SegmentEvent and user data
AmplitudeEvent and user data

BigQuery also receives output from SQL models, Python models, and audiences.

Last updated on