Convex Tanstack DB Collection

On-demand real-time sync between Convex and TanStack DB collections.

Uses a "backfill + tail" pattern: fetch full history for new filters, then subscribe with a cursor to catch ongoing changes. Convex's OCC guarantees per-key timestamp monotonicity, enabling efficient cursor-based sync without a global transaction log.

Installation

```bash
npm install @michaelstewart/convex-tanstack-db-collection
```

or

```bash
pnpm add @michaelstewart/convex-tanstack-db-collection
```


Example Use Case
Imagine a Slack-like app with messages inside channels:

```typescript
// convex/schema.ts
import { defineSchema, defineTable } from 'convex/server'
import { v } from 'convex/values'

export default defineSchema({
  channels: defineTable({
    name: v.string(),
  }),
  messages: defineTable({
    // Client-generated UUID to support optimistic inserts
    id: v.string(),
    channelId: v.id('channels'),
    authorId: v.string(),
    body: v.string(),
    updatedAt: v.number(),
  })
    .index('by_channel_updatedAt', ['channelId', 'updatedAt'])
    .index('by_author_updatedAt', ['authorId', 'updatedAt']),
})
```

```typescript
// src/collections.ts
import { createCollection, useLiveQuery } from '@tanstack/react-db'
import { convexCollectionOptions } from '@michaelstewart/convex-tanstack-db-collection'
import { api } from '@convex/_generated/api'

// `convexClient` is your app's existing ConvexClient or ConvexReactClient instance
const messagesCollection = createCollection(
  convexCollectionOptions({
    client: convexClient,
    query: api.messages.getMessagesAfter,
    filters: { filterField: 'channelId', convexArg: 'channelIds' },
    getKey: (msg) => msg.id,
  }),
)

// In your UI - TanStack DB extracts channelId from the where clause
const { data: messages } = useLiveQuery((q) =>
  q
    .from({ msg: messagesCollection })
    .where(({ msg }) => msg.channelId.eq(currentChannelId)),
)
```

```typescript
// convex/messages.ts
import { v } from 'convex/values'
import { query } from './_generated/server'

export const getMessagesAfter = query({
  args: {
    channelIds: v.optional(v.array(v.id('channels'))),
    after: v.optional(v.number()),
  },
  handler: async (ctx, { channelIds, after = 0 }) => {
    if (!channelIds || channelIds.length === 0) return []

    // One indexed range read per channel: rows with updatedAt > after
    const results = await Promise.all(
      channelIds.map((channelId) =>
        ctx.db
          .query('messages')
          .withIndex('by_channel_updatedAt', (q) =>
            q.eq('channelId', channelId).gt('updatedAt', after),
          )
          .collect(),
      ),
    )
    return results.flat()
  },
})
```

Design Background

ElectricSQL syncs from Postgres using the write-ahead log (WAL) as a changelog. Every transaction has a globally-ordered transaction ID (txid), so Electric can stream exactly what changed and clients can confirm when their mutations are synced by waiting for specific txids.

Convex doesn't have a global transaction log—there's no single writer assigning sequential IDs. Instead, Convex provides:

1. Deterministic optimistic concurrency control (OCC): Transactions are serializable based on read sets, with automatic deterministic retry on conflicts
2. Reactive subscriptions: Queries automatically re-run when their dependencies change, tracked efficiently via index ranges in query read sets

This adapter uses these two Convex superpowers to construct an update log from an index on `updatedAt`. Because OCC guarantees that `updatedAt` is non-decreasing for any given key (it acts as a Lamport timestamp), we can query with `after: cursor` to fetch only newer records.

The result is efficient cursor-based sync—with two caveats:

1. Index records in the last few seconds of the update log can become visible out of order (solved with tail overlap)
2. Hard deletes are unsupported

Backfill + Tail

We use a two-phase sync:

1. Backfill: Query with `after: 0` to get full current state for filter values
2. Tail: Subscribe with `after: globalCursor - tailOverlapMs` to catch ongoing changes

A single subscription covers all active filter values.

Why one subscription for all filters?

Convex function calls are billed on subscription creation and subscription update. If you have 50 filter values active, 50 separate subscriptions could be expensive. Instead, we merge them into one subscription that tracks changes across all values, using cursor advancement to minimize redundant data.
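To make the merging concrete, here is a minimal sketch of how several active filter values could be folded into the arguments of one subscription. `buildSubscriptionArgs` and the `ActiveFilters` shape are hypothetical names used for illustration, not the adapter's internals; only `after`, `tailOverlapMs`, and the `convexArg` naming come from this README.

```typescript
// Hypothetical helper: names and shapes are illustrative, not the adapter's internals.
type ActiveFilters = Map<string, Set<string>> // convexArg name -> active filter values

interface SubscriptionArgs {
  after: number
  [convexArg: string]: string[] | number
}

function buildSubscriptionArgs(
  active: ActiveFilters,
  globalCursor: number,
  tailOverlapMs: number,
): SubscriptionArgs {
  // Rewind the cursor by the overlap window, never going below zero.
  const args: SubscriptionArgs = { after: Math.max(0, globalCursor - tailOverlapMs) }
  active.forEach((values, convexArg) => {
    // Sort so the same set of values always produces the same args.
    args[convexArg] = Array.from(values).sort()
  })
  return args
}

// Three channels are on screen, but only one set of subscription args is produced.
const active: ActiveFilters = new Map([['channelIds', new Set(['c2', 'c1', 'c3'])]])
const subscriptionArgs = buildSubscriptionArgs(active, 60000, 10000)
console.log(subscriptionArgs) // { after: 50000, channelIds: ['c1', 'c2', 'c3'] }
```

Adding a fourth channel would change the args of the existing subscription rather than create a new one.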

The Tail Overlap

The per-key timestamp guarantee doesn't extend across keys. Specifically, commit order doesn't match timestamp generation order:

```
T=1000: Transaction A generates updatedAt=1000 for key1
T=1001: Transaction B generates updatedAt=1001 for key2
T=1002: Transaction B commits first  → key2 visible with updatedAt=1001
T=1003: Transaction A commits second → key1 visible with updatedAt=1000
```

If we see key2 first, advance globalCursor to 1001, and re-subscribe with after: 1001, we'd never see key1 because 1000 < 1001.

The tail overlap (`tailOverlapMs`, default 10 seconds) solves this by using a conservative subscription cursor:

```typescript
subscriptionCursor = globalCursor - tailOverlapMs
```

This creates an overlap window where we re-receive some data. The LWW (Last-Write-Wins) resolution using updatedAt handles duplicates correctly—for any given key, we keep whichever version has the higher timestamp.
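The LWW rule can be sketched in a few lines. This is an illustration of the idea rather than the adapter's actual code: `Row` and `applyLww` are hypothetical names, and the store is a plain `Map` standing in for the collection.

```typescript
// Illustrative last-write-wins merge; `Row` and `applyLww` are hypothetical names.
interface Row {
  id: string
  body: string
  updatedAt: number
}

// Applies incoming rows to the store and returns how many were actually kept.
function applyLww(store: Map<string, Row>, incoming: Row[]): number {
  let applied = 0
  for (const row of incoming) {
    const existing = store.get(row.id)
    // Keep the incoming row only if it is strictly newer than what we hold.
    if (!existing || row.updatedAt > existing.updatedAt) {
      store.set(row.id, row)
      applied++
    }
  }
  return applied
}

const store = new Map<string, Row>()
applyLww(store, [{ id: 'm1', body: 'hello', updatedAt: 1000 }])

// The overlap window re-delivers m1 alongside a newer edit and a new row.
const appliedCount = applyLww(store, [
  { id: 'm1', body: 'hello', updatedAt: 1000 }, // duplicate: ignored
  { id: 'm1', body: 'hello (edited)', updatedAt: 1005 }, // newer: applied
  { id: 'm2', body: 'hi', updatedAt: 1002 }, // new key: applied
])
console.log(appliedCount) // 2
```

Because duplicates are dropped by a single timestamp comparison, re-receiving rows inside the overlap window costs bandwidth but cannot corrupt local state.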

The tradeoff: A larger overlap means more duplicate data but safer sync. A smaller overlap saves bandwidth but risks missing updates if transactions take longer than the window to commit.
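The following toy simulation replays the timeline above to show why the rewind matters. Everything in it is made up for illustration: the `Visible` type, `queryAfter`, and the 10 ms overlap chosen to fit the toy timestamps.

```typescript
// Simulation only: `Visible`, `queryAfter`, and the timings are made up for illustration.
interface Visible {
  key: string
  updatedAt: number
  visibleAt: number // when the commit became visible to queries
}

const updateLog: Visible[] = [
  { key: 'key2', updatedAt: 1001, visibleAt: 1002 }, // Transaction B commits first
  { key: 'key1', updatedAt: 1000, visibleAt: 1003 }, // Transaction A commits second
]

// What a query with `after: cursor` would return at wall-clock time `now`.
function queryAfter(cursor: number, now: number): string[] {
  return updateLog
    .filter((r) => r.visibleAt <= now && r.updatedAt > cursor)
    .map((r) => r.key)
}

const globalCursor = 1001 // advanced after seeing key2 at T=1002

const withoutOverlap = queryAfter(globalCursor, 1003)
const withOverlap = queryAfter(globalCursor - 10, 1003) // toy tailOverlapMs of 10

console.log(withoutOverlap) // []
console.log(withOverlap) // ['key2', 'key1']
```

With the strict cursor, key1 is never delivered. With the rewound cursor, key1 arrives and key2 is re-delivered as a duplicate that LWW discards.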

The updatedAt Field

Your documents must have an `updatedAt` field that you update on every mutation. To guarantee monotonicity within each key, even with updates from different servers with skewed clocks, use a Lamport-style timestamp:

```typescript
/**
 * Calculate a monotonically increasing updatedAt timestamp.
 * Uses max(Date.now(), prevUpdatedAt + 1) to handle server clock skew.
 */
function getLamportUpdatedAt(prevUpdatedAt: number): number {
  return Math.max(Date.now(), prevUpdatedAt + 1)
}

// On insert
await ctx.db.insert('messages', {
  ...data,
  updatedAt: Date.now(), // No previous timestamp, so Date.now() is fine
})

// On update
const existing = await ctx.db.get(id)
if (!existing) throw new Error('Message not found') // ctx.db.get returns null for a missing document
await ctx.db.patch(id, {
  ...changes,
  updatedAt: getLamportUpdatedAt(existing.updatedAt),
})
```
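To see the formula hold up under clock skew, the sketch below uses the same `max(now, prev + 1)` rule with the clock passed in as a parameter. `lamportUpdatedAt` and the server timings are hypothetical and exist only to make the skew reproducible.

```typescript
// Same formula as getLamportUpdatedAt, with the clock injected so skew can be simulated.
function lamportUpdatedAt(prevUpdatedAt: number, now: number): number {
  return Math.max(now, prevUpdatedAt + 1)
}

// Server A's clock reads 10000 at insert time; server B's clock lags by 500 ms.
const insertedAt = 10000 // written by server A
const firstUpdate = lamportUpdatedAt(insertedAt, 9500) // server B, lagging clock
const secondUpdate = lamportUpdatedAt(firstUpdate, 9600) // server B again
const thirdUpdate = lamportUpdatedAt(secondUpdate, 12000) // server A, clock ahead

console.log([firstUpdate, secondUpdate, thirdUpdate]) // [10001, 10002, 12000]
```

Even though server B's wall clock is behind the stored value, each write still gets a strictly larger timestamp than the one it replaces.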


More Examples

Multiple Filters

You can filter by multiple fields using the same sync query:

```typescript
// Filter by channel OR by author - both use the same getMessagesAfter query
const messagesCollection = createCollection(
  convexCollectionOptions({
    client: convexClient,
    query: api.messages.getMessagesAfter,
    filters: [
      { filterField: 'channelId', convexArg: 'channelIds' },
      { filterField: 'authorId', convexArg: 'authorIds' },
    ],
    getKey: (msg) => msg.id,
  }),
)

// View messages in a channel
const { data: channelMessages } = useLiveQuery((q) =>
  q
    .from({ msg: messagesCollection })
    .where(({ msg }) => msg.channelId.eq(channelId)),
)

// Or view all messages by an author
const { data: authorMessages } = useLiveQuery((q) =>
  q
    .from({ msg: messagesCollection })
    .where(({ msg }) => msg.authorId.eq(userId)),
)
```

Sync Everything

For small datasets, sync everything:

```typescript
const allMessagesCollection = createCollection(
  convexCollectionOptions({
    client: convexClient,
    query: api.messages.getAllMessagesAfter, // Query takes only { after }
    getKey: (msg) => msg.id,
  }),
)
```

See examples/convex-tutorial for a working chat app demonstrating optimistic inserts and cursor-based sync.


Convex Query Setup (Advanced)

Your sync query accepts filter arrays and an after timestamp. Use compound indexes for efficient queries:

```typescript
// convex/messages.ts
import { v } from 'convex/values'
import { query } from './_generated/server'

export const getMessagesAfter = query({
  args: {
    channelIds: v.optional(v.array(v.id('channels'))),
    authorIds: v.optional(v.array(v.string())),
    after: v.optional(v.number()),
  },
  handler: async (ctx, { channelIds, authorIds, after = 0 }) => {
    // Query each channel using the compound index
    if (channelIds && channelIds.length > 0) {
      const results = await Promise.all(
        channelIds.map((channelId) =>
          ctx.db
            .query('messages')
            .withIndex('by_channel_updatedAt', (q) =>
              q.eq('channelId', channelId).gt('updatedAt', after),
            )
            .collect(),
        ),
      )
      return results.flat()
    }

    // For author queries, use a different index (or filter)
    if (authorIds && authorIds.length > 0) {
      const results = await Promise.all(
        authorIds.map((authorId) =>
          ctx.db
            .query('messages')
            .withIndex('by_author_updatedAt', (q) =>
              q.eq('authorId', authorId).gt('updatedAt', after),
            )
            .collect(),
        ),
      )
      return results.flat()
    }

    return []
  },
})
```

The compound index ["channelId", "updatedAt"] allows efficient range queries: "all messages in this channel updated after this timestamp".


Configuration Reference

FilterDimension

```typescript
interface FilterDimension {
  // Field name in TanStack DB queries (e.g., 'channelId')
  filterField: string

  // Convex query argument name (e.g., 'channelIds')
  convexArg: string

  // If true, assert only one value is ever requested (default: false)
  // Throws error if multiple values requested
  single?: boolean
}
```

ConvexCollectionConfig

```typescript
interface ConvexCollectionConfig<T> {
  client: ConvexClient | ConvexReactClient
  query: FunctionReference<'query'>
  getKey: (item: T) => string | number

  // Filter configuration (optional)
  filters?: FilterDimension | FilterDimension[]

  // Timestamp field for LWW conflict resolution (default: 'updatedAt')
  updatedAtFieldName?: string

  // Debounce for batching loadSubset calls (default: 50ms)
  debounceMs?: number

  // Overlap window when rewinding subscription cursor (default: 10000ms)
  // See "The Tail Overlap" section above for why this is needed
  tailOverlapMs?: number

  // Messages before re-subscribing with advanced cursor (default: 10)
  // Set to 0 to disable cursor advancement entirely
  resubscribeThreshold?: number

  // Mutation handlers (the resolved value type is not specified here, so it is shown as unknown)
  onInsert?: (params) => Promise<unknown>
  onUpdate?: (params) => Promise<unknown>
}
```

Tuning tailOverlapMs

The default `tailOverlapMs` of 10 seconds is generous. Convex has a 1-second execution time limit for user code in mutations, so it's unlikely that a record becomes visible multiple seconds after another record with a later timestamp. However, I expect it is technically possible in cases of degraded DB performance or bad clock skew.

Even if you set this ultra-conservatively to 5 minutes, you'd still cut duplicate traffic by orders of magnitude in most apps. Ask yourself: what percentage of data on this page was written in the last 5 minutes? For many applications, it's a small fraction.
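As a rough back-of-the-envelope check, the sketch below estimates what share of rows would be re-sent on a re-subscribe for a given overlap. `resentFraction` and the synthetic dataset (1,000 rows written one minute apart) are assumptions for illustration, not measurements from a real app.

```typescript
// Hypothetical estimate: the share of rows whose updatedAt falls inside the overlap window.
function resentFraction(
  updatedAts: number[],
  globalCursor: number,
  tailOverlapMs: number,
): number {
  if (updatedAts.length === 0) return 0
  const subscriptionCursor = globalCursor - tailOverlapMs
  const resent = updatedAts.filter((t) => t > subscriptionCursor).length
  return resent / updatedAts.length
}

// Synthetic data: 1,000 rows written one minute apart, the newest at `newest`.
const newest = 1000 * 60000
const updatedAts = Array.from({ length: 1000 }, (_, i) => newest - i * 60000)

const tenSeconds = resentFraction(updatedAts, newest, 10000)
const fiveMinutes = resentFraction(updatedAts, newest, 5 * 60000)

console.log(tenSeconds) // 0.001 (1 of 1,000 rows)
console.log(fiveMinutes) // 0.005 (5 of 1,000 rows)
```

For this dataset, even the 5-minute overlap re-sends half a percent of the rows, compared with re-sending all of them when no cursor is used.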

How It Works

1. Filter Extraction: Parses TanStack DB `where` clauses to extract filter values
2. Backfill: Fetches full history for new filter values with `after: 0`
3. Subscription Merging: Maintains a single Convex subscription for all active filter values
4. LWW Conflict Resolution: Uses `updatedAt` timestamps to handle overlapping data
5. Cursor Advancement: Periodically re-subscribes with advanced cursor to reduce data transfer
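The cursor advancement step can be sketched as a small counter. `CursorTracker` is a hypothetical class written for this README, so the adapter's real bookkeeping may differ; it only assumes the documented meanings of `tailOverlapMs` and `resubscribeThreshold` (including that 0 disables advancement).

```typescript
// Hypothetical sketch of cursor advancement; the adapter's real bookkeeping may differ.
class CursorTracker {
  private globalCursor = 0
  private sinceResubscribe = 0
  private readonly tailOverlapMs: number
  private readonly resubscribeThreshold: number

  constructor(tailOverlapMs: number, resubscribeThreshold: number) {
    this.tailOverlapMs = tailOverlapMs
    this.resubscribeThreshold = resubscribeThreshold
  }

  // Record one received row. Returns the new `after` value when it is time
  // to re-subscribe, or null to keep the current subscription.
  onRow(updatedAt: number): number | null {
    this.globalCursor = Math.max(this.globalCursor, updatedAt)
    this.sinceResubscribe++
    if (this.resubscribeThreshold === 0) return null // advancement disabled
    if (this.sinceResubscribe < this.resubscribeThreshold) return null
    this.sinceResubscribe = 0
    return Math.max(0, this.globalCursor - this.tailOverlapMs)
  }
}

const tracker = new CursorTracker(10000, 3)
const decisions = [50000, 50500, 51000, 51200].map((t) => tracker.onRow(t))
console.log(decisions) // [null, null, 41000, null]
```

The third row crosses the threshold, so the subscription would be replaced by one starting at `51000 - 10000`.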

Limitations

Hard Deletes

This adapter does not support hard deletes. When a record is deleted from Convex, other subscribed clients have no way to learn about the deletion—the sync query only returns items that exist.

Use soft deletes instead:

```typescript
// Instead of deleting:
await ctx.db.delete(id)

// Set a status field:
await ctx.db.patch(id, {
  status: 'deleted',
  updatedAt: Date.now(),
})
```

The sync will receive the updated record with status: 'deleted'. Your UI can filter out deleted items:

```typescript
const { data } = useLiveQuery((q) =>
  q
    .from({ item: itemsCollection })
    .where(({ item }) => item.status.eq('active')),
)
```

Filter Operators

Only .eq() and .in() operators are supported for filter extraction. Complex expressions like .gt(), .lt(), or nested or conditions on filter fields won't work.
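To illustrate why only equality-style operators can be extracted, here is a sketch over a deliberately simplified expression tree. The `Expr` type and `extractFilterValues` are hypothetical and are not TanStack DB's real query representation; the point is that `eq` and `in` reduce to a finite list of values to send to Convex, while a range or an `or` does not.

```typescript
// Hypothetical, simplified expression tree; not TanStack DB's real query representation.
type Expr =
  | { op: 'eq'; field: string; value: string }
  | { op: 'in'; field: string; values: string[] }
  | { op: 'gt'; field: string; value: number }
  | { op: 'and'; args: Expr[] }
  | { op: 'or'; args: Expr[] }

function extractFilterValues(expr: Expr, filterField: string): string[] {
  if (expr.op === 'and') {
    // Conjunctions are searched recursively for conditions on the filter field.
    const values: string[] = []
    for (const arg of expr.args) values.push(...extractFilterValues(arg, filterField))
    return values
  }
  if (expr.op === 'or') {
    throw new Error('or conditions are not supported for filter extraction')
  }
  if (expr.field !== filterField) return [] // condition on some other field
  if (expr.op === 'eq') return [expr.value]
  if (expr.op === 'in') return expr.values
  // A range such as gt has no finite list of values to pass as a Convex argument.
  throw new Error(`Unsupported operator on ${filterField}: ${expr.op}`)
}

const whereClause: Expr = {
  op: 'and',
  args: [
    { op: 'in', field: 'channelId', values: ['c1', 'c2'] },
    { op: 'eq', field: 'status', value: 'active' },
  ],
}
const extracted = extractFilterValues(whereClause, 'channelId')
console.log(extracted) // ['c1', 'c2']

let rejected = false
try {
  extractFilterValues({ op: 'gt', field: 'channelId', value: 5 }, 'channelId')
} catch (err) {
  rejected = true
}
console.log(rejected) // true
```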

When to Use This

Consider starting with query-collection if you have few items on screen. It's simpler, uses Convex's built-in `useQuery` under the hood, and is sufficient for many apps.

This adapter is for when you need:

- On-demand sync: Specifically load data matching your current queries
- Cursor-based efficiency: Avoid re-fetching unchanged data on every subscription update
