Get Started with Recce Cloud
Set up Cloud to automate data review on every pull request. This guide walks you through each onboarding step.
Goal
Recce compares Base vs Current environments to validate data changes in every PR:
- Base: your main branch (production)
- Current: your PR branch (development)
- Per-PR schema: an isolated database schema created for each pull request, so multiple PRs can validate simultaneously without conflicts
For accurate comparisons, both environments should use consistent data ranges. See Best Practices for Preparing Environments for environment strategies.
Prerequisites
- Cloud account: free trial at cloud.reccehq.com
- dbt project in a git repository that runs successfully: your environment can execute
dbt buildanddbt docs generate - Repository admin access for setup: required to add workflows and secrets
- Data warehouse: read access to your warehouse for data diffing
Onboarding Process Overview
After signing up, you'll enter the onboarding flow:
- Connect data warehouse
- Connect Git provider
- Add Recce to CI/CD
- Merge the CI/CD change
Recce Web Agent Setup
You can use the Recce Web Agent to help automate your setup. Currently it handles step 3 (Add Recce to CI/CD):
- The agent analyzes your repository and CI/CD setup
- You answer clarifying questions the agent asks about your environment strategy
- The agent creates a PR with customized workflow files
The agent covers common setups and continues to expand coverage. If your setup isn't supported yet, the agent directs you to the Setup Guide below for manual configuration. Need help? Contact us at [email protected].
Setup Guide
This guide explains each onboarding step in detail.
First, go to cloud.reccehq.com and create your free account.
1. Connect Data Warehouse
Provide read-only credentials so Recce can run data diffs against your warehouse.
2. Connect Git Provider
Authorize the Recce app and select the repositories you want to connect.
3. Add Recce to CI/CD
This step adds CI/CD workflow files to your repository. The web agent detects your setup and guides you through. For manual setup, follow the linked guides below.
Choose your setup
| Question | If this is you... | Then... |
|---|---|---|
| How do you run dbt? | You own your dbt run (GitHub Actions, GitLab CI, CircleCI) | Continue reading below |
| You run dbt on a platform (dbt Cloud, Paradime, etc.) | See dbt Cloud Setup | |
| How complex is your environment? | Simple (prod and dev targets) | Continue reading below. We use per-PR schemas for fast setup. See Environment Setup for why. |
| Advanced (multiple schemas, staging environments) | See Environment Setup | |
| What's your CI/CD platform? | GitHub Actions | Continue reading below |
| Other (GitLab CI, CircleCI, etc.) | See Setup CD and Setup CI |
Configure in this order: profile, then CD, then CI. CD establishes the production baseline that CI compares against.
a. Configure your dbt profile
Add ci and prod targets to your profiles.yml so Recce can compare base and current environments.
b. Set up baseline updates (CD)
Add a workflow that uploads production artifacts to Cloud after every merge to main.
c. Set up PR validation (CI)
Add a workflow that uploads PR branch artifacts so Recce can validate changes before merge.
Your workflows use GITHUB_TOKEN (automatically provided by GitHub Actions) and your existing warehouse credential secrets.
recce vs recce-cloud
pip install recce is the open source CLI for local validation. pip install recce-cloud is the CI/CD uploader for Cloud.
4. Merge the CI/CD change
Merge the PR containing the workflow files. After merging:
- The Base workflow automatically uploads your Base to Cloud
- The Current workflow is ready to validate future PRs
In Cloud, verify you see:
- GitHub Integration: Connected
- Warehouse Connection: Connected
- Production Metadata: Updated automatically
- PR Sessions: all open PRs appear in the list. Only PRs with uploaded metadata can be launched for review.
5. Final Steps
You can now:
- See data review summaries in PR comments
- Launch Recce instance to visualize changes
- Review downstream impacts before merging
Verification Checklist
- Base workflow: Trigger manually, check Base metadata appears in Cloud
- Current workflow: Create a test PR, verify PR session appears
- Data diff: Open PR session, run Row Count Diff
Troubleshooting
| Issue | Solution |
|---|---|
| Authentication errors | Confirm repository is connected in Cloud settings |
| Push to main blocked | Check branch protection rules |
| Secret names don't match | Update template to use your existing secret names |
| Workflow fails | Check secrets are configured correctly |
| Artifacts missing | Ensure dbt docs generate completes before upload |
| Warehouse connection fails | Check IP whitelisting; add GitHub Actions IP ranges |
Next Steps
- Environment Setup - Configure dbt profiles and CI/CD variables
- Setup CD - Detailed CD workflow guide
- Setup CI - Detailed CI workflow guide
- Environment Best Practices - Strategies for source data and schema management
