
Loading...
Source chonks define your raw data tables and create the foundation for your dbt project's data lineage. They're the entry point for all your data.
A Source chonk analyzes your raw tables and generates comprehensive documentation and data quality tests.
Source definition with table and column documentation
Automated checks for data staleness
AI-generated descriptions for each column
Create a Source chonk when you want to establish the starting point for your data pipeline.
Here's what DataChonk generates when you create a Source chonk for e-commerce order data.
version: 2
sources:
- name: raw_ecommerce
description: Raw e-commerce data from the production database
database: analytics
schema: raw
freshness:
warn_after: {count: 12, period: hour}
error_after: {count: 24, period: hour}
tables:
- name: orders
description: Raw orders table containing all customer orders
loaded_at_field: _loaded_at
columns:
- name: order_id
description: Unique identifier for the order
tests:
- unique
- not_null
- name: customer_id
description: Foreign key to the customers table
tests:
- not_null
- name: order_date
description: Date when the order was placed
- name: total_amount
description: Total order amount in USD
- name: status
description: Current order status (pending, shipped, delivered, cancelled)Customize how your Source chonk generates code with these options.
| Property | Type | Default | Description |
|---|---|---|---|
| include_freshness | boolean | true | Add freshness tests to detect stale data |
| include_tests | boolean | true | Add column-level tests (unique, not_null) |
| generate_descriptions | boolean | true | Use AI to generate column descriptions |
| freshness_warn | number | 12 | Hours before freshness warning |
| freshness_error | number | 24 | Hours before freshness error |
Get better results from Chonk by being specific about your source requirements.
Example Prompt