Build source assets
Execute source assets' build logic to create tables or views in the data warehouse.
Overview
With the source successfully set up, we'll build the three tables and load the data into Snowflake. This step takes the raw data from Google Sheets and transforms it into usable data.
Instructions
Building: Materialize your asset
- Open the bottom drawer
- Select the build tab
- Insert the build command to build and import all three tables:
y42 build -s source:raw_jaffle_shop_data.*
- Click on Visualize DAG to see the assets this command will build
- Click on Build now
- You will see a new row added to the build window
- The build will be queued first, indicated by the blue quarter bar
- Click on the build row to see more details
- You will see the three pending jobs
- Navigate to Logs to view build logs
- Navigate back to Builds jobs
- Click on an individual build job to see the job’s details
- Navigate to Logs to see detailed logs showing how your Google Sheet is accessed, rows are read, and data is written to a file that is then uploaded to your Snowflake warehouse
- After a couple of minutes the build should be Ready.
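To see why one command covers all three tables, the selector source:raw_jaffle_shop_data.* can be read as a wildcard over the tables in the raw_jaffle_shop_data source. The Python sketch below only illustrates that matching idea with the standard library's fnmatch module; it is not how Y42 resolves selectors internally. The asset names raw_customers, raw_payments, and other_source.events are assumptions made for illustration (only raw_orders is named in this guide), and select_sources is a hypothetical helper.

```python
from fnmatch import fnmatchcase

# Hypothetical asset names; only raw_orders appears in this guide.
ASSETS = [
    "raw_jaffle_shop_data.raw_customers",
    "raw_jaffle_shop_data.raw_orders",
    "raw_jaffle_shop_data.raw_payments",
    "other_source.events",
]


def select_sources(selector: str, assets: list[str]) -> list[str]:
    """Return the assets matched by a 'source:<pattern>' selector."""
    prefix = "source:"
    if not selector.startswith(prefix):
        raise ValueError(f"expected a selector starting with {prefix!r}")
    pattern = selector[len(prefix):]
    # '*' matches any run of characters; '.' is treated literally.
    return [name for name in assets if fnmatchcase(name, pattern)]


selected = select_sources("source:raw_jaffle_shop_data.*", ASSETS)
print(selected)
```

Under these assumptions the wildcard picks up every table in the one source and skips assets that belong to other sources, which is why the build window shows three jobs.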
Publication: Adding metadata
To make our data more insightful, we'll add metadata to the raw_orders table using the auto-documentation feature. This will help us understand the columns and their types better. We'll also take a sneak peek at the data to see how it's shaping up.
Add metadata
- Click on raw_orders in the left file selector
- Click Sync columns
- Four columns should be added:
  - order_id
  - customer_id
  - order_date
  - status
- Commit the table metadata
- Click on the Commit & Push button at the top
- Name your commit: adding raw_orders metadata
- Commit your changes
- Wait for the pre-configured checks
- Click on the
View the data
- Open the bottom drawer
- Navigate to Data, where you will see a preview of the row count
- Wait for the data to load
- Verify that you can see data
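When verifying the preview, it can help to confirm that the four synced columns are all present. The Python sketch below shows one way to do that, assuming you have copied the preview's column headers into a list; missing_columns is a hypothetical helper, not part of Y42. The comparison ignores case because Snowflake stores unquoted identifiers in uppercase.

```python
# The four columns this guide expects after Sync columns.
EXPECTED_COLUMNS = ["order_id", "customer_id", "order_date", "status"]


def missing_columns(actual: list[str], expected: list[str] = EXPECTED_COLUMNS) -> list[str]:
    """Return the expected columns absent from `actual`, ignoring case."""
    present = {name.lower() for name in actual}
    return [name for name in expected if name.lower() not in present]


# Column headers as they might appear in the preview (an assumption).
preview_headers = ["ORDER_ID", "CUSTOMER_ID", "ORDER_DATE", "STATUS"]
print(missing_columns(preview_headers))
```

An empty result means every expected column was found; any names returned are columns to investigate before moving on.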
Up next
Now that you've created source tables in the data warehouse, you can reference them as upstream dependencies in dbt models.