This is a story we've lived through more than once, but this particular client sticks with us. A mid-size fashion retailer — about 40 employees, growing fast — was running their entire analytics operation on a single shared Excel workbook. Fourteen sheets. Six people with edit access. Thousands of manual formulas, some of which nobody could explain anymore. Every Friday, two analysts spent three hours exporting transaction data from their e-commerce platform and accounting system, copy-pasting it into the spreadsheet, then manually updating a dozen KPI calculations. Leadership waited 2–3 days for basic metrics like daily margin by category or inventory turnover.

One broken formula could quietly invalidate days of decision-making. There was no audit trail. Nobody knew which numbers were current. The spreadsheet had become both a bottleneck and a risk — and everyone knew it, but nobody had the bandwidth to fix it.

We replaced it with a real data pipeline in six weeks. No re-architecting of their business. No downtime. Just modern infrastructure that automated the manual work and made data accessible in real-time.

The Before State: The Pain

We want to paint this picture clearly, because if any of it sounds familiar, you're probably in the same boat:

What We Built

We implemented a standard modern data stack: extract, transform, load. Nothing exotic — just the right tools connected properly.

Here's the architecture:

Shopify
QuickBooks
Fivetran (Ingestion)
Snowflake (Warehouse)
dbt (Transformations)
Tableau (Dashboards)
Daily automated data flow · Real-time dashboards · Zero manual work

Week-by-Week: How the Six Weeks Actually Went

Week 1: Discovery and Data Audit. We mapped their data sources, documented the current spreadsheet logic (which was painful — some formulas were completely undocumented and we had to reverse-engineer what they were doing), and sketched out the target workflow. We also audited historical data for gaps and inconsistencies. By Friday, we had a spec everyone agreed on.

Week 2: Warehouse Setup and First Connectors. Provisioned Snowflake (Standard Edition, to keep costs down). Configured Fivetran connectors for Shopify and QuickBooks. By mid-week, transaction and accounting data was flowing into Snowflake daily. We validated the data against their spreadsheet to make sure nothing was getting lost or mangled in transit.

Weeks 3–4: dbt Transformations and Data Validation. This is where the real work happened. We built dbt models to join transaction and accounting data, calculate margin and COGS, aggregate by store and category and day. We added tests to catch data quality issues — missing values, out-of-range numbers, inconsistent joins. The business team reviewed the results and caught one edge case we'd missed: returns were being treated as negative sales instead of separate records. Fixed it in dbt in a day.

Week 5: Dashboard Build and Stakeholder Review. Built Tableau dashboards: daily sales and margin by store, by category, by product. A top-level executive dashboard with key metrics and trends. We invited leadership to a review session. They immediately asked for three custom views — margin by supplier, same-store week-over-week comparison, inventory value by location. All doable in Tableau. We added them in a few hours.

Week 6: Training and Handoff. Trained the analytics team on Tableau (for reporting) and the engineering team on dbt (for maintaining transformations). Documented the data dictionary — what each column means, where it comes from. Set up a Slack notification for failed data loads. By Friday, they were using the new system for all reports. The spreadsheet was officially retired.

The Results

The Unexpected Win

Within a month, the CEO noticed something: the supply chain team had started using the inventory value dashboard to plan purchases. They'd never done that with the spreadsheet because pulling the data was too slow. The new system didn't just replace old work — it enabled behaviors that weren't possible before.

What Made This Work

Is This Right for Your Business?

You're probably ready for a real data pipeline if any of these sound familiar:

Spreadsheets aren't bad — they're great for small data and quick analysis. But they don't scale, and they hide inefficiencies that compound over time. This retailer reclaimed 150+ hours per year, reduced errors to near-zero, and uncovered pricing insights that directly improved their margins. The cost was six weeks of focused work and about $800/month in ongoing infrastructure.

If your spreadsheet is starting to feel like a liability instead of a tool, it probably is.

For a deeper dive into the tools and architecture, read our full guide on the modern data stack for businesses that aren't Netflix. And if you're worried about what this will cost to run, our cloud cost optimization guide covers how to keep infrastructure lean. Learn more about our data engineering services.