Jake Watson

Principal Data Engineer & Bookworm

Is Microsoft Purview The Answer to Modern Data Governance?

In today’s data-driven world, effective governance is crucial for managing vast volumes of information while ensuring security and compliance. Microsoft Purview promises a robust solution to modern data governance challenges, offering seamless integration with Microsoft’s ecosystem. As businesses increasingly rely on data to drive decision-making, tools like Purview help streamline data cataloguing, classification, and protection. …

Should you use Data Lakehouse instead of a Data Warehouse and / or Data Lake? 

When using your Data Platform to improve your Business Intelligence with useful dashboards and reports, you’ll more than likely want to use a Data Warehouse. Add on your data science builds, and storing your raw data cheaply, plus adding a…

Should you build or buy your data platform?

“Build Versus Buy” is More a Scale of Options than a Binary Decision. As consultants, we’ve been involved in helping our clients decide which software is best to invest in (or, in some cases, not to invest in). This is quite a responsibility as we are putting our reputation in the hands of a vendor,…

Why Invest in Data Quality?

This can seem like a rhetorical question: you should always invest in Data Quality! But we are still not investing enough: surveys show Data Quality issues are increasing in most organisations and, on average, take up 34% of a Data Engineer’s time instead of them creating value by adding new features. This increases to 50% in large…

How to create a secure Azure Data Platform

There are many methods you can use in Azure to secure your data platform and the data it contains. The security controls that will be most effective for each data platform differ based on the usage of the platform, the data sources for the platform and many other factors; having a holistic view of…

What are the challenges of building a data platform?

When building a data platform for your business, you need to anticipate and plan for any potential challenges you may encounter. That’s where a data engineering specialist can help.  With decades of data experience behind us, we’ve seen and dealt with practically every problem you might encounter during data platform development. In this guide, we’ll…

What is a data platform?

In today’s digitised business landscape, the ability to collect, analyse, and manage data effectively can be the critical difference between a business’s success and failure. But what exactly enables businesses to harness the full potential of their data? Enter the customer data platform. In this blog, we’ll discuss everything you need to know about data…

Does Microsoft Purview solve the Data Governance Challenge?

Microsoft Purview: Data Governance for the cloud, on-premises, multi-cloud and Office 365 workloads. Most organisations are exploding with data that has been collected, transformed, and reported on, but this data is often not well tracked as the organisation becomes more data-driven, increasing two pain points that have been growing for the last few decades: How…

Taming your data assets with Databricks

Safely, securely, and efficiently handling data at any scale is challenging. Here at Oakland, we’ve had years of experience helping complex organisations tame their vast data assets to draw meaningful insights from them. These years of experience and the fact we are passionately tech-agnostic enable us to recommend the right tool for the job. One…

Prefect: Should you utilise the next generation of data pipelining software?

What is data pipelining, and why does it matter? Data pipelining, or orchestration, is an everyday activity performed by companies to move their data from one place, likely the primary storage location, to another, such as a cloud-based data lake, often including transformations during this process. Whilst this is a standard operational activity, the number…

Replicating Data Warehouse with Databricks Lakehouse

In a recent post, we outlined what a Databricks Lakehouse encompasses and why you may want to utilise one rather than a Data Warehouse and/or Data Lake. Following that introduction, this post will walk through an example use case, including code, for how the Databricks Lakehouse replicates some common warehousing patterns while having the…

The Pros and Cons of using Data Lakehouse

When using your Data Platform to improve your Business Intelligence with useful dashboards and reports, you’ll more than likely want to use a Data Warehouse. Add on your data science builds and storing your raw data cheaply, plus adding a Data Lake just for good measure, and the costs soon start adding up. Running both…