Designing A Data Platform From Scratch

Building a data platform is a complex journey that needs extensive planning to be successful. It needs knowledge of the available technologies, the operating environment’s requirements, and the stakeholders’ expectations. In this episode, the show’s host, Tobias Macey, reflects on his plans for building a data platform and what he’s learned from running the podcast that is influencing his decisions.

 

 

Your data platform must be scalable, fault-tolerant, and performant, which means your cloud provider must be as well. Linode has been powering production systems for more than 17 years, and they have now launched a fully managed Kubernetes platform. You have everything you need to build a bulletproof data pipeline with the combined power of the Kubernetes engine for flexible and scalable deployments, as well as features like dedicated CPU instances, GPU instances, and object storage. If you go to dataengineeringpodcast.com/linode today, you’ll also receive a $100 credit to use on building your cluster, object storage, and reliable backups.

 

TimescaleDB is the leading open-source relational database that supports time-series data, which is time-stamped and can be used to track how a system changes. TimescaleDB combines PostgreSQL’s familiarity with the speed and petabyte-scale required to handle such unending data.

Source link