Healthcare companies across the industry are jumping on the trend of creating data science teams, but many of them are jumping prematurely.
How to populate your staging environment with data while keeping user data secure.
Learn the difference between a Table and a Stream in Kafka and when to use them.
Make your job queue more performant, robust, and understandable by breaking it up into a pipeline of messages.
We’ve been doing more data engineering for clients lately. Every time I say we’re doing that, I have to explain what it is and what we’re doing! A brief glossary to describe data engineering and some common practices and technologies in the data engineering ecosystem.
Use interval
and date_trunc
to make working with SQL dates easier.
Write testable reports in SQL that don’t erode over time.
What do you do when indexes aren’t enough?
Take a guided tour through a query plan for a “simple” SQL query.
A discussion about the reasons for using Active Record migrations vs. temporary rake tasks for data manipulations.