Jared Ko – Senior Data Engineer

WordPress site created April 27, 2023.. Lots more to come!

About Me

Over 15 years as a database developer and 5 years as a data engineer, I’ve been working with data my entire career. Starting as a teenager, using Filemaker Pro to help my father run his distributing company, one of my first paying roles was to write an Access database using Visual Basic to download automated fuel consumption/levels for a gasoline supplier.

I later progressed to Microsoft SQL Server, working as a database developer for about 12 years, specializing in high-throughput OLTP work in SQL Server (ahem… as “high-throughput” as you can be in an undistributed environment). Seeing the market shift away from this technology, I shifted my specialty to data warehousing.

Fortunately, I was able to make natural transitions from database developer to data engineer picking up new skills along the way. A project replicating SQL Server warehouse data to Redshift was a great introduction to AWS technologies that had me anxious to learn new technologies to become a successful data engineer.

Missing from that “natural transition” was exposure to “big data” technologies like Apache Airflow, Spark, Kafka, etc., so I enrolled in UW Big Data Engineering certificate program. This nine-month course provided intensive exposure to the areas I’d been missing and helped lead to my job with Twich, which used many of the things I had just been learning.

Starting up a Homelab

Seeking a way to try out new data engineering technologies, I started building out a homelab in April 2023. The picture on the right is the beginning of that lab.

Starting at the top is a 16-port KVM switch I purchased a couple of years back. Below it, my former home theater PC, waiting to be repurposed.

At the middle of the rack is my NAS, running Truenas Core for the past 10 years (and going through many upgrades).

The bottom half represents the start of my homelab: Currently five Lenovo M900s (32GB RAM each) and an HP EliteDesk 800 (16GB RAM). Each box has hard drives capacities ranging from 100GB to 1TB and they all run Ubuntu Linux.

Not pictured are two repurposed Mac Minis. The first runs MAAS (Metal as a Service) for on-the-fly provisioning/take-down of machine images and VMs.

The second Mac mini is being used as a config/credentials server running Hashicorp Vault. TLDR reasons for provisioning this way is Macs don’t allow PXE (boot/install over network), so they seemed ideal to hosting services I needed to keep in place.

Work in Progress!

My homelab and my career as a data engineer are an ongoing work in progress. Read more about my ongoing adventures in setting up my homelab here!