Hi, I’m Mike Czech 🎤 This is a collection of things I’ve learned.

Using PDB in Metaflow

March 23, 2025 · 1 min · 138 words · Me

Finding the Maximal Rectangle in Augmented Reality

March 16, 2025 · 2 min · 272 words · Me

How Boosted Decision Trees can Benefit from Language Models

March 13, 2025 · 5 min · 981 words · Me

Polars is_in vs. inner join

March 3, 2025 · 1 min · 125 words · Me

A Lightweight Vector DB with DuckDB and Cloud Run

March 2, 2025 · 3 min · 518 words · Me

Do you really need a distributed query engine?

October 11, 2024 · 2 min · 362 words · Me

Optimizing Data Loading and GPU Usage in PyTorch

March 20, 2024 · 3 min · 445 words · Me

SQL Query Testing with DuckDB and SQLGlot

January 12, 2024 · 2 min · 365 words · Me

Enhance Training Speed with Mixed Precision Training

December 20, 2023 · 2 min · 353 words · Me

Optional Sampling for Better Feedback Loops

October 4, 2023 · 3 min · 481 words · Me

Reducing Memory Requirements with Sparse Data Structures

June 11, 2023 · 2 min · 317 words · Me

Reconsidering Data Types for Efficiency

September 20, 2022 · 2 min · 242 words · Me

Columnar vs. Row-Based Storage: Key Differences and Use Cases

March 20, 2022 · 2 min · 347 words · Me

Enhancing Code Performance with Vectorization

January 22, 2022 · 2 min · 351 words · Me

Table Partitioning and Clustering Strategies

November 4, 2020 · 2 min · 283 words · Me

Implementing a Python Package Repository on GCP

September 13, 2020 · 4 min · 832 words · Me

Optimizing Resource Utilization in Batch Jobs on GKE

April 15, 2019 · 3 min · 567 words · Me

Concolic Testing With KLEE

July 8, 2014 · 3 min · 452 words · Me