czechthedata.com
Impressum
Hi, I’m Mike Czech 🎤 This is a collection of things I’ve learned.
Using PDB in Metaflow
Finding the Maximal Rectangle in Augmented Reality
How Boosted Decision Trees can Benefit from Language Models
Polars is_in vs. inner join
A Lightweight Vector DB with DuckDB and Cloud Run
Do you really need a distributed query engine?
Optimizing Data Loading and GPU Usage in PyTorch
SQL Query Testing with DuckDB and SQLGlot
Enhance Training Speed with Mixed Precision Training
Optional Sampling for Better Feedback Loops
Reducing Memory Requirements with Sparse Data Structures
Reconsidering Data Types for Efficiency
Columnar vs. Row-Based Storage: Key Differences and Use Cases
Enhancing Code Performance with Vectorization
Table Partitioning and Clustering Strategies
Implementing a Python Package Repository on GCP
Optimizing Resource Utilization in Batch Jobs on GKE
Concolic Testing With KLEE