Tuesday, October 4, 2011

Spell Corrector for Apropos

One of the new features I wrote for apropos is a basic yet reasonably effective spell corrector. While working on apropos, one big nuisance I noticed was wrongly spelled keywords in the query. Once you support full text search, I guess users expect spell correction to come along with it.

The search in apropos is based on the Boolean search model, which means it returns only those documents which contain all the keywords mentioned in the query. Misspell even one keyword and you will either get irrelevant results or no results at all. This behaviour contrasts with the way the conventional apropos did its search: it would return all the results which matched even one of the keywords.

The user might think "this new apropos is useless, it can't get me any right results" and would most likely start experimenting with different keywords, with mixed success. The point is, apropos should be clever enough to inform the user that one or more keywords in the query were probably misspelled, so the user doesn't waste time scratching his head.

Implementation Of The Spell Corrector: Writing an industrial-strength spell corrector (like those of Google, Bing, etc.) is a complex task and I have no idea about their intricacies. I was looking for a fairly basic implementation. I came across two articles which discuss the implementation of a relatively simple spell checker: one by Jon Bentley in his famous book Programming Pearls, and the second by Prof. Peter Norvig in his famous post "How to Write a Spelling Corrector". I decided to go with Peter Norvig's approach because of its simplicity and ease of implementation. Before continuing, I would like to thank Prof. Norvig for writing such an insightful article and sharing it :-)

I highly recommend reading Prof. Norvig's article to properly understand the maths and logic involved. Here I will give some insight into what his Python code is doing and then present my C translation of it, along with a short demo.

The idea is to find the dictionary word at the least edit distance from the word being checked. Edit distance here means the number of single-character edits (additions, removals, transpositions or replacements) needed to turn the given word into a correctly spelled one. Peter Norvig mentions in his post that edit distance 1 is sufficient in 80-95% of cases.

The strategy for finding words at edit distance 1 is very simple. Four different kinds of mistakes can lead to a misspelled word at edit distance 1. These are:
  1. Deletion: You missed a character while typing the word. For example: "speling".
  2. Transposition: You exchanged the positions of two adjacent characters in the word. For example: "teh" instead of "the".
  3. Replacement: You replaced a letter in the word with some other letter (possibly you pressed the wrong key on the keyboard). For example: "dapple" instead of "apple", or "produkt" instead of "product".
  4. Insertion: You entered one additional letter in the spelling of the word. For example: "filles" when you meant "files".
I will take a simple example and show all its possible permutations at edit distance 1. Let's say we misspelled "the" as "teh"; following are the different possible permutations:

deletes =  ['eh', 'th', 'te']

transpose =  ['eth', 'the']

#the replaces and inserts list is compacted but you get the idea
replaces =  ['aeh', 'beh', 'ceh', 'deh', 'eeh', 'feh', ..., 'zeh', 
                 'tah', 'tbh', 'tch', 'tdh', 'teh', 'tfh', ..., 'tzh',
                  'tea', 'teb', 'tec', 'ted', 'tee', 'tef', ..., 'tez']

inserts =  ['ateh', 'bteh', 'cteh', 'dteh', 'eteh', 'fteh', ..., 'zteh', 
                'taeh', 'tbeh', 'tceh', 'tdeh', 'teeh', 'tfeh', ..., 'tzeh', 
                'teah', 'tebh', 'tech', 'tedh', 'teeh', 'tefh', ..., 'tezh', 
                'teha', 'tehb', 'tehc', 'tehd', 'tehe', 'tehf', ..., 'tehz']


Once we have generated all these possible permutations of the word at edit distance 1, we check which of them are real, valid words in our dictionary. It is quite possible that more than one of these permutations is a valid dictionary word, in which case we pick the word which occurs most frequently in the sample corpus used for building the dictionary (this corpus is the training model for the spell corrector).

I suppose that explains what we need to do. Now time for some code:
NOTE: The following is a C implementation of Peter Norvig's spell corrector. It is written by me from scratch and is part of the apropos_replacement project, licensed under the two-clause BSD license.

/*-
 * Copyright (c) 2011 Abhinav Upadhyay <er.abhinav.upadhyay@gmail.com>
 * All rights reserved.
 *
 * Redistribution and use in source and binary forms, with or without
 * modification, are permitted provided that the following conditions
 * are met:
 *
 * 1. Redistributions of source code must retain the above copyright
 * notice, this list of conditions and the following disclaimer.
 * 2. Redistributions in binary form must reproduce the above copyright
 * notice, this list of conditions and the following disclaimer in
 * the documentation and/or other materials provided with the
 * distribution.
 *
 * THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
 * ``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
 * LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS
 * FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE
 * COPYRIGHT HOLDERS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT,
 * INCIDENTAL, SPECIAL, EXEMPLARY OR CONSEQUENTIAL DAMAGES (INCLUDING,
 * BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
 * LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED
 * AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
 * OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT
 * OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
 * SUCH DAMAGE.
 */

/*
 * A set holds one split of the word: the first i characters in a,
 * the remaining characters in b.
 */
typedef struct set {
	char *a;
	char *b;
} set;

static char **
edits1(char *word)
{
	int i;
	int len_a;
	int len_b;
	int counter = 0;
	char alphabet;
	int n = strlen(word);
	set splits[n + 1];

	/* Calculate the number of possible permutations and allocate memory */
	size_t size = n + n - 1 + 26 * n + 26 * (n + 1);
	char **candidates = emalloc(size * sizeof(char *));

	/* Start by generating a split up of the characters in the word */
	for (i = 0; i < n + 1; i++) {
		splits[i].a = emalloc(i + 1);
		splits[i].b = emalloc(n - i + 1);
		memcpy(splits[i].a, word, i);
		memcpy(splits[i].b, word + i, n - i + 1);
		splits[i].a[i] = 0;
	}

	/*
	 * Now generate all the permutations at a maximum edit distance of 1.
	 * counter keeps track of the index in candidates at which the next
	 * permutation needs to be stored.
	 */
	for (i = 0; i < n + 1; i++) {
		len_a = strlen(splits[i].a);
		len_b = strlen(splits[i].b);
		assert(len_a + len_b == n);

		/* Deletes */
		if (i < n) {
			candidates[counter] = emalloc(n);
			memcpy(candidates[counter], splits[i].a, len_a);
			if (len_b - 1 > 0)
				memcpy(candidates[counter] + len_a,
				    splits[i].b + 1, len_b - 1);
			candidates[counter][n - 1] = 0;
			counter++;
		}

		/* Transposes (i < n - 1 guarantees len_b >= 2) */
		if (i < n - 1) {
			candidates[counter] = emalloc(n + 1);
			memcpy(candidates[counter], splits[i].a, len_a);
			candidates[counter][len_a] = splits[i].b[1];
			candidates[counter][len_a + 1] = splits[i].b[0];
			if (len_b >= 2)
				memcpy(candidates[counter] + len_a + 2,
				    splits[i].b + 2, len_b - 2);
			candidates[counter][n] = 0;
			counter++;
		}

		/* For replaces and inserts, run a loop from 'a' to 'z' */
		for (alphabet = 'a'; alphabet <= 'z'; alphabet++) {
			/* Replaces */
			if (i < n) {
				candidates[counter] = emalloc(n + 1);
				memcpy(candidates[counter], splits[i].a, len_a);
				candidates[counter][len_a] = alphabet;
				if (len_b - 1 >= 1)
					memcpy(candidates[counter] + len_a + 1,
					    splits[i].b + 1, len_b - 1);
				candidates[counter][n] = 0;
				counter++;
			}

			/* Inserts */
			candidates[counter] = emalloc(n + 2);
			memcpy(candidates[counter], splits[i].a, len_a);
			candidates[counter][len_a] = alphabet;
			if (len_b >= 1)
				memcpy(candidates[counter] + len_a + 1,
				    splits[i].b, len_b);
			candidates[counter][n + 1] = 0;
			counter++;
		}
	}

	/* The splits are no longer needed */
	for (i = 0; i < n + 1; i++) {
		free(splits[i].a);
		free(splits[i].b);
	}
	return candidates;
}

/*
 * known_word--
 * Takes an array of strings and returns the one with the maximum frequency
 * in the dictionary. If no word from the array is found in the dictionary,
 * it returns NULL.
 * NOTE: The dictionary in our case is a table in the db with two columns:
 *       term, occurrences
 */
static char *
known_word(sqlite3 *db, char **list, int n)
{
	int i, rc;
	char *sqlstr;
	char *termlist = NULL;
	char *correct = NULL;
	sqlite3_stmt *stmt;

	/*
	 * Build termlist: a comma separated list of all the words in list,
	 * for use in the SQL query later.
	 */
	int total_len = BUFLEN * 20;	/* total bytes allocated to termlist */
	int offset = 0;			/* next byte to write at in termlist */
	termlist = emalloc(total_len);
	termlist[offset++] = '(';

	for (i = 0; i < n; i++) {
		int d = strlen(list[i]);
		/* Grow termlist if the next word might not fit */
		if (total_len - offset < d + 3) {
			total_len *= 2;
			termlist = erealloc(termlist, total_len);
		}
		termlist[offset++] = '\'';
		memcpy(termlist + offset, list[i], d);
		offset += d;

		if (i == n - 1) {
			termlist[offset++] = '\'';
		} else {
			memcpy(termlist + offset, "\',", 2);
			offset += 2;
		}
	}
	if (total_len - offset >= 2)
		memcpy(termlist + offset, ")", 2);	/* includes the NUL */
	else
		concat(&termlist, ")", 1);

	easprintf(&sqlstr, "SELECT term FROM metadb.dict WHERE "
	    "occurrences = (SELECT MAX(occurrences) FROM metadb.dict "
	    "WHERE term IN %s) AND term IN %s", termlist, termlist);
	rc = sqlite3_prepare_v2(db, sqlstr, -1, &stmt, NULL);
	if (rc != SQLITE_OK) {
		warnx("%s", sqlite3_errmsg(db));
		free(sqlstr);
		free(termlist);
		return NULL;
	}

	if (sqlite3_step(stmt) == SQLITE_ROW)
		correct = strdup((char *) sqlite3_column_text(stmt, 0));

	sqlite3_finalize(stmt);
	free(sqlstr);
	free(termlist);
	return correct;
}

static void
free_list(char **list, int n)
{
	int i;

	if (list == NULL)
		return;

	for (i = 0; i < n; i++)
		free(list[i]);
	free(list);
}

/*
 * spell--
 * The API exposed to the user. Returns the most closely matched word from the
 * dictionary. It first searches for all possible words at edit distance 1; if
 * no matches are found, it goes further and looks for words at edit distance 2
 * as well. If no matches are found at all, it returns NULL.
 */
char *
spell(sqlite3 *db, char *word)
{
	int i;
	char *correct;
	char **candidates;
	int count2 = 0;
	char **cand2 = NULL;
	int n;
	int count;

	lower(word);

	/* If the word already exists in the dictionary, no need to go further */
	correct = known_word(db, &word, 1);

	if (!correct) {
		n = strlen(word);
		count = n + n - 1 + 26 * n + 26 * (n + 1);
		candidates = edits1(word);
		correct = known_word(db, candidates, count);
		/*
		 * No matches found? Go further and look for matches at edit
		 * distance 2. To keep the search fast we use a heuristic:
		 * take one word at a time from candidates, generate its
		 * permutations and look for a match, exiting the loop as soon
		 * as one is found. This works reasonably fast, but the
		 * accuracy is not quite there in some cases.
		 */
		if (correct == NULL) {
			for (i = 0; i < count; i++) {
				n = strlen(candidates[i]);
				count2 = n + n - 1 + 26 * n + 26 * (n + 1);
				cand2 = edits1(candidates[i]);
				if ((correct = known_word(db, cand2, count2)))
					break;
				free_list(cand2, count2);
				cand2 = NULL;
			}
		}
		free_list(candidates, count);
		free_list(cand2, count2);
	}

	return correct;
}

Demo:
Following are some sample runs of apropos:

$ ./apropos "funckiton for coping stings"
Did you mean "function for copying strings" ?
$ ./apropos "generat termcap databse"
Did you mean "generate termcap database" ?
$ ./apropos idcmp
Did you mean "icmp" ?
$ ./apropos "confguire kernal"
Did you mean "configure kernel" ?
$ ./apropos "packate fillter"
Did you mean "package filter" ?
$ ./apropos reeltek
Did you mean "realtek" ?

Following are some screenshots of apropos_cgi (a CGI version of apropos for browsers):

Further Scope: There are a few technical glitches in integrating this spell corrector with apropos, so those need to be sorted out. The suggestions are not always as expected, so the model for the spell corrector probably needs to be fine-tuned (like Peter Norvig discussed at the end of his article). And while writing this post, it occurred to me that this implementation could make a fine small-scale backend for an auto completion feature in a web application (for example, the apropos CGI above). ;-)

All this code is in the demo-spell and exp-spell branches of the project on GitHub.

I am not sure if anyone would read this far, but thanks anyway for reading and taking interest. :-)

Monday, October 3, 2011

Improvements to makemandb

Over a month has passed since GSoC finished, and in that time I made some improvements and introduced new (experimental) features in apropos. I wanted to write about a few of the things I did in the last month.

Indexing Additional Metadata For Faster Update Operations: Previously makemandb maintained md5 hashes of all the pages indexed. On each run, makemandb would read all the man pages, generate their md5s and compare those with the md5 hashes it already had in its index. It would then parse and store the pages whose md5 hash it did not find in the database, these being the new or modified pages in need of (re)indexing.
    Joerg pointed out that this wasn't a very efficient approach: it required unnecessarily reading all the man pages. He suggested indexing more metadata about the man page files, like the mtime, the device id and the inode number. So rather than reading the pages and generating their md5s, makemandb does a stat(2) on them, reads their {device id, inode, mtime} triplet and checks whether a matching triplet exists in the database to decide whether a page needs to be (re)indexed. This is a more efficient approach when updating the index after installing some new man pages or updating a few existing ones, though when building the index from scratch, doing a stat(2) on every page just proves to be a roadblock.

Faster makemandb With Clever Memory Management: Due to the above mentioned changes, makemandb's runtime had more than doubled. Earlier makemandb could build an index for 8000+ pages in 30-40 seconds, but now it was taking 130-150 seconds to do the same job. The changes which made makemandb slow were necessary and could not be undone, so I had to identify other areas where it could do better.
    As it turns out, makemandb was managing memory very poorly. One operation it performs very frequently is concatenating two strings: one containing previously parsed data from the man page, the other containing newly parsed data. This kind of string manipulation is always tedious in C. The most straightforward way is to call realloc(3) to allocate sufficient space to hold the contents of the new string, and then copy the new string to the end of the old one. I had a function concat() which did just that. An average-length man page could easily trigger over 100 calls to concat(), and for 8000+ pages this added up to a very large number of malloc/realloc calls; worse, as the string containing already parsed data grows, the realloc calls get even more expensive. Clearly this was the bottleneck which needed to be fixed.

Solution: The solution was very simple. Instead of allocating memory every time a new string needs to be concatenated, pre-allocate a large chunk of memory and keep writing to it until you run short of space, in which case reallocate another large chunk and proceed as usual. This reduces the number of allocation calls for a single page from 100+ to around 10.

/*
 * A data structure for holding section specific data.
 */
typedef struct secbuff {
	char *data;
	int buflen;	/* total length of the buffer allocated initially */
	int offset;	/* position of the next byte to write at */
} secbuff;

static void
append(secbuff *sbuff, const char *src, int srclen)
{
	assert(src != NULL);
	if (srclen == -1)
		srclen = strlen(src);

	if (sbuff->data == NULL) {
		sbuff->data = emalloc(sbuff->buflen);
		sbuff->offset = 0;
	}

	/* Double the buffer if we are about to run out of space */
	while ((srclen + 2) >= (sbuff->buflen - sbuff->offset)) {
		sbuff->buflen *= 2;
		sbuff->data = erealloc(sbuff->data, sbuff->buflen);
	}

	/* Separate the new data from the existing contents with a space */
	if (sbuff->offset) {
		memcpy(sbuff->data + sbuff->offset, " ", 1);
		sbuff->offset++;
	}

	/* Now copy src at the end of the buffer */
	memcpy(sbuff->data + sbuff->offset, src, srclen);
	sbuff->offset += srclen;
}
The secbuff data structure keeps track of the offset in the data buffer at which the next byte needs to be written. This way, I could allocate a sufficiently large chunk of memory up front and simply memcpy new data at its end. This approach brought large speed improvements to makemandb: the runtime dropped from 130+ seconds to around 45 seconds.