Blog posts

2023

LLM Theory

less than 1 minute read

Published:

Awesome LLM theory articles I chanced upon

Simple tricks to improve Retrieval Augmented Generation (RAG) systems

2 minute read

Published:

Motivation

Retrieval Augmented Generation (RAG) is a framework for retrieving facts from an external knowledge base to ground large language models (LLMs) on the most accurate, up-to-date information. RAG is increasingly popular in industry as it’s simple to implement yet powerful. Here I’ll share some tricks to improve RAG systems.

Simple solution to training causal LMs with seq2seq objective

2 minute read

Published:

Motivation

How do we train causal language models (e.g. Alpaca, LLaMA, gpt-neox-20b…) with seq2seq objective? This goal is important because we want to instruction-tune our causal LMs, especially since Alpaca is the best open model at time of writing.

2022

Engineering Tricks

2 minute read

Published:

Compilation of tricks I find useful.

Undergraduate Study Tips

2 minute read

Published:

Motivation

With yesterday’s results release, I have finally graduated! Having been tested for many years of my life, here are some study tips I found useful for my own undergraduate studies. I hope they come in useful for a junior out there :).

2021

Paper Reviews

32 minute read

Published:

Motivation

As I step into this world of research, I realised I’m still in the midst of discovering what I’m truly interested in. To this end, I hope to read papers solely because I’m interested in it (instead of, say, to do literature review for my current research).