LLM Theory
Awesome LLM theory articles I chanced upon.
A compilation of tricks I find useful.
With yesterday’s results release, I have finally graduated! Having been tested for many years of my life, I’d like to share some study tips that served me well during my undergraduate studies. I hope they come in handy for a junior out there :).
Need access to NUS computing resources but not sure how to get it? Here’s a quick crash course!
As I’m learning more about 3D parallelism, I wondered: suppose every device takes in a batch of tensors, and the tensor sizes differ across devices. Will 3D parallelism still work? It turns out data and pipeline parallelism handle this fine, but tensor parallelism needs some extra work.
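To make the shape constraint concrete, here is a minimal single-process sketch in plain PyTorch (no torch.distributed; the sizes and the two-way split are illustrative assumptions, not the post’s setup) of why ragged per-device batches are fine for data parallelism but break tensor parallelism.

```python
# Single-process illustration: why variable batch sizes are fine for data
# parallelism but problematic for tensor parallelism. All sizes are toy values.
import torch

hidden = 8
weight = torch.randn(hidden, hidden)

# --- Data parallelism: each "device" gets a different batch size. ---
# Gradients are averaged per parameter, and parameter shapes never depend on
# the batch size, so ragged batches across devices pose no problem.
batches = [torch.randn(2, hidden), torch.randn(5, hidden)]  # different sizes
grads = []
for x in batches:
    w = weight.clone().requires_grad_(True)
    (x @ w).sum().backward()
    grads.append(w.grad)
avg_grad = torch.stack(grads).mean(dim=0)
assert avg_grad.shape == weight.shape  # same shape on every device

# --- Tensor parallelism: the *weight* is sharded across devices. ---
# With column parallelism, every shard must see the SAME input activations,
# because the sharded partial outputs are concatenated (or all-reduced, for
# row parallelism) to reconstruct the full layer output.
shards = weight.chunk(2, dim=1)            # each "device" holds hidden x hidden/2
x = torch.randn(3, hidden)                 # must be identical on both TP ranks
partial_outputs = [x @ shard for shard in shards]
full_output = torch.cat(partial_outputs, dim=1)
assert full_output.shape == (3, hidden)
# If the two shards received inputs with different batch sizes, the concat
# (or all-reduce) would fail: the partial outputs would no longer line up.
```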
Retrieval Augmented Generation (RAG) is a framework for retrieving facts from an external knowledge base to ground large language models (LLMs) in the most accurate, up-to-date information. RAG is increasingly popular in industry because it’s simple to implement yet powerful. Here I’ll share some tricks for improving RAG systems.
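As a rough picture of the retrieve-then-generate loop, here is a self-contained sketch; the bag-of-words similarity, the toy knowledge base, and the llm() stub are stand-ins for a real embedding model and LLM call, not anything from the post.

```python
# Minimal RAG sketch: embed the query, retrieve the most similar documents,
# and stuff them into the prompt before calling the (stubbed) LLM.
from collections import Counter
import math

KNOWLEDGE_BASE = [
    "RAG retrieves documents from an external knowledge base.",
    "Retrieved context is prepended to the prompt to ground the LLM.",
    "Pipeline parallelism splits a model's layers across devices.",
]

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; a real system would use a dense encoder.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, k: int = 2) -> list[str]:
    q = embed(query)
    return sorted(KNOWLEDGE_BASE, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def llm(prompt: str) -> str:
    # Hypothetical stand-in for a real model call.
    return f"<answer grounded in: {prompt[:60]}...>"

def rag_answer(question: str) -> str:
    context = "\n".join(retrieve(question))
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
    return llm(prompt)

print(rag_answer("How does RAG ground an LLM?"))
```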
If we are allowed to train to convergence, we know that full finetuning beats parameter-efficient finetuning (PEFT). But what if we have a fixed compute budget? For the same budget, PEFT can go through significantly more tokens. Will full finetuning still come out ahead?
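For a sense of the gap behind that question, here is a minimal LoRA-style layer in plain PyTorch (the 4096-dimensional layer and rank 8 are illustrative assumptions) showing how small PEFT’s trainable footprint is next to the full weight matrix.

```python
# Minimal LoRA-style adapter: freeze the base weight, train only two low-rank
# factors, and compare trainable vs. total parameter counts.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, d_in: int, d_out: int, rank: int = 8):
        super().__init__()
        self.base = nn.Linear(d_in, d_out)
        self.base.weight.requires_grad_(False)   # frozen pretrained weight
        self.base.bias.requires_grad_(False)
        self.lora_a = nn.Parameter(torch.randn(rank, d_in) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(d_out, rank))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + x @ self.lora_a.T @ self.lora_b.T

layer = LoRALinear(4096, 4096, rank=8)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
print(f"trainable: {trainable:,} / {total:,} ({trainable / total:.2%})")
# For a 4096x4096 layer at rank 8, well under 1% of the weights are trained;
# that parameter gap is what lets PEFT stretch a fixed budget over more tokens.
```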
How do we train causal language models (e.g. Alpaca, LLaMA, gpt-neox-20b…) with a seq2seq objective? This matters because we want to instruction-tune our causal LMs, especially since Alpaca is the best open model at the time of writing.
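One common recipe, sketched below with made-up toy token IDs, is to concatenate prompt and response into a single causal sequence and mask the prompt positions out of the loss:

```python
# Seq2seq-style objective on a causal LM: compute loss only on response tokens
# by setting prompt positions to ignore_index=-100.
import torch
import torch.nn.functional as F

prompt_ids = torch.tensor([101, 7, 42, 9])       # instruction tokens (toy values)
response_ids = torch.tensor([55, 13, 88, 102])   # target tokens (toy values)

input_ids = torch.cat([prompt_ids, response_ids])
labels = input_ids.clone()
labels[: len(prompt_ids)] = -100                 # no loss on the prompt

# Pretend logits from a causal LM: (seq_len, vocab_size).
vocab_size = 128
logits = torch.randn(len(input_ids), vocab_size)

# Standard causal shift: position t predicts token t+1.
loss = F.cross_entropy(
    logits[:-1],          # predictions for positions 0..T-2
    labels[1:],           # targets shifted left by one
    ignore_index=-100,    # prompt positions contribute nothing to the loss
)
print(loss)
```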
Just my personal notes!
As I step into this world of research, I’ve realised I’m still in the midst of discovering what I’m truly interested in. To this end, I hope to read papers solely because I’m interested in them (instead of, say, to do a literature review for my current research).