Technical Blog
Writings on MLOps, machine learning, software engineering, and programming.
For personal writings on Afghanistan, books, languages, and life, see the personal blog.
Trying to instrument an agentic app with Arize Phoenix and litellm
llms
agents
evals-course
evaluation
miniproject
hinbox
Testing out instrumenting LLM tracing for litellm with Braintrust and Langfuse
llms
agents
evals-course
evaluation
miniproject
hinbox
Building hinbox: An agentic research tool for historical document analysis
llms
agents
evals-course
evaluation
miniproject
hinbox
research
How to think about evals
evals-course
llms
llmops
evaluation
Learnings from a week of building with local LLMs
claude
llm
llms
miniproject
openai
prompt-engineering
softwareengineering
tools
Building an MCP Server for Beeminder: Connecting AI Assistants to Personal Data
tools
anthropic
claude
miniproject
Tinbox: an LLM-based document translation tool
translation
llm
llms
languages
research
miniproject
python
tools
Starting the Hugging Face Agents course
agents
huggingface
skillbuilding
llmops
llms
AI Engineering Architecture and User Feedback
books-i-read
llm
llms
llmops
evaluation
Notes on ‘AI Engineering’ chapter 9: Inference Optimisation
books-i-read
inference
llm
llms
hardware
Notes on ‘AI Engineering’ (Chip Huyen) chapter 7: Finetuning
books-i-read
finetuning
llm
llms
Notes on ‘AI Engineering’ (Chip Huyen) chapter 4
books-i-read
llm
llms
evaluation
Notes on ‘AI Engineering’ (Chip Huyen) chapter 1
books-i-read
llm
llms
finetuning
prompt-engineering
Final notes on ‘Prompt Engineering for LLMs’
llm
prompt-engineering
books-i-read
evaluation
Assembling the Prompt: Notes on ‘Prompt Engineering for LLMs’ ch 6
llm
prompt-engineering
books-i-read
Prompt Content: Notes on ‘Prompt Engineering for LLMs’ ch 5
llm
prompt-engineering
books-i-read
RAG
Starting to read Prompt Engineering for LLMs
llm
prompt-engineering
books-i-read
tokenisation
My finetuned models beat OpenAI’s GPT-4
nlp
afghanistan
llms
miniproject
finetuning
isafpr
evaluation
How to think about creating a dataset for LLM finetuning evaluation
llms
finetuning
isafpr
afghanistan
datasets
evaluation
miniproject
Evaluating the Baseline Performance of GPT-4-Turbo for Structured Data Extraction
nlp
afghanistan
datalabelling
llms
isafpr
miniproject
evaluation
Structured Data Extraction for ISAF Press Releases with Instructor
nlp
afghanistan
datalabelling
isafpr
llms
miniproject
Introducing the Afghanwire Dataset: A Unique Collection of Translated Afghan Media Articles from 2006-2009
miniproject
afghanistan
datalabelling
datasets
nlp
llms
isafpr
Publishing the ISAF Press Releases dataset
miniproject
afghanistan
datalabelling
datasets
nlp
llms
Automating database backups with Tarsnap
databases
skillbuilding
softwareengineering
tools
miniproject
Building MathsPrompt: a tool to help me review and practice problems for my degree
openai
llms
mathematics
rust
mu123
q31
skillbuilding
softwareengineering
tools
miniproject
Terraform Input Variables
terraform
devops
softwareengineering
Tokenizer Links
nlp
balochi-language-model
tokenisation
links
Tokenizing Balochi with HuggingFace’s Tokenizer and FastAI/Spacy
nlp
balochi-language-model
tokenisation
balochi
The Risks of Language Models in Minority Languages
balochi
nlp
balochi-language-model
deep-learning
ethics
Low-resource language models: making a start with Balochi
balochi
nlp
balochi-language-model
deep-learning
Exponents and Logarithms: a MU123 review
mathematics
mu123
q31
Terraform for the Uninitiated: Demystifying Your First Codebase
terraform
softwareengineering
devops
How to remove a commit (or two) from your git branch
git
softwareengineering
versioncontrol
Tricking my digits classifier with diffusion
computervision
fastai
parttwo
Deep learning tricks all the way down, with a bit of mathematics for good measure
computervision
fastai
parttwo
Storing Bytes: what data serialisation is and why you need it for machine learning
redactionmodel
computervision
mlops
python
tools
zenml
It takes a tribe: how I’m thinking about putting my object detection model into production
tools
redactionmodel
computervision
mlops
More Data, More Problems: Using DVC to handle data versioning for a computer vision problem
tools
redactionmodel
computervision
mlops
Redaction Image Classifier: NLP Edition
fastai
nlp
partone
Some foundations for machine learning with PyTorch
fastai
computervision
partone
A painless way to create an MVP demo using computer vision models
fastai
computervision
redactionmodel
tools
How my pet cat taught me a lesson about validation data for image classification
fastai
computervision
partone
How to trust the data you feed your model: alternative data validation solutions in a computer vision context (part 3)
tools
redactionmodel
computervision
datavalidation
How to trust the data you feed your model: data validation with Great Expectations in a computer vision context (part 2)
tools
redactionmodel
computervision
datavalidation
How to trust the data you feed your model: data validation with Great Expectations in a computer vision context (part 1)
tools
redactionmodel
computervision
datavalidation
‘I guess this is what data-centric AI is!’: Performance boosts after training with synthetic data
tools
redactionmodel
computervision
Starting Docker In A Month Of Lunches
tools
dockerinamonthoflunches
books-i-read
Figuring out why my object detection model is underperforming with FiftyOne, a great tool you probably haven’t heard of
redactionmodel
computervision
tools
debugging
jupyter
It’s raining bboxes: how I wrote a Python script to create 2097 synthetic images to help improve my machine learning model
redactionmodel
computervision
python
tools
What are invariants and how can they help make your Python classes more robust?
robustpython
python
books-i-read
Upgrade your Python dicts with data classes
robustpython
python
books-i-read
Using mypy for Python type checking
robustpython
python
books-i-read
Different ways to constrain types in Python
robustpython
python
books-i-read
Learning about ‘nbdev’ while building a Python package for PDF machine learning datasets
python
jupyter
fastai
tools
Getting practical with type annotations and mypy
robustpython
python
books-i-read
What makes code robust?
robustpython
python
books-i-read
73% accuracy for redaction object detection
redactionmodel
computervision
progressreport
What is VFNet?
redactionmodel
computervision
How to annotate image data for object detection with Prodigy
redactionmodel
computervision
datalabelling
Launching a podcast about MLOps
zenml
podcast
appearances
Six problems TFX was trying to solve in 2017
tfx
tensorflow
google
mlops
papers-i-read
Training a classifier to detect redacted documents with fastai
fastai
redactionmodel
computervision
datalabelling
No matching items