Skills & Learning Resources
This is a living list of the tools I’m learning and using in my bioinformatics work, along with links to the most helpful resources I’ve found. For each tool, I note what it is for and how I use it in practice.
Whether you’re just starting out or brushing up, I hope this helps you on your own learning path.
Programming & Analysis
Python
Used for building data pipelines, analyzing text, and creating interactive tools with Streamlit.
🔗 Python Crash Course (Free Book)
R & Statistical Learning
Working through supervised learning methods in R, following ISLR (An Introduction to Statistical Learning). Topics include regression, classification, bootstrapping, and more.
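To make the bootstrapping idea concrete, here is a minimal sketch of a percentile bootstrap confidence interval. It is written in Python rather than R, and the function name `bootstrap_ci`, its parameters, and the sample values are all made up for illustration, not taken from ISLR.

```python
import random
from statistics import mean

def bootstrap_ci(data, stat=mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for `stat` (hypothetical helper)."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(data)
    # Resample with replacement, recompute the statistic each time, then sort.
    estimates = sorted(
        stat(rng.choices(data, k=n)) for _ in range(n_resamples)
    )
    lower = estimates[int((alpha / 2) * n_resamples)]
    upper = estimates[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper

# Illustrative values only, not real measurements.
sample = [4.1, 5.3, 3.8, 6.0, 5.5, 4.7, 5.1, 4.9]
lower, upper = bootstrap_ci(sample)
print(f"mean={mean(sample):.2f}, 95% interval=({lower:.2f}, {upper:.2f})")
```

The interval comes from the spread of the resampled means, so it needs no formula for the standard error.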
Web Apps & Visualization
Streamlit
A Python framework I use to turn data science and LLM projects into interactive web apps.
ggplot2
An R visualization package for building layered, publication-quality plots.
LLM Tools
LangChain & Ollama
Used to create document Q&A tools and summarize regulatory/strategic texts.
ChromaDB
A vector database for semantic document search and RAG-based pipelines.
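The core idea behind vector search can be sketched without any database: turn each document into a vector, then rank documents by similarity to the query vector. The sketch below is not the ChromaDB API. It uses plain word counts as a stand-in for a learned embedding model, so it only captures word overlap, and the names `embed`, `cosine`, and `search` are hypothetical.

```python
import math
from collections import Counter

def embed(text):
    """Toy 'embedding': word counts. A real pipeline would use an embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def search(query, documents, top_k=1):
    """Return the top_k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:top_k]

docs = [
    "quality control of sequencing reads",
    "managing conda environments on linux",
    "plotting gene expression with layered graphics",
]
print(search("sequencing reads quality", docs))
```

In a RAG pipeline, the retrieved documents would then be passed to the language model as context for answering the question.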
Bioinformatics
FastQC
Used for checking sequencing read quality before downstream analysis.
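As a rough illustration of what "read quality" means, the sketch below computes the mean Phred score at each read position from a tiny in-memory FASTQ example. It is not FastQC itself and does not reproduce its reports. It assumes Phred+33 encoding and four-line records, and the function names and example reads are made up.

```python
from statistics import mean

def parse_fastq(lines):
    """Yield (read_id, sequence, quality) from FASTQ text, 4 lines per record."""
    lines = [ln.rstrip("\n") for ln in lines if ln.strip()]
    for i in range(0, len(lines) - 3, 4):
        header, seq, _plus, qual = lines[i:i + 4]
        yield header[1:], seq, qual  # drop the leading '@' from the header

def per_base_mean_quality(records, offset=33):
    """Mean Phred score at each read position (Phred+33 assumed)."""
    by_position = []
    for _read_id, _seq, qual in records:
        for pos, ch in enumerate(qual):
            if pos == len(by_position):
                by_position.append([])
            by_position[pos].append(ord(ch) - offset)
    return [mean(scores) for scores in by_position]

# Two made-up reads; 'I' encodes Q40, '5' encodes Q20, '+' encodes Q10.
example = """@read1
ACGT
+
IIII
@read2
ACGT
+
II5+
""".splitlines()

means = per_base_mean_quality(parse_fastq(example))
print(means)  # quality drops toward the 3' end of these reads
```

A falling per-position mean like this is the kind of pattern that would prompt trimming before alignment or assembly.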
Tools & Workflow
Git & GitHub
Used for tracking learning projects, version control, and public portfolio publishing.
Conda & WSL
Used for managing environments and running bioinformatics tools in a Linux shell on Windows.
Want to share a great resource or chat about any of these tools? Reach out here.