GENE 46100
  • Home
Categories
All (32)
gene46100 (21)
gpt (1)
homework (6)
how-to (3)
llm (1)
notebook (9)
project (3)
slides (1)
tutorial (1)

Deep Learning in Genomics Course Material - ALL

  • This page contains ALL material for the GENE46100 Deep learning in genomics course, including test notes.

  • Find the 2025 syllabus here.

  • Edit github source here


Preprocessing and clustering

The data used in this basic preprocessing and clustering tutorial was collected from bone marrow mononuclear cells of healthy human donors and was part of openproblem’s…

 

Unit 04 Single Cell Genomics

© HakyImLab and Listed Authors - CC BY 4.0 License
May 12, 2025

Seurat - Guided Clustering Tutorial

gene46100
tutorial
For this tutorial, we will be analyzing the a dataset of Peripheral Blood Mononuclear Cells (PBMC) freely available from 10X Genomics. There are 2,700 single cells that were…
May 12, 2025
Satija lab

 

Borzoi prediction from personal genome - qmd version

gene46100
notebook
Using Borzoi for predicting on a personal genome, derived from VCF
May 5, 2025
Sofia Salazar

Predicting alternative polyadenylation with Borzoi

notebook
gene46100
Borzoi is a deep learning model that, similarily to Enformer:
May 5, 2025
Sofia Salazar

 

homework 6

gene46100
homework
If you run the analysis locally, make sure you clone the gene46100 conda environment and install the python packages as needed. See this jupyter notebook for an example.…
May 2, 2025

 

Enformer Architecture

gene46100
Enformer is a deep learning model designed for predicting gene expression and other functional genomics signals directly from DNA sequence. It was introduced by DeepMind and…
May 1, 2025
Haky Im

 

unit 03

gene46100
Slides for Unit 3 - Enformer, Borzoi, and PrediXcan link
May 1, 2025
Haky Im

 

Enformer training

gene46100
notebook
train enformer on human and mouse
Apr 28, 2025

Enformer usage neanderthal - jupyter notebook version

gene46100
notebook
predict neanderthal epigenome with enformer
Apr 28, 2025

 

homework 5

gene46100
homework
© HakyImLab and Listed Authors - CC BY 4.0 License
Apr 25, 2025

Embeddings Analysis

** embeddings provided by Henry from his DNA GPT model**
Apr 23, 2025
Haky Im

 

homework 4

gene46100
homework
Watch Karpathy’s nanoGPT tutorial on youtube link and answer the questions in the form
Apr 18, 2025

 

Building a GPT - companion notebook annotated

Companion notebook from Karpathys video on building a minimal GPT, annotated by cursors LLM with summary from gemini.
Apr 15, 2025
Andrey Karpathy

 

how to install local llm

how-to
llm
#how-to install ollama open webui local llm on a macbook adapted using conda instead of pyenv from https://medium.com/@hautel.alex2000/build-your-local-ai-from-zero-to-a-cust…
Apr 14, 2025
Haky Im

 

homework 3

gene46100
homework
© HakyImLab and Listed Authors - CC BY 4.0 License
Apr 10, 2025

 

Calibrating hyperparameters with weights and biases

gene46100
project
notebook
Weights and biases is a platform used for AI developers to track, visualize and manage their ML models and experiments. The coolest part is that W&B allows you to log…
Apr 10, 2025
Sofia Salazar

 

TF binding prediction challenge

gene46100
project
Competition details for TF binding prediction challenge
Apr 8, 2025
Haky Im

 

homework 2

gene46100
homework
© HakyImLab and Listed Authors - CC BY 4.0 License
Apr 4, 2025

Updated - DNA score prediction with Pytorch

gene46100
notebook
created by Erin Wilson. Downloaded from here.
Apr 4, 2025
Erin Wilson

TF Binding prediction project

gene46100
project
notebook
The goal of this project is to create a neural network model that predicts TF binding strength in a DNA sequence.
Apr 3, 2025
Sofia Salazar

Fit linear mode to linear data

We generate a data Y linearly dependent on features X: Y = X\beta + \epsilon
Apr 3, 2025
Haky Im

 

Measuring information in DNA sequence

In the analysis of DNA sequences, particularly when studying motifs and patterns, two key concepts from information theory are entropy and information content. These…
Apr 3, 2025
Haky Im

 

homework 1

gene46100
homework
© HakyImLab and Listed Authors - CC BY 4.0 License
Mar 28, 2025

Quick introduction to deep learning

gene46100
notebook
Understand and implement gradient descent to fit a linear model
Mar 25, 2025
Boxiang Liu, modified by Haky Im

 

preparing environment for unit 00

gene46100
how-to
setting up conda environement and installing packages
Mar 24, 2025
Haky Im

Building a GPT From Scratch summary

gene46100
slides
gpt
Karpathy’s gpt_dev.ipynb summarized by gemini, reviewed by me.
Mar 24, 2025
Haky Im

Gradient descent illustration

© HakyImLab and Listed Authors - CC BY 4.0 License
Jan 1, 2025
Haky Im

 

Conda Environments for Quarto Documents

how-to
How to specify a specific Conda environment when rendering quarto documents.
Jan 6, 2024
Rich Leyshon

Borzoi prediction from personal genome - jupyter notebook

gene46100
notebook
Using Borzoi for predicting on a personal genome, derived from VCF
Sep 24, 2023
Sofia Salazar

Slides unit 00

An MLP is a fully connected feedforward neural network — the classic deep learning architecture.
Feb 20, 2023

 

TODO

© HakyImLab and Listed Authors - CC BY 4.0 License
Jan 1, 2020
No matching items

    © HakyImLab and Listed Authors - CC BY 4.0 License