Path: blob/master/incubator/IC50_synthetic-bayesian.ipynb
411 views
Problem Type
IC50 determination is a commonplace problem type for experimental biochemists. The goal is to determine the concentration of an inhibitor that causes a protein to have 1/2 as much activity as without the inhibitor.
Classically, this is done by fitting a logistic function of some sort, where the IC50 is a parameter in the model. We then want to determine IC50, given the data at hand.
In the implementation here, we allow for multi-drug IC50 determination. Additionally, I also show how we can have a mix of noisy data and have non-uniform coverage over the drug concentration space, and still come to a fairly precise determination of the IC50 values as long as we have dense enough data around the true IC50 area. The simulated data below assumes that a first-pass experiment was done over a wide range of data points, followed by a second-pass set of experiments where a range of concentrations was filled in with more resolution.
Data structure
To use it with this model, the data should be structured as such:
Each row is one measurement for one concentration of drug.
The columns should indicate, at the minimum:
What treatment group the sample belonged to.
The measured value.
Extensions to the model
None
Reporting summarized findings
Here are examples of how to summarize the findings.
The IC50 of drug X was
mean
(95% HPD: [lower
,upper
]).
Other notes
None
Note how there's sparse coverage in the regions outside of the true IC50s.
Note the posterior distributions around the IC50s are fairly nice and tight.