This page was generated from docs/source/notebooks/quickstart/quickstart.ipynb.

Quickstart/Cheat-Sheet

Since this documentation is written in a Jupyter notebook, we will import a small IPython helper function to display files with syntax highlighting.

[1]:

from glotaran.utils.ipython import display_file

To start using pyglotaran in your project, you have to import it first. In addition, we need to import some extra components for later use.

[2]:

from glotaran.analysis.optimize import optimize
from glotaran.io import load_model
from glotaran.io import load_parameters
from glotaran.io import save_dataset
from glotaran.io.prepare_dataset import prepare_time_trace_dataset
from glotaran.project.scheme import Scheme

Let us get some example data to analyze:

[3]:

from glotaran.examples.sequential import dataset

dataset

Like all data in pyglotaran, the dataset is an xarray.Dataset. You can find more information about the xarray library on the xarray homepage.

The loaded dataset contains data simulated with a sequential model.

Plotting raw data

Now let's plot some time traces.

[4]:

plot_data = dataset.data.sel(spectral=[620, 630, 650], method="nearest")
plot_data.plot.line(x="time", aspect=2, size=5);

../../_images/notebooks_quickstart_quickstart_8_0.svg
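The `method="nearest"` argument above asks for the coordinate labels closest to the requested values rather than exact matches. The following is a minimal, dependency-free sketch of that idea; the axis values are hypothetical and the helper `select_nearest` is not part of xarray or pyglotaran.

```python
def select_nearest(axis, requested):
    """Return, for each requested value, the closest label on the axis."""
    return [min(axis, key=lambda label: abs(label - value)) for value in requested]


# Hypothetical spectral axis; the example dataset has its own coordinates.
spectral_axis = [600.0, 612.5, 625.0, 637.5, 650.0]

selected = select_nearest(spectral_axis, [620, 630, 650])
print(selected)  # [625.0, 625.0, 650.0]
```

Note that two requested values can map to the same label when the axis is coarse, so it is worth checking which coordinates were actually selected before interpreting a plot.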

We can also plot spectra at different times.

[5]:

plot_data = dataset.data.sel(time=[1, 10, 20], method="nearest")
plot_data.plot.line(x="spectral", aspect=2, size=5);

../../_images/notebooks_quickstart_quickstart_10_0.svg

Working with models

To analyze our data, we need to create a model.

Create a file called model.yaml in your working directory and fill it with the following:

[8]:

display_file("model.yaml", syntax="yaml")

[8]:

type: kinetic-spectrum

initial_concentration:
 input:
   compartments: [s1, s2, s3]
   parameters: [input.1, input.0, input.0]

k_matrix:
 k1:
   matrix:
     (s2, s1): kinetic.1
     (s3, s2): kinetic.2
     (s3, s3): kinetic.3

megacomplex:
 m1:
   k_matrix: [k1]

irf:
 irf1:
   type: gaussian
   center: irf.center
   width: irf.width

dataset:
 dataset1:
   initial_concentration: input
   megacomplex: [m1]
   irf: irf1
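To make the k_matrix section more concrete, here is a minimal sketch of the kinetics it describes. It assumes that each key is read as (to, from), so that (s2, s1) moves population from s1 to s2 and (s3, s3) is the decay of s3. The rate values are the starting values from the parameter file shown later on this page, the IRF is ignored, and all helper names are hypothetical. This is plain Python for illustration, not how pyglotaran evaluates the model.

```python
import math

# Starting values of the kinetic parameters used on this page.
rates = {"kinetic.1": 0.5, "kinetic.2": 0.3, "kinetic.3": 0.1}
compartments = ["s1", "s2", "s3"]

# Assumed reading of each key: (to, from) -> parameter label.
k_matrix = {
    ("s2", "s1"): "kinetic.1",
    ("s3", "s2"): "kinetic.2",
    ("s3", "s3"): "kinetic.3",
}


def build_rate_matrix(k_matrix, rates, compartments):
    """Build K such that dc/dt = K c for the concentration vector c."""
    index = {name: i for i, name in enumerate(compartments)}
    size = len(compartments)
    matrix = [[0.0] * size for _ in range(size)]
    for (target, source), label in k_matrix.items():
        rate = rates[label]
        i, j = index[target], index[source]
        matrix[j][j] -= rate  # every entry depletes its source compartment
        if i != j:
            matrix[i][j] += rate  # and feeds the target, unless it is a decay
    return matrix


def derivative(matrix, c):
    return [sum(row[j] * c[j] for j in range(len(c))) for row in matrix]


def rk4_step(matrix, c, dt):
    """Advance the concentrations by one classical Runge-Kutta step."""
    k1 = derivative(matrix, c)
    k2 = derivative(matrix, [x + 0.5 * dt * k for x, k in zip(c, k1)])
    k3 = derivative(matrix, [x + 0.5 * dt * k for x, k in zip(c, k2)])
    k4 = derivative(matrix, [x + dt * k for x, k in zip(c, k3)])
    return [
        x + dt / 6.0 * (a + 2.0 * b + 2.0 * d + e)
        for x, a, b, d, e in zip(c, k1, k2, k3, k4)
    ]


def simulate(matrix, start, t_end, dt=0.01):
    c = list(start)
    for _ in range(int(round(t_end / dt))):
        c = rk4_step(matrix, c, dt)
    return c


rate_matrix = build_rate_matrix(k_matrix, rates, compartments)
# initial_concentration [input.1, input.0, input.0] corresponds to [1, 0, 0].
concentrations = simulate(rate_matrix, [1.0, 0.0, 0.0], t_end=2.0)
print(dict(zip(compartments, concentrations)))
```

With all population starting in s1, the first compartment decays as exp(-0.5 t) while s2 and s3 rise and fall in turn, which is the sequential behaviour the example dataset was simulated with.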

Now you can load the model file.

[9]:

model = load_model("model.yaml")

You can check your model for problems with model.validate.

[10]:

model.validate()

[10]:

'Your model is valid.'

Working with parameters

Now define some starting parameters. Create a file called parameters.yaml with the following content.

[11]:

display_file("parameters.yaml", syntax="yaml")

[11]:

input:
 - ['1', 1, {'vary': False, 'non-negative': False}]
 - ['0', 0, {'vary': False, 'non-negative': False}]

kinetic: [
    0.5,
    0.3,
    0.1,
]

irf:
 - ['center', 0.3]
 - ['width', 0.1]
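The file above mixes two notations: bare values, which are labelled by their position starting at 1, and [label, value, options] entries. The sketch below shows how such a structure maps onto the full labels used in the model (for example kinetic.2 or irf.center). It works on a plain Python dict that mirrors the YAML content, handles only the two notations shown here, and the function `flatten_parameters` is a hypothetical helper, not pyglotaran's loader.

```python
# Plain-Python mirror of the YAML file above.
parameter_groups = {
    "input": [
        ["1", 1, {"vary": False, "non-negative": False}],
        ["0", 0, {"vary": False, "non-negative": False}],
    ],
    "kinetic": [0.5, 0.3, 0.1],
    "irf": [["center", 0.3], ["width", 0.1]],
}


def flatten_parameters(groups):
    """Map 'group.label' to value and vary flag (simplified illustration)."""
    flat = {}
    for group, entries in groups.items():
        for position, entry in enumerate(entries, start=1):
            options = {}
            if isinstance(entry, list):
                # Explicit form: [label, value] or [label, value, options].
                label, value = str(entry[0]), entry[1]
                if len(entry) > 2:
                    options = dict(entry[2])
            else:
                # Bare value: the label is the 1-based position in the list.
                label, value = str(position), entry
            flat[f"{group}.{label}"] = {
                "value": float(value),
                # Parameters are treated as free unless marked otherwise,
                # matching the Vary column shown further down this page.
                "vary": options.get("vary", True),
            }
    return flat


flat_parameters = flatten_parameters(parameter_groups)
for full_label, info in flat_parameters.items():
    print(full_label, info)
```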

[12]:

parameters = load_parameters("parameters.yaml")

You can also use model.validate to check for missing parameters.

[13]:

model.validate(parameters=parameters)

[13]:

'Your model is valid.'
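Conceptually, the missing-parameter check compares the labels the model refers to with the labels that the parameter file defines. A minimal sketch of that comparison follows; the label lists are copied by hand from the two files above, and `find_missing_parameters` is a hypothetical helper rather than pyglotaran's validation code.

```python
def find_missing_parameters(referenced, available):
    """Return the sorted labels that are referenced but not defined."""
    available = set(available)
    return sorted({label for label in referenced if label not in available})


# Labels referenced in model.yaml.
referenced = [
    "input.1", "input.0", "input.0",
    "kinetic.1", "kinetic.2", "kinetic.3",
    "irf.center", "irf.width",
]
# Labels defined in parameters.yaml.
available = [
    "input.1", "input.0",
    "kinetic.1", "kinetic.2", "kinetic.3",
    "irf.center", "irf.width",
]

missing_complete = find_missing_parameters(referenced, available)
missing_without_width = find_missing_parameters(
    referenced, [label for label in available if label != "irf.width"]
)
print(missing_complete)       # []
print(missing_without_width)  # ['irf.width']
```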

Since not all problems in the model can be detected automatically, it is wise to inspect the model visually. For this purpose, you can simply print the model.

[14]:

model

[14]:

Model

Type: kinetic-spectrum

Initial Concentration

input:
Label: input
Compartments: [‘s1’, ‘s2’, ‘s3’]
Parameters: [input.1, input.0, input.0]
Exclude From Normalize: []

K Matrix

k1:
Label: k1
Matrix:
- (‘s2’, ‘s1’): kinetic.1
- (‘s3’, ‘s2’): kinetic.2
- (‘s3’, ‘s3’): kinetic.3

Irf

irf1 (gaussian):
Label: irf1
Type: gaussian
Center: irf.center
Width: irf.width
Normalize: True
Backsweep: False

Dataset

dataset1:
Label: dataset1
Megacomplex: [‘m1’]
Initial Concentration: input
Irf: irf1

Megacomplex

m1 (None):
Label: m1
K Matrix: [‘k1’]

You should inspect your parameters the same way.

[15]:

parameters

[15]:

input:

Label	Value	StdErr	Min	Max	Vary	Non-Negative	Expr
1	1	0	-inf	inf	False	False	None
0	0	0	-inf	inf	False	False	None

irf:

Label	Value	StdErr	Min	Max	Vary	Non-Negative	Expr
center	0.3	0	-inf	inf	True	False	None
width	0.1	0	-inf	inf	True	False	None

kinetic:

Label	Value	StdErr	Min	Max	Vary	Non-Negative	Expr
1	0.5	0	-inf	inf	True	False	None
2	0.3	0	-inf	inf	True	False	None
3	0.1	0	-inf	inf	True	False	None

Optimizing data

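Now we have everything we need to optimize our parameters. First we combine the model, the parameters, and the dataset into a Scheme, then we pass that scheme to optimize, which we imported earlier.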

[16]:

scheme = Scheme(model, parameters, {"dataset1": dataset})
result = optimize(scheme)
result

   Iteration     Total nfev        Cost      Cost reduction    Step norm     Optimality
       0              1         7.5712e+00                                    1.36e+02
       1              2         7.5710e+00      1.95e-04       1.97e-05       1.16e-02
       2              3         7.5710e+00      1.38e-12       3.77e-09       2.27e-06
Both `ftol` and `xtol` termination conditions are satisfied.
Function evaluations 3, initial cost 7.5712e+00, final cost 7.5710e+00, first-order optimality 2.27e-06.

[16]:

Optimization Result
Number of residual evaluation	3
Number of variables	5
Number of datapoints	151200
Degrees of freedom	151195
Chi Square	1.51e+01
Reduced Chi Square	1.00e-04
Root Mean Square Error (RMSE)	1.00e-02

Model

Type: kinetic-spectrum

Initial Concentration

input:
Label: input
Compartments: [‘s1’, ‘s2’, ‘s3’]
Parameters: [input.1: 1.00000e+00 (fixed), input.0: 0.00000e+00 (fixed), input.0: 0.00000e+00 (fixed)]
Exclude From Normalize: []

K Matrix

k1:
Label: k1
Matrix:
- (‘s2’, ‘s1’): kinetic.1: 4.99982e-01 (StdErr: 7e-05 ,initial: 5.00000e-01)
- (‘s3’, ‘s2’): kinetic.2: 2.99994e-01 (StdErr: 4e-05 ,initial: 3.00000e-01)
- (‘s3’, ‘s3’): kinetic.3: 1.00005e-01 (StdErr: 5e-06 ,initial: 1.00000e-01)

Irf

irf1 (gaussian):
Label: irf1
Type: gaussian
Center: irf.center: 2.99998e-01 (StdErr: 5e-06 ,initial: 3.00000e-01)
Width: irf.width: 1.00000e-01 (StdErr: 7e-06 ,initial: 1.00000e-01)
Normalize: True
Backsweep: False

Dataset

dataset1:
Label: dataset1
Megacomplex: [‘m1’]
Initial Concentration: input
Irf: irf1

Megacomplex

m1 (None):
Label: m1
K Matrix: [‘k1’]
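The summary figures at the top of the result can be cross-checked against each other. The sketch below reproduces them under the assumption that the degrees of freedom are datapoints minus variables, the reduced chi square is chi square divided by the degrees of freedom, and the RMSE is the square root of the reduced chi square. These relations are inferred from the printed numbers, not taken from the pyglotaran source, and the chi square value is the rounded one shown above.

```python
import math

# Values copied from the optimization result above.
n_datapoints = 151200
n_variables = 5
chi_square = 15.1  # printed as 1.51e+01, so only three significant digits

# Assumed relations between the reported quantities.
degrees_of_freedom = n_datapoints - n_variables
reduced_chi_square = chi_square / degrees_of_freedom
rmse = math.sqrt(reduced_chi_square)

print(f"Degrees of freedom: {degrees_of_freedom}")
print(f"Reduced chi square: {reduced_chi_square:.1e}")
print(f"RMSE:               {rmse:.1e}")
```

An RMSE of about 1e-2 is the typical size of a single residual, which is a convenient number to compare against the noise level you expect in your data.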

[17]:

result.optimized_parameters

[17]:

input:

Label	Value	StdErr	Min	Max	Vary	Non-Negative	Expr
1	1	0	-inf	inf	False	False	None
0	0	0	-inf	inf	False	False	None

irf:

Label	Value	StdErr	Min	Max	Vary	Non-Negative	Expr
center	0.299998	5.01464e-06	-inf	inf	True	False	None
width	0.1	6.70888e-06	-inf	inf	True	False	None

kinetic:

Label	Value	StdErr	Min	Max	Vary	Non-Negative	Expr
1	0.499982	7.26317e-05	-inf	inf	True	False	None
2	0.299994	4.19618e-05	-inf	inf	True	False	None
3	0.100005	4.78474e-06	-inf	inf	True	False	None

You can get the resulting data for your dataset from result.data, indexed by the dataset label.

[18]:

result_dataset = result.data["dataset1"]
result_dataset

Visualize the Result

The resulting data can be visualized the same way as the dataset. To judge the quality of the fit, you should look at the first left and right singular vectors of the residual.

[19]:

residual_left = result_dataset.residual_left_singular_vectors.sel(left_singular_value_index=0)
residual_right = result_dataset.residual_right_singular_vectors.sel(right_singular_value_index=0)
residual_left.plot.line(x="time", aspect=2, size=5)
residual_right.plot.line(x="spectral", aspect=2, size=5);

../../_images/notebooks_quickstart_quickstart_36_0.svg

../../_images/notebooks_quickstart_quickstart_36_1.svg
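The reason for looking at the first singular vectors is that any systematic misfit left in the residual shows up there as a smooth, structured trace, whereas a good fit leaves only noise. The sketch below illustrates this on a small synthetic residual that hides one time pattern. To stay dependency-free it extracts the leading singular vectors by power iteration; in practice a library routine such as numpy.linalg.svd would be the usual choice. All names and numbers are hypothetical and unrelated to the result dataset above.

```python
import math


def first_singular_vectors(matrix, iterations=200):
    """Leading left vector, singular value, and right vector by power iteration."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    # Uniform start vector; it must not be orthogonal to the true right vector.
    right = [1.0 / math.sqrt(n_cols)] * n_cols
    for _ in range(iterations):
        left = [sum(matrix[i][j] * right[j] for j in range(n_cols)) for i in range(n_rows)]
        norm = math.sqrt(sum(x * x for x in left))
        left = [x / norm for x in left]
        right = [sum(matrix[i][j] * left[i] for i in range(n_rows)) for j in range(n_cols)]
        sigma = math.sqrt(sum(x * x for x in right))
        right = [x / sigma for x in right]
    return left, sigma, right


# Synthetic residual: a decaying time pattern times a spectral pattern,
# plus a small deterministic perturbation standing in for noise.
time_pattern = [math.exp(-0.5 * t) for t in range(6)]
spectral_pattern = [1.0, 0.5, -0.25, 0.75]
residual = [
    [0.01 * a * b + 1e-5 * ((7 * i + 3 * j) % 5 - 2) for j, b in enumerate(spectral_pattern)]
    for i, a in enumerate(time_pattern)
]

left, sigma, right = first_singular_vectors(residual)

# Compare the recovered left vector with the normalized hidden time pattern.
pattern_norm = math.sqrt(sum(a * a for a in time_pattern))
alignment = abs(sum(l * a / pattern_norm for l, a in zip(left, time_pattern)))
print(f"alignment with hidden time pattern: {alignment:.4f}")
```

An alignment close to 1 means the first left singular vector has recovered the hidden decay, which is the kind of structure that would suggest a missing component in a real fit.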

Finally, you can save your result.

[20]:

save_dataset(result_dataset, "dataset1.nc")
