Grama: Basics#
Purpose: We will use the grama package to work with models, giving us a powerful toolkit to explore and make sense of physical scenarios. To that end, this exercise provides a brief introduction to grama.
Setup#
import grama as gr    # Core grama package
DF = gr.Intention()   # Data pronoun: lets us refer to DataFrame columns inside grama verbs
%matplotlib inline
Grama Verbs#
Grama is organized around two object classes and four verb classes. The objects are:

- Data (df), as represented by a pd.DataFrame: represents observations from physical experiments, simulations from models, and so on.
- Models (md), as represented by a gr.Model: represents a model, including inputs, functions providing outputs, and uncertainties.
The four verb classes allow us to move between the two object types, as summarized below.
Verb Classes#
Functions in grama are best thought of as “verbs”: functions that perform a specified action. The verb classes map between data and models. Every grama verb starts with a two-letter prefix, which gives us a hint as to how the verbs should be used. We’ve already seen a variety of transformation (tf_*) verbs for working with data. There are four primary verb classes, with an additional class for plotting.
Class | Prefix | In | Out |
---|---|---|---|
Transform | tf_* | Data | Data |
Composition | cp_* | Model | Model |
Fitting | ft_* | Data | Model |
Evaluation | ev_* | Model | Data |
Plotting | pt_* | Data | (Plot) |
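As a small illustration of the prefix convention, here is a minimal sketch (not part of the original exercise) using a transform verb, which maps data to data. The helpers gr.df_make() and gr.tf_mutate() and the DF pronoun from Setup are standard grama tools, but the column names x and x_sq are made up for illustration.

# NOTE: Illustrative sketch only; column names are made up
df_example = gr.df_make(x=[1, 2, 3])      # a small data object
(
    df_example
    >> gr.tf_mutate(x_sq=DF.x * DF.x)     # transform verb: data in, data out
)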
Working with (Grama) models#
Grama is best explained by way of a simple example. The following code initializes a grama model for a cantilever beam. There is a large amount of mathematical theory about cantilever beams, but we will ignore most of that theory for the moment.
from grama.models import make_cantilever_beam
md_beam = make_cantilever_beam()
Model summary#
Grama models contain a large amount of mathematical structure. Evaluation (ev_*) verbs help us make use of that structure to generate data that we can mine for insights. However, before we can start evaluating a model, we need to know some basic facts about it. To that end, let’s take a look at a summary of the example beam model.
q1 Inspect a model summary#
Printing the model gives a high-level summary of the model. Run the code below, then answer the questions under Observations.
# TASK: No need to edit; run and inspect
md_beam
model: Cantilever Beam
inputs:
var_det:
t: [2, 4]
w: [2, 4]
var_rand:
H: (+1) norm, {'loc': 500.0, 'scale': 100.0}
V: (+1) norm, {'loc': 1000.0, 'scale': 100.0}
E: (+0) norm, {'loc': 29000000.0, 'scale': 1450000.0}
Y: (-1) norm, {'loc': 40000.0, 'scale': 2000.0}
copula:
Independence copula
functions:
cross-sectional area: ['w', 't'] -> ['c_area']
limit state: stress: ['w', 't', 'H', 'V', 'E', 'Y'] -> ['g_stress']
limit state: displacement: ['w', 't', 'H', 'V', 'E', 'Y'] -> ['g_disp']
Observations

- What variables (var) does the model have? t, w, H, V, E, Y
- Which variables are deterministic (det)? t, w
- Which variables are random (rand)? H, V, E, Y
- What functions does the model have? Cross-sectional area, limit state: stress, and limit state: displacement
- What outputs do those functions provide? c_area, g_stress, g_disp
Importance of context#
Models are much more than just mathematical statements; they are used in a particular context. Grama has no way to store the infinite complexities of context in something as rigid as computer code. Thus, when interpreting the results from a model, it is important for the analyst to keep track of this context.
One basic way to record context is to tabulate facts about the inputs and outputs of a model. While this is a small subset of the full model context, it is an important subset.
Symbol | I/O | Meaning | Units | Notes |
---|---|---|---|---|
t | Input | Beam cross-section height | in | A taller beam resists more vertical load |
w | Input | Beam cross-section width | in | A wider beam resists more horizontal load |
H | Input | Tip horizontal force | lbf | A greater force leads to higher stress and displacement |
V | Input | Tip vertical force | lbf | A greater force leads to higher stress and displacement |
E | Input | Elasticity | psi | A stiffer beam (greater E) deflects less |
Y | Input | Yield strength | psi | A higher strength better resists stress |
c_area | Output | Beam cross-section area | in^2 | A larger beam weighs more |
g_stress | Output | Maximum stress failure criterion | (-) | Stress failure can lead to structure collapse |
g_disp | Output | Tip displacement failure criterion | in | Excessive deformation leads to misalignment |
It is also important to note that the failure criteria are formulated such that g > 0 corresponds to success, and g <= 0 corresponds to failure.
With this bit of context, we can make more sense of results from the model.
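As a quick illustration of using this context, the following sketch (not part of the original exercise) hand-checks the cross-sectional area output: since c_area is just width times height, we can predict the value the model should return at w = t = 3 in.

# NOTE: Illustrative sketch only; hand-check of c_area
w, t = 3.0, 3.0   # inches
print(w * t)      # 9.0 in^2, matching c_area in the evaluations below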
Generate data from a model#
The philosophy behind grama is to generate data from a model and study that data to gain insights about the model. We’ll practice this data generation in the next few tasks. First, let’s print the model summary again to help with the next task:
# NOTE: No need to edit; use this for reference for the next task
md_beam
model: Cantilever Beam
inputs:
var_det:
t: [2, 4]
w: [2, 4]
var_rand:
H: (+1) norm, {'loc': 500.0, 'scale': 100.0}
V: (+1) norm, {'loc': 1000.0, 'scale': 100.0}
E: (+0) norm, {'loc': 29000000.0, 'scale': 1450000.0}
Y: (-1) norm, {'loc': 40000.0, 'scale': 2000.0}
copula:
Independence copula
functions:
cross-sectional area: ['w', 't'] -> ['c_area']
limit state: stress: ['w', 't', 'H', 'V', 'E', 'Y'] -> ['g_stress']
limit state: displacement: ['w', 't', 'H', 'V', 'E', 'Y'] -> ['g_disp']
q2 DataFrame evaluation#
The following code attempts to evaluate the model at specified input values. However, the code has errors. Fix the code below by specifying values for the missing inputs.
# TASK: Add values for the missing variables
(
md_beam
>> gr.ev_df(
df=gr.df_make(
Y=4e4,
V=1e3,
E=29e7,
H=5e2,
w=3.0,
t=3.0,
)
)
)
 | Y | V | E | H | w | t | g_disp | c_area | g_stress |
---|---|---|---|---|---|---|---|---|---|
0 | 40000.0 | 1000.0 | 290000000.0 | 500.0 | 3.0 | 3.0 | 2.063115 | 9.0 | 0.166667 |
The verb gr.ev_df() is a very basic evaluation where we have to specify values for every input. However, there are other evaluation verbs that make intelligent choices for us.
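For instance, the following sketch (not part of the original exercise) uses gr.ev_nominal() to let grama choose nominal input values, then uses gr.tf_mutate() to flag whether each limit state passes; both verbs are standard in grama, but check their availability in your installed version.

# NOTE: Illustrative sketch only; verify gr.ev_nominal() in your grama version
(
    md_beam
    >> gr.ev_nominal(df_det="nom")       # grama chooses nominal input values
    >> gr.tf_mutate(
        ok_stress=DF.g_stress > 0,       # True: stress criterion satisfied
        ok_disp=DF.g_disp > 0,           # True: displacement criterion satisfied
    )
)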
q3 Sinew evaluation#
Use the verb gr.ev_sinews() to generate “sweep” data from the beam model at nominal levels for the deterministic inputs. You will need to specify one short keyword argument to complete this task.

Hint: Remember that you can put your cursor inside a function’s arguments in a code cell and press Shift + Tab to bring up the documentation. Use this to determine how to set nominal levels for the deterministic variables.
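If Shift + Tab is not available in your environment, a plain help() call is an alternative; this note is not part of the original exercise, and it assumes gr.eval_sinews is the long-form name wrapped by the ev_sinews shortcut, per grama’s usual naming convention.

# NOTE: Alternative way to view the documentation
help(gr.eval_sinews)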
# TASK: Successfully call gr.ev_sinews() with nominal levels for the deterministic variables
(
md_beam
>> gr.ev_sinews(df_det="nom")
)
 | Y | H | E | V | sweep_var | sweep_ind | t | w | g_disp | c_area | g_stress |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 38991.356136 | 267.365213 | 2.826352e+07 | 1030.057000 | H | 0 | 3.0 | 3.0 | 0.394120 | 9.0 | 0.253994 |
1 | 38991.356136 | 377.935965 | 2.826352e+07 | 1030.057000 | H | 0 | 3.0 | 3.0 | 0.336441 | 9.0 | 0.192566 |
2 | 38991.356136 | 423.529033 | 2.826352e+07 | 1030.057000 | H | 0 | 3.0 | 3.0 | 0.307564 | 9.0 | 0.167236 |
3 | 38991.356136 | 456.927270 | 2.826352e+07 | 1030.057000 | H | 0 | 3.0 | 3.0 | 0.284633 | 9.0 | 0.148682 |
4 | 38991.356136 | 486.028970 | 2.826352e+07 | 1030.057000 | H | 0 | 3.0 | 3.0 | 0.263472 | 9.0 | 0.132514 |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
115 | 40279.420598 | 496.167260 | 2.970458e+07 | 1080.597377 | Y | 2 | 3.0 | 3.0 | 0.276727 | 9.0 | 0.131005 |
116 | 40861.454599 | 496.167260 | 2.970458e+07 | 1080.597377 | Y | 2 | 3.0 | 3.0 | 0.276727 | 9.0 | 0.145556 |
117 | 41529.419348 | 496.167260 | 2.970458e+07 | 1080.597377 | Y | 2 | 3.0 | 3.0 | 0.276727 | 9.0 | 0.162255 |
118 | 42441.280698 | 496.167260 | 2.970458e+07 | 1080.597377 | Y | 2 | 3.0 | 3.0 | 0.276727 | 9.0 | 0.185052 |
119 | 44652.695748 | 496.167260 | 2.970458e+07 | 1080.597377 | Y | 2 | 3.0 | 3.0 | 0.276727 | 9.0 | 0.240337 |
120 rows × 11 columns
A “sinew” evaluation sweeps across input variables while holding the other variables constant. This process is repeated for multiple starting points and for each input variable. We can get a sense for the input values that gr.ev_sinews() selects by visualizing part of the input space. The following shows a scatterplot for two of the variables:
# NOTE: No need to edit; run and inspect
(
md_beam
>> gr.ev_sinews(df_det="nom")
>> gr.ggplot(gr.aes("V", "H", color="factor(sweep_ind)"))
+ gr.geom_point()
)
[Plot: scatterplot of V vs. H, colored by sweep_ind]
Note that for each sweep (identified by sweep_ind) we see a “cross” emanating from a single point outwards, parallel to both coordinate axes. This process is repeated for every input variable in order to generate the data.
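To see one such cross in isolation, a sketch like the following (not part of the original exercise) filters the sweep data down to a single sweep index before plotting; gr.tf_filter() and the DF pronoun from Setup are standard grama tools.

# NOTE: Illustrative sketch only; isolate a single sweep before plotting
(
    md_beam
    >> gr.ev_sinews(df_det="nom")
    >> gr.tf_filter(DF.sweep_ind == 0)   # keep only the first sweep
    >> gr.ggplot(gr.aes("V", "H"))
    + gr.geom_point()
)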
Visualize data from a model#
Generating data is only part of working with a model; we must also study the data to make sense of it. One of the best ways to study data is to visualize it. We can make manual plots with ggplot(), but grama provides a much quicker way: many evaluation verbs work with the special verb gr.pt_auto() to generate a sensible default visualization.
q4 Visualize sinew evaluation data#
Use the gr.pt_auto() routine to visualize the outputs from the sinew evaluation. Answer the questions under Observations below.
# TASK: Use gr.pt_auto() to visualize the sinew data
(
md_beam
>> gr.ev_sinews(df_det="nom")
>> gr.pt_auto()
)
Calling plot_sinew_outputs....
[Plot: sinew sweep plots showing each output against each swept input]
Observations

- Do any inputs affect the output c_area? What about w or t? How do you know? (Or not know?)
  None of the inputs visualized have any effect on c_area. The inputs w and t do affect c_area, but we can’t tell that from this plot, since w and t are not swept (they are held constant at their nominal values). I know this because c_area = w * t is the cross-sectional area of the beam.
- Does the input Y affect the output g_disp? How do you know?
  The input Y has no effect on g_disp; its sweep has zero slope.
- Does the input E increase or decrease g_disp?
  The input E tends to increase g_disp.
Where this course is going#
This was just a brief introduction to how grama works and what it can do. The rest of this course is about learning tools to make full use of the grama toolkit. The exercises in this course are organized into four different categories (grama, data, vis, and stat) that represent four broad areas of working with grama:
Build, evaluate, and understand a model with grama (grama)#
We will learn tools to build and sanity-check models using grama. This will involve more evaluation and visualization tools to make sense of models, but also probabilistic tools to build models that encode our uncertainties about a scenario.
Wrangle data (data)#
To manage the data outputs from grama models, we will learn a variety of general-purpose wrangling tools. These will help us work with grama models, but the same tools are useful for any dataset.
Visualize data to gain insights (vis)#
To make sense of data, we will learn a variety of visualization techniques. These will help us gain insights into grama models, but the same techniques are useful for any dataset.
Use statistics to quantify uncertainty (stat)#
Datasets are always limited and models inherently include uncertainties. Statistics will allow us to quantify these uncertainties and make informed decisions using data and models.