Package: LSTbook 0.6.2.0003

LSTbook: Data and Software for "Lessons in Statistical Thinking"

"Lessons in Statistical Thinking" D.T. Kaplan (2024) <https://dtkaplan.github.io/Lessons-in-statistical-thinking/> is a textbook for a first or second course in statistics that embraces data wrangling, causal reasoning, modeling, statistical adjustment, and simulation. 'LSTbook' supports the student-centered, tidy, pipeline-oriented computing style featured in the book.

Authors:Daniel Kaplan [aut, cre], Randall Pruim [aut]

LSTbook_0.6.2.0003.tar.gz
LSTbook_0.6.2.0003.zip(r-4.7)LSTbook_0.6.2.0003.zip(r-4.6)LSTbook_0.6.2.0003.zip(r-4.5)
LSTbook_0.6.2.0003.tgz(r-4.6-any)LSTbook_0.6.2.0003.tgz(r-4.5-any)
LSTbook_0.6.2.0003.tar.gz(r-4.7-any)LSTbook_0.6.2.0003.tar.gz(r-4.6-any)
LSTbook_0.6.2.0003.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION |NEWS
card.svg |card.png
LSTbook/json (API)

# Install 'LSTbook' in R:
install.packages('LSTbook', repos = c('https://dtkaplan.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/dtkaplan/lstbook/issues

Datasets:
  • AAUP - 1984 salaries in various professional fields
  • Anthro_F - Anthropometric data from college-aged women
  • Big - Short, simple data frames for textbook examples
  • Birdkeepers - Birdkeeping and Lung Cancer
  • Births2022 - Records on births in the US in 2022
  • Boston_marathon - Winning times in the Boston Marathon
  • Butterfly - World records in the 100 & 200 m butterfly swim
  • Calif_precip - Annual precipitation in California locations
  • Callback - Resume Experiment Data
  • Clock_auction - Data from McClave-Sincich _Statistics_ 11/e
  • CRDS - Smoking and lung function among youths
  • Dowsing - Data from McClave-Sincich _Statistics_ 11/e
  • Econ_outlook_poll - SIMULATED data from an economic outlook poll
  • FARS - Annual summaries concerning motor-vehicle related fatalities in the US#'
  • Framingham - Data from the Framingham heart study
  • Galton - Galton's dataset of parent and child heights
  • Geography_journals - Data from McClave-Sincich _Statistics_ 11/e
  • Germany1933vote - Voting patterns in the 1933 German national election
  • Gilbert - Data from the trial of serial killer Kristen Gilbert
  • Go_vote - Get out the vote experiment
  • Gradepoint - Sample from a college registrar's database
  • Grades - Sample from a college registrar's database
  • Hill_racing - Winning times in Scottish Hill races, 2005-2017
  • McCredie_Kurtz - "Big Five" personality ratings for college first-year students
  • Monocacy_river - Data on run-off from the Monocacy river at Jug Bridge, Maryland.
  • MPG - Fuel economy measurements on US car models
  • Names_and_race - Resume Experiment Data
  • Natality_2014 - Medical info on each birth in the US in 2014
  • Nats - Short, simple data frames for textbook examples
  • Offspring - Relative sizes offspring/parent for many species
  • Orings - Space Shuttle O-Ring Failures
  • Penguins - Body measurements on penguins
  • PGA_index - Data from McClave-Sincich _Statistics_ 11/e
  • PIDD - Pima Indians Diabetes Database
  • Sessions - Sample from a college registrar's database
  • Shipping_losses - Shipping losses in 1941 in the Atlantic
  • sim_00 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_01 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_02 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_03 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_04 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_05 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_06 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_07 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_08 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_09 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_10 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_11 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_12 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_flights - Simulations for use in _Lessons in Statistical Thinking_
  • sim_medical_observations - Simulations for use in _Lessons in Statistical Thinking_
  • sim_prob_21.1 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_satgpa - Simulations for use in _Lessons in Statistical Thinking_
  • sim_school1 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_school2 - Simulations for use in _Lessons in Statistical Thinking_
  • sim_vaccine - Simulations for use in _Lessons in Statistical Thinking_
  • STAR - STAR Project Data
  • Tiny - Short, simple data frames for textbook examples
  • UCB_applicants - Roster of applicants to six major departments at UC Berkeley
  • US_wildfires - Monthly tallies of wildfires in the US from 2000 to 2022
  • Wheat - Experimental data on the yield of winter wheat
  • Whickham - Data from the Whickham survey

On CRAN:

Conda:

4.62 score 5 stars 55 scripts 273 downloads 34 exports 32 dependencies

Last updated from:eb7b48e266. Checks:8 ERROR, 1 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64ERROR150
source / vignettesERROR203
linux-release-x86_64ERROR151
macos-release-arm64ERROR165
macos-oldrel-arm64ERROR137
windows-develERROR411
windows-releaseERROR360
windows-oldrelERROR389
wasm-releaseOK130

Exports:add_plot_labelsadd_slope_roseadd_violin_ruleranova_summarybernoulliblock_bycat2valuecategoricalconf_intervaldag_drawdatasim_intervenedatasim_makedatasim_runeachlabel_zero_onemix_withmodel_evalmodel_plotmodel_skeletonmodel_trainmodel_valuesmosaic_cull_for_dontilespoint_plotR2random_levelsrandom_termsregression_summaryresamplertermshuffletake_sampletrialszero_one

Dependencies:backportsbroomclicpp11dplyrfarvergenericsggplot2gluegtableisobandlabelinglifecyclemagrittrMASSpillarpkgconfigpurrrR6RColorBrewerrlangS7scalesstringistringrtibbletidyrtidyselectutf8vctrsviridisLitewithr

Readme and manuals

Help Manual

Help pageTopics
1984 salaries in various professional fieldsAAUP
Convenience function for adding labels to point_plot or others without needing the ggplot2 + pipe.add_plot_labels
Add a slope "rose" to a plot.add_slope_rose add_violin_ruler
Anthropometric data from college-aged womenAnthro_F
Birdkeeping and Lung CancerBirdkeepers
Records on births in the US in 2022Births2022
Winning times in the Boston MarathonBoston_marathon
World records in the 100 & 200 m butterfly swimButterfly
Annual precipitation in California locationsCalif_precip
Resume Experiment DataCallback Names_and_race
Helpers for specifying nodes in simulationsbernoulli block_by cat2value categorical each mix_with random_levels
Summaries of regression modelsanova_summary conf_interval R2 regression_summary
Smoking and lung function among youthsCRDS
Draw a DAGdag_draw
Construct and modify data simulationsdatasim_intervene datasim_make datasim_to_igraph
Run a datasim simulation, producing a data framedatasim_run
SIMULATED data from an economic outlook pollEcon_outlook_poll
Utilitiesexplanatory_vars formula_from_mod get_training_data response_values response_var
Annual summaries concerning motor-vehicle related fatalities in the US#'FARS
Data from the Framingham heart studyFramingham
Galton's dataset of parent and child heightsGalton
Voting patterns in the 1933 German national electionGermany1933vote
Data from the trial of serial killer Kristen GilbertGilbert
Get out the vote experimentGo_vote
Winning times in Scottish Hill races, 2005-2017Hill_racing
Data from McClave-Sincich _Statistics_ 11/eClock_auction Dowsing Geography_journals McClave_Sincich PGA_index
"Big Five" personality ratings for college first-year studentsMcCredie_Kurtz
Evaluate a model on inputsmodel_eval
Helper functions to evaluate modelsmodel_eval_fun
Check model type against model specification and datamodel_family
Graph a model functionmodel_plot
Convert a model to a skeletonmodel_skeleton
train a model, easilymodel_train
Construct a model and return the model valuesmodel_values
Data on run-off from the Monocacy river at Jug Bridge, Maryland.Monocacy_river
Cull objects used with do()mosaic_cull_for_do mosaic_cull_for_do.aggregated.stat mosaic_cull_for_do.anova mosaic_cull_for_do.aov mosaic_cull_for_do.cointoss mosaic_cull_for_do.default mosaic_cull_for_do.fitdistr mosaic_cull_for_do.htest mosaic_cull_for_do.lm mosaic_cull_for_do.matrix mosaic_cull_for_do.table
Fuel economy measurements on US car modelsMPG
Medical info on each birth in the US in 2014Natality_2014
Short, simple data frames for textbook examplesBig Nats Tiny
Create vector based on roughly equally sized groupsntiles
Relative sizes offspring/parent for many speciesOffspring
Space Shuttle O-Ring FailuresOrings
Body measurements on penguinsPenguins
Pima Indians Diabetes DatabasePIDD
One-step data graphicspoint_plot
Nice printing of some internal objectsprint.datasim
A printing method for model objectsprint.model_object
Create columns with random numbers for modelingrandom_terms
Sample from a college registrar's databaseGradepoint Grades Registrar Sessions
Generate a random term in a model.rterm
Shipping losses in 1941 in the AtlanticShipping_losses
Simulations for use in _Lessons in Statistical Thinking_sim_00 sim_01 sim_02 sim_03 sim_04 sim_05 sim_06 sim_07 sim_08 sim_09 sim_10 sim_11 sim_12 sim_flights sim_medical_observations sim_objects sim_prob_21.1 sim_satgpa sim_school1 sim_school2 sim_vaccine
Evaluate a tilde expression on a data framesplit_tilde
STAR Project DataSTAR
Samples from various kinds of objectsresample shuffle take_sample take_sample.default
Run the left side of the pipeline multiple times.trials
Roster of applicants to six major departments at UC BerkeleyUCB_applicants
Monthly tallies of wildfires in the US from 2000 to 2022US_wildfires
Experimental data on the yield of winter wheatWheat
Data from the Whickham surveyWhickham
Zero-one transformation for categorical variablelabel_zero_one zero_one