Potpourri: Statistics #79

Bayes Rules! An Introduction to Bayesian Modeling with R
A friendly introduction to machine learning compilers and optimizers
A History of Polar Area / Coxcomb / Rose charts & how to make them in R’s ggplot2
A Dataset of Cryptic Crossword Clues
– Survival Analysis: Part I: Basic concepts and first analyses, Part II: Multivariate data analysis – an introduction to concepts and methods, Part III: Multivariate data analysis – choosing a model and assessing its adequacy and fit, Part IV: Further concepts and methods in survival analysis
Dataviz Accessibility Resources
RegExplain
A Succinct Intro to R
Deep Learning’s Diminishing Returns
Working with Google Sheets from R
The Rise of the Pandemic Dashboard
Predicting FT Trending Topics
The Art of Linear Algebra: Graphic Notes on “Linear Algebra for Everyone”
Modeling Possibly Nonlinear Confounders
ggHoriPlot: build horizon plots in ggplot2
Finding the Eras of MTV’s The Challenge Through Clustering
Why data scientists shouldn’t need to know Kubernetes
Creating a Dataset from an Image in R Markdown using reticulate
Neural Networks from scratch
plotDK: Plot Summary Statistics as Choropleth Maps of Danish Administrative Areas
The Power of Parameterized Reports With Plumber
Riding tables with {gt} and {gtExtras}
How to explain gradient boosting
How to visualize decision trees
Speech and Language Processing
Sexy up your logistic regression model with logit dotplots
AI’s Islamophobia problem
Possession Is The Puzzle Of Soccer Analytics. These Models Are Trying To Solve It.


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71 #72 #73 #74 #75 #76 #77 #78

Potpourri: Statistics #78

Investigation of Data Irregularities in Doing Business 2018 and Doing Business 2020
Dyadic Clustering in International Relations
Forecasting: Principles and Practice
Data Disasters
A Quick How-to on Labelling Bar Graphs in ggplot2
Data visualisation using R, for researchers who don’t use R
Easy access to high-resolution daily climate data for Europe
Put R Models in Production
Machine learning, explained
Three ways to visualize binary survey data
In defense of simple charts
Modern Statistics with R
How to avoid machine learning pitfalls: a guide for academic researchers
Tune xgboost models with early stopping to predict shelter animal status
Machine-learning on dirty data in Python: a tutorial
I saw your RCT and I have some worries! FAQs
Up and running with officedown
Use racing methods to tune xgboost models and predict home runs
The 5-minute learn: Create pretty and geographically accurate transport maps in R
R’s Internal Data Formats: .Rda, .RData, .rds
Improve Your Code – Best Practices for Durable Code
An educator’s perspective of the tidyverse
Estimating regression coefficients using a Neural Network (from scratch)
Let users choose which plot you want to show
A look into ANOVA. The long way.
3 alternatives to a discrete color scale legend in ggplot2
Downloading the Census Household Pulse Survey in R
The Stata Guide
The Four Pipes of magrittr
Introducing {facetious} – alternate facets for ggplot2
Alternatives to Simple Color Legends in ggplot2
Top 3 Coding Best Practices from the Shiny Contest
Visualizing ordinal variables
Making Shiny apps mobile friendly
Climate circles
Elegant and informative maps with tmap
Exploring R² and regression variance with Euler/Venn diagrams
Exploring Pamela Jakiela’s simple TWFE diagnostics with R
The marginaleffects package for R
A lightweight data validation ecosystem with R, GitHub, and Slack
Create spatial square/hexagon grids and count points inside in R with sf
A daily updated JSON dataset of all the Open House London venues, events, and metadata
Animating Network Evolutions with gganimate
Beyond Bar and Box Plots
Causal Inference in R Workshop
Odds != Probability
How to visualize polls and results of the German election with Datawrapper
Irreproducibility in Machine Learning
tidybundestag
A collection of themes for RStudio
Shiny, Tableau, and PowerBI: Better Business Intelligence
Automate PowerPoint Production Using R
Estimating graph dimension with cross-validated eigenvalues
Understanding text size and resolution in ggplot2
Introduction to linear mixed models


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71 #72 #73 #74 #75 #76 #77

Potpourri: Statistics #77 (Excel)

XPlus
333 Excel Shortcuts for Windows and Mac
Excel VBA Introduction
101 Excel Functions you should know
How to Create a Dot Plot in Excel
How to Create a Fan Chart in Excel
How to Create a Non-Ribbon Sankey Diagram in Excel
How to Create a Horizontal Bar Graph With Endpoints In Excel
How to Create a Dumbbell Chart in Excel
How to Create a Lollipop Chart in Excel
How To Create a Waffle Fourfold Chart in Excel Using Conditional Formatting
How to Create a Bivariate Area Chart in Excel
How to Create a Range Bar Graph in Excel
How to Create a Fourfold Chart in Excel
How to Create a Bar Chart With Color Ranges in Excel
How to Create a Grid Map In Excel
How to Create a Unit Chart in Excel
How to Create a Scatterplot with Dynamic Reference Lines in Excel
How to Create a Barcode Plot in Excel
How to Create a Strip Plot in Excel
How to Create a Heatmap In Excel
How to Create a Grid Map With Circles In Excel
How to Create a Grid Map With Sparklines in Excel
How to Create a Density Scatterplot In Excel
How to Create a Bar Chart With Labels Above Bar in Excel
How to Create a Scatterplot Matrix In Excel
Tufte in Excel – The Bar Chart
Tufte in Excel – The Box Plot
Tufte in Excel – The Slopegraph
Tufte in Excel – The Dot-Dash-Plot
Tufte in Excel – Sparklines


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71 #72 #73 #74 #75 #76

Potpourri: Statistics #76

Introduction to Deep Learning — 170 Video Lectures from Adaptive Linear Neurons to Zero-shot Classification with Transformers
The Identification Zoo: Meanings of Identification in Econometrics
Why you sometimes need to break the rules in data viz
A Concrete Introduction to Probability (using Python)
R packages that make ggplot2 more beautiful (Vol. I)
R packages that make ggplot2 more powerful (Vol. II)
Etimating multilevel models for change in R
Static and dynamic network visualization with R
Open Source RStudio/Shiny on AWS Fargate
Functional PCA with R
When Graphs Are a Matter of Life and Death
Python Projects with Source Code
The Stata workflow guide
15 Tips to Customize lines in ggplot2 with element_line()
7 Tips to customize rectangle elements in ggplot2 element_rect()
8 tips to use element_blank() in ggplot2 theme
Introduction to Machine Learning Interviews Book
Introduction to Python for Social Science
21 Must-Read Data Visualization Books, According to Experts
Introduction to Modern Statistics
The Difference Between Random Factors and Random Effects
Creating a figure of map layers in R
Reasons to Use Tidymodels
Professional, Polished, Presentable: Making great slides with xaringan
Polished summary tables in R with gtsummary
Top 10 Ideas in Statistics That Have Powered the AI Revolution
The New Native Pipe Operator in R
RMarkdown Tips and Tricks
Iterative visualizations with ggplot2: no more copy-pasting
Scaling Models in Poltical Science
Setting up and debugging custom fonts
A Pirate’s Favorite Programming Language
Tired: PCA + kmeans, Wired: UMAP + GMM
Three simple ideas for better election poll graphics
Exploratory Functional PCA with Sparse Data
Efficient simulations in R
The Beginner’s Guide to the Modern Data Stack
How to become a better R code detective?
A short introduction to grammar of graphics (via ggplot2)
Workflows for querying databases via R
A Handbook for Teaching and Learning with R and RStudio
Writing reproducible manuscripts in R
ggpairs in R- A Brief Introduction to ggpairs


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71 #72 #73 #74 #75

Potpourri: Statistics #75

Introducing pewmethods: An R package for working with survey data
Exploring survey data with the pewmethods R package
Weighting survey data with the pewmethods R package
Analyzing international survey data with the pewmethods R package
autumn: Fast, Modern, and Tidy Raking
Data science for economists
Papers about Causal Inference and Language
Yale Applied Empirical Methods PHD Course
Spreadsheet Munging Strategies
Visual Vocabulary: Designing with data
What can we learn from a country’s diplomatic gifts?
Map, Walk, Pivot
The Epidemiologist R Handbook
Machine learning with {tidymodels}
Choose your own tidymodels adventure
Applied Spatial Statistics with R
ggplot: the placing and order of aesthetics matters
Introduction to Functional Data Analysis with R
Visualizing Distributions with Raincloud Plots with ggplot2
A Chat with Andrew on MLOps: From Model-centric to Data-centric AI
ISLR tidymodels Labs
Plotting maps with ggplot2
R instructions for our research projects
A gentle introduction to deep learning in R using Keras
Everything You Always Wanted to Know About ANOVA
Replication Materials for “The Flying Bomb and the Actuary” (Shaw and Shaw, 2019)
Colors and emotions in data visualization
Rookie R mistakes
10 Tips to Customize Text Color, Font, Size in ggplot2 with element_text()
Writing unit tests in R
The Good, the Bad and the Ugly: how to visualize Machine Learning data
A curated list of APIs, open data and ML/AI projects on climate change
R for SEO
Using Geospatial Data in R
Good Data Scientist, Bad Data Scientist
The Evolution of a ggplot (Ep. 1)
Do Wide and Deep Networks Learn the Same Things?


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71 #72 #73 #74

Potpourri: Statistics #74 (Python)

What the f*ck Python!
Notes On Using Data Science & Machine Learning To Fight For Something That Matters
Computational and Inferential Thinking: The Foundations of Data Science
data-science-ipython-notebooks
How to make an awesome Python package in 2021
Tutorial: Working with Large Data Sets using Pandas and JSON in Python
Data analysis with Python – Summer 2019
Introduction to Linear Algrebra for Applied Machine Learning with Python
Speeding Up Your Python Code!
Python for Non-Programmers
Web Scraping 101 with Python
Full Stack Python
R Markdown Python Engine
Introduction to Programming and Numerical Analysis


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71 #72 #73

Potpourri: Statistics #73

Which color scale to use when visualizing data
When to use quantitative and when to use qualitative color scales
When to use sequential and when to use diverging color scales
When to use classed and when to use unclassed color scales
Patterns, predictions, and actions: A story about machine learning
Principles for data analysis workflows
A Comprehensive Introduction to Command Line for R Users
Reading Data from Multiple Excel Sheets and Converting it to Individual Data Frames in R
Making a ggplot theme
Visualizing with Text
Why I love dplyr’s across
A Multiverse Analysis of Interaction Effects
3200+ searchable R articles and packages
Using Excel Templates as Tables in R Shiny
A Basic Checklist for Observational Studies in Political Science
Understanding p-values Through Simulations: An Interactive Visualization
One Way ANOVA with R
Driving Alone vs. Public Transportation in Pittsburgh
The Effect: An Introduction to Research Design and Causality
k-Means 101: An introductory guide to k-Means clustering in R
Bivariate dasymetric map
Introductory time-series forecasting with torch
Lightweight Machine Learning Classics with R
flextable gallery
Exploring other {ggplot2} geoms
Bayesian statistics with R


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71 #72

Potpourri: Statistics #72 (Monty Hall problem)

Monty Hall Simulations
Making the Monty Hall problem weirder but obvious
The Intuitive Monty Hall Problem
The psychology of the Monty Hall problem: Discovering psychological mechanisms for solving a tenacious brain teaser
The Collider Principle in Causal Reasoning: Why the Monty Hall Dilemma Is So Hard
Rationality, the Bayesian standpoint, and the Monty-Hall problem
Josh Miller’s alternative, more intuitive, formulation of Monty Hall problem
Monty Hall problem solved in tidyverse


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70 #71

Potpourri: Statistics #71

SDS 375/395 Data Visualization in R
Demystifying the coalesce function
Data Viz Bookmarks
Data Science: A First Introduction
Crime by the Numbers
The value of p
The Tidyverse in a Table
Sample Size Justification
Learn tidytext with my new learnr course
Using random effects in GAMs with mgcv
Public Policy Analytics: Code & Context for Data Science in Government
How to run 100 regressions without loops in R
Spreadsheet mistakes – news stories
Weights in statistics
Importing Multiple Files Quickly and Efficiently
Making Sense of Sensitivity: Extending Omitted Variable Bias
Microsoft365R: an R interface to the Microsoft 365 suite
fixest: Fast Fixed-Effects Estimations
Grab World Bank Data in R with {WDI}
Lists are my secret weapon for reporting stats with knitr
Building a team of internal R packages
Tidyverse Skills for Data Science in R
Practical Applications in R for Psychologists
Transform List into Dataframe with tidyr and purrr
Main terms and concepts in R
A complete guide to scales
Computational Thinking for Social Scientists
A Crash Course in Good and Bad Controls
Causal design patterns for data analysts
Modern Data Science with R
Generating SQL with {dbplyr} and sqlfluff
Hypothesis test by hand
How to Use Git/GitHub with R
Testing for normality
Scrape Hundreds of PDF Documents From the Web with R and rvest
Radial Patterns in ggplot2
a gRadual intRoduction to Shiny
Reading tables from images with magick
ggplot Wizardry Hands-On


Previous posts: #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14 #15 #16 #17 #18 #19 #20 #21 #22 #23 #24 #25 #26 #27 #28 #29 #30 #31 #32 #33 #34 #35 #36 #37 #38 #39 #40 #41 #42 #43 #44 #45 #46 #47 #48 #49 #50 #51 #52 #53 #54 #55 #56 #57 #58 #59 #60 #61 #62 #63 #64 #65 #66 #67 #68 #69 #70