Search recipe steps
Recipes
Find recipe steps in the tidymodels framework to help you prep your data for modeling.
To learn about the recipes package, see Get Started: Preprocess your data with recipes. Use the filter below to search for recipe steps across tidymodels packages.
recipes
step_harmonic
Add sin and cos terms for harmonic analysis
tdarec
step_vpd_algebraic_functions
Algebraic Functions Vectorization of Persistent Homology
recipes
step_relu
Apply (smoothed) rectified linear transformation
textrecipes
step_tokenize_bpe
BPE Tokenization of Character Variables
tdarec
step_vpd_betti_curve
Betti Curve Vectorization of Persistent Homology
timetk
step_box_cox
Box-Cox Transformation using Forecast Methods
recipes
step_BoxCox
Box-Cox transformation for non-negative data
textrecipes
step_lda
Calculate LDA Dimension Estimates of Tokens
sparseR
step_center_to
Centering numeric data to a value besides their mean
timetk
step_ts_clean
Clean Outliers and Missing Data for Time Series
kerasnip
step_collapse
Collapse Predictors into a single list-column
textrecipes
step_tokenmerge
Combine Multiple Token Variables Into One
tdarec
step_vpd_complex_polynomial
Complex Polynomial Vectorization of Persistent Homology
recipes
step_classdist_shrunken
Compute shrunken centroid distances for classification models
recipes
step_unorder
Convert ordered factors to unordered factors
recipes
step_ordinalscore
Convert ordinal factors to numeric scores
recipes
step_bin2factor
Create a factors from A dummy variable
recipes
step_profile
Create a profiling version of a data set
recipes
step_count
Create counts of patterns using regular expressions
actxps
step_expose
Create exposure records in a recipes step
recipes
step_indicate_na
Create missing data column indicators
tdarec
step_vpd_descriptive_statistics
Descriptive Statistics Vectorization of Persistent Homology
embed
step_discretize_cart
Discretize numeric variables with CART
embed
step_discretize_xgb
Discretize numeric variables with XgBoost
themis
step_downsample
Down-Sample a Data Set Based on a Factor Variable
tdarec
step_vpd_euler_characteristic_curve
Euler Characteristic Curve Vectorization of Persistent Homology
scimo
step_aggregate_list
Feature aggregation step based on a defined list
scimo
step_aggregate_hclust
Feature aggregation step based on a hierarchical clustering
scimo
step_rownormalize_tss
Feature normalization step using total sum scaling
scimo
step_select_kruskal
Feature selection step using Kruskal test
scimo
step_select_wilcoxon
Feature selection step using Wilcoxon test
scimo
step_select_background
Feature selection step using background level
scimo
step_select_cv
Feature selection step using the coefficient of variation
textrecipes
step_tokenfilter
Filter Tokens Based on Term Frequency
textrecipes
step_stopwords
Filtering of Stop Words for Tokens Variables
timetk
step_fourier
Fourier Features for Modeling Seasonality
recipes
step_poly_bernstein
Generalized bernstein polynomial basis
recipes
step_dummy_multi_choice
Handle levels in multiple predictors together
timetk
step_holiday_signature
Holiday Feature (Signature) Generator
recipes
step_impute_mode
Impute nominal data using the most common value
recipes
step_impute_lower
Impute numeric data below the threshold of measurement
recipes
step_impute_roll
Impute numeric data using a rolling window statistic
recipes
step_impute_median
Impute numeric data using the median
recipes
step_impute_linear
Impute numeric variables via a linear model
textrecipes
step_dummy_hash
Indicator Variables via Feature Hashing
MachineShop
step_kmedoids
K-Medoids Clustering Variable Selection
embed
step_lencode
Likelihood encoding using analytical formula
timetk
step_log_interval
Log Interval Transformation for Constrained Interval Forecasting
timetk
step_ts_impute
Missing Data Imputation for Time Series
recipes
step_nnmf_sparse
Non-negative matrix factorization signal extraction with lasso penalization
textrecipes
step_text_normalization
Normalization of Character Variables
tdarec
step_vpd_normalized_life_curve
Normalized Life Curve Vectorization of Persistent Homology
bestNormalize
step_orderNorm
ORQ normalization (orderNorm) for recipes implementation
OptimalBinningWoE
step_obwoe
Optimal Binning and WoE Transformation Step
timetk
step_ts_pad
Pad: Add rows to fill gaps and go from low to high frequency
textrecipes
step_pos_filter
Part of Speech Filtering of Token Variables
extrasteps
step_encoding_binary
Perform binary encoding of factor variables
tdarec
step_vpd_persistence_block
Persistence Block Vectorization of Persistent Homology
tdarec
step_vpd_persistence_image
Persistence Image Vectorization of Persistent Homology
tdarec
step_vpd_persistence_landscape
Persistence Landscape Vectorization of Persistent Homology
tdarec
step_vpd_persistence_silhouette
Persistence Silhouette Vectorization of Persistent Homology
tdarec
step_vpd_persistent_entropy_summary
Persistent Entropy Summary Vectorization of Persistent Homology
tdarec
step_pd_raster
Persistent homology of raster data (images)
recipes
step_kpca_poly
Polynomial kernel PCA signal extraction
textrecipes
step_sequence_onehot
Positional One-Hot encoding of Tokens
textrecipes
step_word_embeddings
Pretrained Word Embeddings of Tokens
recipes
step_kpca_rbf
Radial basis function kernel PCA signal extraction
healthyR.ai
step_hai_scale_zero_one
Recipes Data Scale to Zero and One
healthyR.ai
step_hai_fourier_discrete
Recipes Step Fourier Discrete Generator
healthyR.ai
step_hai_winsorized_move
Recipes Step Winsorized Move Generator
healthyR.ai
step_hai_winsorized_truncate
Recipes Step Winsorized Truncate Generator
healthyR.ts
step_ts_acceleration
Recipes Time Series Acceleration Generator
healthyR.ts
step_ts_velocity
Recipes Time Series velocity Generator
bestNormalize
step_best_normalize
Run bestNormalize transformation for recipes implementation
bestNormalize
step_bestNormalize
Run bestNormalize transformation for recipes implementation
bestNormalize
step_bestNormalize_new
Run bestNormalize transformation for recipes implementation
recipes
step_range
Scaling numeric data to a specific range
textrecipes
step_tokenize_sentencepiece
Sentencepiece Tokenization of Character Variables
tdarec
step_pd_degree
Separate persistent pairs by homological degree
recipes
step_novel
Simple value assignments for novel factor levels
timetk
step_slidify_augment
Slidify Rolling Window Transformation (Augmented Version)
embed
step_pca_sparse_bayes
Sparse Bayesian PCA Signal Extraction
MachineShop
step_spca
Sparse Principal Components Analysis Variable Reduction
embed
step_collapse_cart
Supervised Collapsing of Factor Levels
embed
step_lencode_bayes
Supervised Factor Conversions into Linear Functions using Bayesian Likelihood Encodings
embed
step_lencode_mixed
Supervised Factor Conversions into Linear Functions using Bayesian Likelihood Encodings
embed
step_lencode_glm
Supervised Factor Conversions into Linear Functions using Likelihood Encodings
important
step_predictor_retain
Supervised Feature Selection via A Single Filter
important
step_predictor_best
Supervised Feature Selection via Choosing the Top Predictors
important
step_predictor_desirability
Supervised Multivariate Feature Selection via Desirability Functions
embed
step_umap
Supervised and unsupervised uniform manifold approximation and projection (UMAP)
tdarec
step_vpd_tent_template_functions
Tent Template Functions Vectorization of Persistent Homology
textrecipes
step_tfidf
Term Frequency-Inverse Document Frequency of Tokens
collinear
step_collinear
Tidymodels recipe step for multicollinearity filtering
timetk
step_timeseries_signature
Time Series Feature (Signature) Generator
extrasteps
step_date_after
Time after Recurrent Date Time Event
extrasteps
step_date_before
Time before Recurrent Date Time Event
extrasteps
step_date_nearest
Time to Nearest Recurrent Date Time Event
tdarec
step_vpd_tropical_coordinates
Tropical Coordinates Vectorization of Persistent Homology
themis
step_upsample
Up-Sample a Data Set Based on a Factor Variable
textrecipes
step_tokenize_wordpiece
Wordpiece Tokenization of Character Variables
embed
step_collapse_stringdist
collapse factor levels using stringdist
No matching items