
Train Your Regression and Classification Models

Welcome to the ML Clever model training portal—your one-stop solution to build robust regression and classification models without writing any code. Whether you prefer fine-tuning parameters manually or leveraging our intelligent AutoML engine, this guide will help you turn your preprocessed data into high-performance models.

Classification Models

Powerful predictive modeling algorithms for your no-code machine learning applications. Select from industry-standard classification techniques with sensible default configurations.

Machine Learning Made Simple

Our no-code platform leverages the power of scikit-learn, TensorFlow, and PyTorch to provide state-of-the-art predictive modeling capabilities with minimal setup. Each model is carefully documented with implementation guidance, parameter configurations, and performance characteristics to help you choose the right tool for your analytical needs without writing a single line of code.

Available Classification Models

When to use Random Forest Classifier: ideal for handling mixed data types, capturing non-linear patterns, and providing robust performance with minimal tuning.
Limitations of Random Forest Classifier: can be computationally intensive and lacks the interpretability of simpler models.

Statistical Foundation

Random Forest Classifier is an ensemble method: it trains many decision trees on bootstrapped samples with random feature selection at each split and combines them by majority vote, which reduces variance relative to any single tree.

Computational Efficiency

Training cost grows roughly linearly with the number of trees and close to n log n in the number of samples per tree; trees are independent, so fitting and prediction parallelize well across CPU cores.

Data Requirements

Handles continuous and categorical features (after encoding) without requiring normalization, and is robust to outliers and to moderately correlated features.
Configure these parameters to optimize the Random Forest Classifier for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
n_estimators | Integer | 100 | Number of trees in the forest.
max_depth | Integer | None | Maximum depth of the tree.
min_samples_split | Integer | 2 | Minimum number of samples required to split an internal node.
min_samples_leaf | Integer | 1 | Minimum number of samples required to be at a leaf node.
max_features | String | sqrt | Number of features to consider when looking for the best split.
bootstrap | Boolean | True | Whether bootstrap samples are used when building trees.
oob_score | Boolean | False | Whether to use out-of-bag samples to estimate generalization accuracy.
class_weight | String or Dict | None | Weights associated with classes in the form {class_label: weight}.
criterion | String | gini | Function to measure the quality of a split.
random_state | Integer | None | Controls the randomness of the estimator.
ccp_alpha | Float | 0.0 | Complexity parameter used for Minimal Cost-Complexity Pruning.
max_leaf_nodes | Integer | None | Grow trees with max_leaf_nodes in best-first fashion.
max_samples | Float or Integer | None | Number of samples to draw from X to train each base estimator.
min_impurity_decrease | Float | 0.0 | A node will be split if this split induces a decrease of the impurity greater than or equal to this value.
min_weight_fraction_leaf | Float | 0.0 | Minimum weighted fraction of the sum total of weights required to be at a leaf node.
monotonic_cst | Array of Integers | None | Constraint to enforce monotonicity in the predictions with respect to certain features.
n_jobs | Integer | None | Number of jobs to run in parallel.
verbose | Integer | 0 | Controls the verbosity when fitting and predicting.
warm_start | Boolean | False | When True, reuse the solution of the previous call to fit and add more estimators to the ensemble.
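For readers who want to see how these settings translate into code, here is a minimal sketch assuming the scikit-learn backend mentioned above (RandomForestClassifier); the dataset and the specific values are illustrative only, and on the platform the same parameters are set through the interface rather than in code:

```python
# Illustrative sketch only: assumes scikit-learn's RandomForestClassifier
# is the underlying implementation; values are examples, not recommendations.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=42)

model = RandomForestClassifier(
    n_estimators=200,         # more trees improve accuracy with diminishing returns
    max_depth=10,             # cap tree depth to limit overfitting
    min_samples_leaf=2,       # require at least 2 samples per leaf
    max_features="sqrt",      # consider sqrt(n_features) at each split
    class_weight="balanced",  # reweight classes for imbalanced targets
    random_state=42,          # fixed seed for reproducibility
    n_jobs=-1,                # use all CPU cores
)
model.fit(X, y)
print("Training accuracy:", model.score(X, y))
```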

Detailed Parameter Reference

Number of trees in the forest.
Technical Implementation Guidance
100-300 for most datasets, higher for more complex problems
Effect on Model Performance
More trees generally lead to better accuracy but with diminishing returns and increased computation time
Valid Numerical Range
10-500
Maximum depth of the tree.
Technical Implementation Guidance
None for unrestricted growth, or 5-15 to control model complexity
Effect on Model Performance
Controls how deep trees can grow; deeper trees capture more specific patterns but risk overfitting
Valid Numerical Range
1-50
Minimum number of samples required to split an internal node.
Technical Implementation Guidance
2 for maximum model growth, 5-10 to control overfitting
Effect on Model Performance
Higher values prevent creating nodes that represent very few samples, reducing overfitting
Valid Numerical Range
2-20
Minimum number of samples required to be at a leaf node.
Technical Implementation Guidance
1-5 for most cases, higher values (10+) for noisier datasets
Effect on Model Performance
Controls the minimum size of terminal nodes; higher values create more generalized models
Valid Numerical Range
1-50
Number of features to consider when looking for the best split.
Technical Implementation Guidance
sqrt for classification problems, log2 for very high-dimensional data
Effect on Model Performance
Introduces randomness by limiting features considered at each split; helps prevent trees from becoming too correlated
Valid Numerical Range
auto, sqrt, log2, or specific integer/float
Whether bootstrap samples are used when building trees.
Technical Implementation Guidance
True for standard random forest implementation
Effect on Model Performance
When True, trees are built with bootstrapped samples; when False, the whole dataset is used for each tree
Valid Numerical Range
True/False
Whether to use out-of-bag samples to estimate generalization accuracy.
Technical Implementation Guidance
True when you want built-in cross-validation metrics (see the sketch at the end of this parameter reference)
Effect on Model Performance
Provides an unbiased estimate of model performance without requiring a separate validation set
Valid Numerical Range
True/False
Weights associated with classes in the form {class_label: weight}.
Technical Implementation Guidance
balanced for imbalanced datasets to adjust weights inversely proportional to class frequencies
Effect on Model Performance
Helps model perform better on imbalanced datasets by penalizing misclassifications of minority class more heavily
Valid Numerical Range
None, balanced, balanced_subsample, or custom dictionary
Function to measure the quality of a split.
Technical Implementation Guidance
gini for faster computation, entropy for slightly different split behavior
Effect on Model Performance
Gini impurity tends to isolate the most frequent class in its own leaf, while entropy might create more balanced trees
Valid Numerical Range
gini, entropy
Controls the randomness of the estimator.
Technical Implementation Guidance
Any fixed value for reproducibility
Effect on Model Performance
Setting a constant seed ensures reproducible results across model runs
Valid Numerical Range
Any integer value
Complexity parameter used for Minimal Cost-Complexity Pruning.
Technical Implementation Guidance
0.0 for no pruning, or small positive values for effective pruning
Effect on Model Performance
Helps control the complexity of the model by penalizing overly complex trees
Valid Numerical Range
>=0
Grow trees with max_leaf_nodes in best-first fashion.
Technical Implementation Guidance
None for unlimited nodes, or a positive integer to limit growth
Effect on Model Performance
Restricts the maximum number of terminal nodes, which can help in reducing overfitting
Valid Numerical Range
None or positive integer
Number of samples to draw from X to train each base estimator.
Technical Implementation Guidance
None for using the full dataset, or a fraction/number for sub-sampling
Effect on Model Performance
Can speed up training and introduce additional randomness when less than the full dataset is used
Valid Numerical Range
None or positive number
A node will be split if this split induces a decrease of the impurity greater than or equal to this value.
Technical Implementation Guidance
0.0 for no threshold, or a small positive value to avoid insignificant splits
Effect on Model Performance
Prevents splits that do not significantly reduce impurity, helping to control overfitting
Valid Numerical Range
>=0
Minimum weighted fraction of the sum total of weights required to be at a leaf node.
Technical Implementation Guidance
0.0 for standard cases, higher for datasets with weighted samples
Effect on Model Performance
Ensures that leaves represent a minimum proportion of the total sample weight, useful in imbalanced datasets
Valid Numerical Range
0.0-0.5
Constraint to enforce monotonicity in the predictions with respect to certain features.
Technical Implementation Guidance
None unless specific monotonic constraints are needed
Effect on Model Performance
Enforces monotonic relationships, which can be important for regulatory or business requirements
Valid Numerical Range
None or list of integers
Number of jobs to run in parallel.
Technical Implementation Guidance
Use -1 to utilize all processors
Effect on Model Performance
Speeds up training by parallelizing computations
Valid Numerical Range
Any integer value or None
Controls the verbosity when fitting and predicting.
Technical Implementation Guidance
0 for silent mode, higher values for more detailed logs
Effect on Model Performance
Helps monitor training progress and debug issues
Valid Numerical Range
Non-negative integers
When True, reuse the solution of the previous call to fit and add more estimators to the ensemble.
Technical Implementation Guidance
False unless you plan to incrementally add trees
Effect on Model Performance
Allows for continued training of an existing model without starting over
Valid Numerical Range
True/False
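As a closing example for this model, here is a hedged sketch of the oob_score option described above, again assuming a scikit-learn backend; the out-of-bag estimate gives a rough generalization score without a separate validation split:

```python
# Illustrative sketch: out-of-bag (OOB) evaluation with RandomForestClassifier.
# Assumes scikit-learn; oob_score=True requires bootstrap=True (the default).
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

forest = RandomForestClassifier(
    n_estimators=300,
    oob_score=True,   # score each sample with trees that did not see it
    random_state=0,
    n_jobs=-1,
)
forest.fit(X, y)
print(f"OOB accuracy estimate: {forest.oob_score_:.3f}")
```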
When to use Logistic Regression: ideal for binary classification problems with a linear decision boundary and when interpretability is a priority.
Limitations of Logistic Regression: may underperform on complex datasets that require modeling non-linear relationships.

Statistical Foundation

Logistic Regression models the log-odds of class membership as a linear function of the input features and fits its coefficients by maximizing the (optionally regularized) likelihood.

Computational Efficiency

Training complexity scales with input dimensionality and sample size, with efficient matrix operations for production deployment.

Data Requirements

Requires normalized, non-collinear input features for optimal performance. Handles continuous and categorical variables with appropriate preprocessing.
Configure these parameters to optimize the Logistic Regression for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
penalty | String | l2 | Used to specify the norm used in the penalization.
C | Float | 1.0 | Inverse of regularization strength; must be a positive float.
class_weight | String or Dict | None | Weights associated with classes in the form {class_label: weight}.
dual | Boolean | False | Dual or primal formulation. Dual formulation is only implemented for l2 penalty.
fit_intercept | Boolean | True | Specifies if a constant (a.k.a. bias or intercept) should be added to the decision function.
intercept_scaling | Float | 1.0 | Useful only when the solver 'liblinear' is used and fit_intercept is set to True.
l1_ratio | Float or None | None | The Elastic-Net mixing parameter, with 0 <= l1_ratio <= 1. Only used if penalty is 'elasticnet'.
max_iter | Integer | 1000 | Maximum number of iterations taken for the solvers to converge.
multi_class | String | auto | If 'ovr', a binary problem is fit for each label; otherwise, a multinomial loss is minimized.
n_jobs | Integer | None | Number of CPU cores used when parallelizing over classes.
random_state | Integer | None | Controls the randomness of the estimator.
solver | String | liblinear | Algorithm to use in the optimization problem.
tol | Float | 0.0001 | Tolerance for stopping criteria.
verbose | Integer | 0 | For the liblinear and lbfgs solvers, set verbose to any positive number for increased logging.
warm_start | Boolean | False | When set to True, reuse the solution of the previous call to fit as initialization; otherwise, start fresh.
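As with the other models, a minimal sketch follows, assuming a scikit-learn backend (LogisticRegression); the scaling step and the chosen values are illustrative:

```python
# Illustrative sketch: LogisticRegression behind a scaling step.
# Assumes scikit-learn; values are examples, not tuned recommendations.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=500, n_features=8, random_state=0)

clf = make_pipeline(
    StandardScaler(),              # solvers converge faster on scaled features
    LogisticRegression(
        penalty="l2",
        C=1.0,                     # lower C = stronger regularization
        solver="liblinear",        # matches the default listed above
        max_iter=1000,
        class_weight="balanced",   # helpful for imbalanced classes
        random_state=0,
    ),
)
clf.fit(X, y)
print("Training accuracy:", clf.score(X, y))
```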

Detailed Parameter Reference

Used to specify the norm used in the penalization.
Technical Implementation Guidance
l2 is common; use l1 for sparsity, elasticnet for a balance, or none to disable regularization
Effect on Model Performance
Regularization controls overfitting; the choice of penalty affects sparsity and model performance
Valid Numerical Range
Options: l1, l2, elasticnet, none
Inverse of regularization strength; must be a positive float.
Technical Implementation Guidance
Typically between 0.1 and 10.0, depending on dataset complexity
Effect on Model Performance
Lower values imply stronger regularization, helping to prevent overfitting
Valid Numerical Range
0.1-100.0
Weights associated with classes in the form {class_label: weight}.
Technical Implementation Guidance
Use 'balanced' for imbalanced datasets to adjust class weights inversely proportional to frequencies
Effect on Model Performance
Helps the model adjust its focus towards minority classes, improving performance on imbalanced data
Valid Numerical Range
None, balanced, or custom dictionary
Dual or primal formulation. Dual formulation is only implemented for l2 penalty.
Technical Implementation Guidance
False for most cases
Effect on Model Performance
Dual formulation can be beneficial for high-dimensional data, but is generally less efficient
Valid Numerical Range
True/False
Specifies if a constant (a.k.a. bias or intercept) should be added to the decision function.
Technical Implementation Guidance
True
Effect on Model Performance
Ensures the decision boundary is not forced to pass through the origin
Valid Numerical Range
True/False
Useful only when the solver 'liblinear' is used and fit_intercept is set to True.
Technical Implementation Guidance
1.0
Effect on Model Performance
Scales the intercept term when using the liblinear solver
Valid Numerical Range
Any positive float
The Elastic-Net mixing parameter, with 0 <= l1_ratio <= 1. Only used if penalty is 'elasticnet'.
Technical Implementation Guidance
Typically between 0.1 and 0.5 when using elasticnet
Effect on Model Performance
Balances the L1 and L2 regularization in elasticnet, affecting model sparsity and performance
Valid Numerical Range
0-1 or None
Maximum number of iterations taken for the solvers to converge.
Technical Implementation Guidance
1000 is standard; increase if convergence warnings occur
Effect on Model Performance
Ensures that the solver has enough iterations to reach convergence
Valid Numerical Range
Positive integers
If 'ovr', a binary problem is fit for each label; otherwise, a multinomial loss is minimized.
Technical Implementation Guidance
auto for automatic handling, 'ovr' for one-vs-rest, or 'multinomial' for multi-class optimization
Effect on Model Performance
Determines the strategy used for multi-class classification, affecting model performance and computational cost
Valid Numerical Range
Options: auto, ovr, multinomial
Number of CPU cores used when parallelizing over classes.
Technical Implementation Guidance
Set to -1 to use all available cores
Effect on Model Performance
Speeds up computation by leveraging parallel processing
Valid Numerical Range
Any integer or None
Controls the randomness of the estimator.
Technical Implementation Guidance
Any fixed integer value to ensure reproducibility
Effect on Model Performance
Ensures consistent results across multiple runs
Valid Numerical Range
Any integer value
Algorithm to use in the optimization problem.
Technical Implementation Guidance
liblinear is common for small datasets; lbfgs or saga for larger ones
Effect on Model Performance
Different solvers offer varying trade-offs between speed and accuracy
Valid Numerical Range
Options: newton-cg, lbfgs, liblinear, sag, saga
Tolerance for stopping criteria.
Technical Implementation Guidance
0.0001 is standard, but may be adjusted for convergence issues
Effect on Model Performance
Determines the threshold at which the optimization stops, affecting convergence
Valid Numerical Range
Any positive float
For the liblinear and lbfgs solvers, set verbose to any positive number for increased logging.
Technical Implementation Guidance
0 for no verbosity, increase for debugging purposes
Effect on Model Performance
Controls the amount of logging output during model training
Valid Numerical Range
Non-negative integers
When set to True, reuse the solution of the previous call to fit as initialization, otherwise, start fresh.
Technical Implementation Guidance
False unless incremental training is required
Effect on Model Performance
Can save computation if multiple fits are performed sequentially
Valid Numerical Range
True/False
When to use KNN: ideal for instance-based learning where similar instances drive the prediction, especially on small-to-medium datasets.
Limitations of KNN: can be computationally expensive with large datasets and is sensitive to feature scaling and noisy data.

Statistical Foundation

KNN is a non-parametric, instance-based method: it stores the training data and classifies a new point by an (optionally distance-weighted) majority vote among its k nearest neighbors.

Computational Efficiency

Training amounts to storing the data; the cost is paid at prediction time, where the neighbor search grows with the number of stored samples and the dimensionality (BallTree or KDTree structures speed this up for lower-dimensional data).

Data Requirements

Requires scaled (standardized or normalized) features, since predictions are driven by distances; categorical variables must be encoded numerically.
Configure these parameters to optimize the KNN for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
n_neighbors | Integer | 5 | Number of neighbors to use.
weights | String | uniform | Weight function used in prediction.
algorithm | String | auto | Algorithm used to compute the nearest neighbors.
leaf_size | Integer | 30 | Leaf size passed to BallTree or KDTree; affects the speed of construction and query, as well as memory usage.
metric | String | minkowski | The distance metric to use for the tree.
metric_params | Dict | None | Additional keyword arguments for the metric function.
n_jobs | Integer | None | Number of parallel jobs to run for neighbors search.
p | Integer | 2 | Power parameter for the Minkowski metric.
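A minimal sketch, assuming scikit-learn's KNeighborsClassifier; the scaling step matters because KNN is distance-based, and the parameter values are illustrative:

```python
# Illustrative sketch: KNeighborsClassifier with standardized features.
# Assumes scikit-learn; values are examples only.
from sklearn.datasets import make_classification
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=300, n_features=6, random_state=0)

knn = make_pipeline(
    StandardScaler(),             # distances are meaningless on unscaled features
    KNeighborsClassifier(
        n_neighbors=5,
        weights="distance",       # closer neighbors get more influence
        metric="minkowski",
        p=2,                      # p=2 corresponds to Euclidean distance
        n_jobs=-1,
    ),
)
knn.fit(X, y)
print("Training accuracy:", knn.score(X, y))
```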

Detailed Parameter Reference

Number of neighbors to use.
Technical Implementation Guidance
Typically between 3 and 10 depending on the dataset's density
Effect on Model Performance
Determines how many nearest data points influence the prediction; too few may be noisy, too many may smooth out class distinctions
Valid Numerical Range
1-20
Weight function used in prediction.
Technical Implementation Guidance
Use 'uniform' for equal weighting or 'distance' to give closer neighbors more influence
Effect on Model Performance
Affects how neighbor distances contribute to the decision, influencing model sensitivity and performance
Valid Numerical Range
Options: uniform, distance
Algorithm used to compute the nearest neighbors.
Technical Implementation Guidance
Set to 'auto' to let the model choose the optimal algorithm based on data characteristics
Effect on Model Performance
Determines the efficiency and performance of the neighbor search, with some algorithms better for high-dimensional data
Valid Numerical Range
Options: auto, ball_tree, kd_tree, brute
Leaf size passed to BallTree or KDTree; affects the speed of construction and query, as well as memory usage.
Technical Implementation Guidance
30 is standard; adjust based on available memory and desired query speed
Effect on Model Performance
Impacts the trade-off between tree construction speed and query performance
Valid Numerical Range
Any positive integer
The distance metric to use for the tree.
Technical Implementation Guidance
Use 'minkowski' for general purposes; 'euclidean' or 'manhattan' if the specific distance type is required
Effect on Model Performance
Different metrics can capture various notions of similarity, affecting neighbor selection and prediction accuracy
Valid Numerical Range
Options: euclidean, manhattan, chebyshev, minkowski
Additional keyword arguments for the metric function.
Technical Implementation Guidance
None unless custom adjustments to the metric are needed
Effect on Model Performance
Allows fine-tuning of the distance metric's behavior to better suit specific datasets
Valid Numerical Range
Dictionary or None
Number of parallel jobs to run for neighbors search.
Technical Implementation Guidance
Set to -1 to utilize all available CPU cores for faster computation
Effect on Model Performance
Enables parallel processing to speed up the neighbor search, especially useful for larger datasets
Valid Numerical Range
Any integer or None
Power parameter for the Minkowski metric.
Technical Implementation Guidance
2 for Euclidean distance; 1 for Manhattan distance
Effect on Model Performance
Determines the type of distance calculation; different values change the metric's sensitivity to feature differences
Valid Numerical Range
Any positive integer
When to use SVM: ideal for datasets with clear margins of separation and for non-linear classification tasks.
Limitations of SVM: can be computationally expensive for large datasets and sensitive to the choice of kernel.

Statistical Foundation

SVM finds the maximum-margin decision boundary by minimizing a regularized hinge loss; kernel functions let it model non-linear boundaries without explicitly transforming the features.

Computational Efficiency

Training cost grows quickly with the number of samples (roughly quadratic to cubic for kernel SVMs), so large datasets can be slow to fit; prediction cost depends on the number of support vectors.

Data Requirements

Requires normalized, non-collinear input features for optimal performance. Handles continuous and categorical variables with appropriate preprocessing.
Configure these parameters to optimize the SVM for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
kernel | String | rbf | Specifies the kernel type to be used in the algorithm.
C | Float | 1.0 | Regularization parameter.
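A minimal sketch, assuming scikit-learn's SVC; feature scaling is included because SVMs are sensitive to feature magnitudes, and the values are illustrative:

```python
# Illustrative sketch: SVC with an RBF kernel on standardized features.
# Assumes scikit-learn; values are examples only.
from sklearn.datasets import make_classification
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, n_features=6, random_state=0)

svm = make_pipeline(
    StandardScaler(),              # SVMs assume comparably scaled features
    SVC(kernel="rbf", C=1.0),      # rbf for non-linear boundaries; larger C fits harder
)
svm.fit(X, y)
print("Training accuracy:", svm.score(X, y))
```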

Detailed Parameter Reference

Specifies the kernel type to be used in the algorithm.
Technical Implementation Guidance
rbf is common for non-linear data; use linear for linearly separable data or poly for polynomial relationships
Effect on Model Performance
The kernel choice determines how data is transformed, directly influencing the ability to capture complex, non-linear patterns
Valid Numerical Range
Options: linear, poly, rbf, sigmoid
Regularization parameter.
Technical Implementation Guidance
Typically 1.0; adjust based on the trade-off between margin width and classification error
Effect on Model Performance
Controls the trade-off between maximizing the margin and minimizing misclassification, with lower values providing stronger regularization
Valid Numerical Range
0.1-100.0
When to use Decision Tree: best for scenarios where interpretability is key and the dataset is manageable in size.
Limitations of Decision Tree: prone to overfitting without proper pruning and can be sensitive to small variations in the data.

Statistical Foundation

Decision Tree recursively partitions the feature space, choosing at each node the split that most reduces impurity (Gini or entropy); predictions are made by the majority class in each leaf.

Computational Efficiency

Training complexity scales with input dimensionality and sample size, with efficient matrix operations for production deployment.

Data Requirements

Handles continuous and categorical features (after encoding) without requiring normalization, and is insensitive to monotonic transformations of the inputs.
Configure these parameters to optimize the Decision Tree for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
criterion | String | gini | Function to measure the quality of a split.
splitter | String | best | Strategy used to choose the split at each node.
max_depth | Integer | None | The maximum depth of the tree.
min_samples_split | Integer | 2 | The minimum number of samples required to split an internal node.
min_samples_leaf | Integer | 1 | The minimum number of samples required to be at a leaf node.
min_weight_fraction_leaf | Float | 0.0 | The minimum weighted fraction of the sum total of weights required to be at a leaf node.
max_features | String | None | The number of features to consider when looking for the best split.
max_leaf_nodes | Integer | None | Grow trees with max_leaf_nodes in best-first fashion.
min_impurity_decrease | Float | 0.0 | A node will be split if this split induces a decrease of the impurity greater than or equal to this value.
class_weight | String or Dict | None | Weights associated with classes in the form {class_label: weight}.
ccp_alpha | Float | 0.0 | Complexity parameter used for Minimal Cost-Complexity Pruning.
monotonic_cst | Array of Integers | None | Constraint to enforce monotonicity in the predictions with respect to certain features.
random_state | Integer | None | Controls the randomness of the estimator.
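A minimal sketch, assuming scikit-learn's DecisionTreeClassifier; the depth, leaf, and pruning values are illustrative:

```python
# Illustrative sketch: DecisionTreeClassifier with light pruning.
# Assumes scikit-learn; values are examples only.
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=400, n_features=8, random_state=0)

tree = DecisionTreeClassifier(
    criterion="gini",
    max_depth=8,           # limit depth to curb overfitting
    min_samples_leaf=5,    # avoid leaves that memorize single samples
    ccp_alpha=0.001,       # light cost-complexity pruning
    random_state=0,
)
tree.fit(X, y)
print("Depth:", tree.get_depth(), "Training accuracy:", tree.score(X, y))
```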

Detailed Parameter Reference

Function to measure the quality of a split.
Technical Implementation Guidance
Use 'gini' for faster computation or 'entropy' for potentially better splits.
Effect on Model Performance
Determines the measure of impurity, directly affecting the quality of splits in the tree.
Valid Numerical Range
Options: gini, entropy
Strategy used to choose the split at each node.
Technical Implementation Guidance
Use 'best' for optimal splits or 'random' to introduce randomness and possibly improve generalization.
Effect on Model Performance
Influences the tree structure; 'best' yields deterministic splits while 'random' may reduce overfitting.
Valid Numerical Range
Options: best, random
The maximum depth of the tree.
Technical Implementation Guidance
None for full growth or a value between 5 and 15 to prevent overfitting.
Effect on Model Performance
Controls the complexity of the tree; deeper trees can capture more nuances but may overfit.
Valid Numerical Range
1-50 or None
The minimum number of samples required to split an internal node.
Technical Implementation Guidance
2 for maximum growth; higher values (5-10) to reduce overfitting.
Effect on Model Performance
Ensures that splits are statistically significant, reducing the chance of overfitting on noise.
Valid Numerical Range
2-20
The minimum number of samples required to be at a leaf node.
Technical Implementation Guidance
Typically between 1 and 5 to ensure leaves are not too specific.
Effect on Model Performance
Helps to smooth the model by ensuring that each leaf node contains a minimum number of samples.
Valid Numerical Range
1-50
The minimum weighted fraction of the sum total of weights required to be at a leaf node.
Technical Implementation Guidance
0.0 for standard datasets; adjust for datasets with weighted samples.
Effect on Model Performance
Prevents the creation of leaves that do not represent a significant portion of the total weight.
Valid Numerical Range
0.0-0.5
The number of features to consider when looking for the best split.
Technical Implementation Guidance
None for using all features; consider 'sqrt' or 'log2' for high-dimensional data.
Effect on Model Performance
Limits the number of features, which can reduce overfitting and improve computational efficiency.
Valid Numerical Range
Options: auto, sqrt, log2, None
Grow trees with max_leaf_nodes in best-first fashion.
Technical Implementation Guidance
None for no limit; set a positive integer to restrict the tree size and reduce overfitting.
Effect on Model Performance
Restricts the number of terminal nodes, controlling the complexity of the model.
Valid Numerical Range
None or positive integer
A node will be split if this split induces a decrease of the impurity greater than or equal to this value.
Technical Implementation Guidance
0.0 for standard splitting; increase slightly to avoid insignificant splits.
Effect on Model Performance
Prevents splits that do not significantly reduce impurity, thereby controlling overfitting.
Valid Numerical Range
>=0
Weights associated with classes in the form {class_label: weight}.
Technical Implementation Guidance
Use 'balanced' to automatically adjust weights for imbalanced datasets.
Effect on Model Performance
Helps the model focus on minority classes by assigning higher weights to underrepresented classes.
Valid Numerical Range
None, balanced, or custom dictionary
Complexity parameter used for Minimal Cost-Complexity Pruning.
Technical Implementation Guidance
0.0 for no pruning; use small positive values to effectively prune the tree.
Effect on Model Performance
Controls the trade-off between tree complexity and training accuracy, helping to prevent overfitting.
Valid Numerical Range
>=0
Constraint to enforce monotonicity in the predictions with respect to certain features.
Technical Implementation Guidance
None unless monotonicity is a required constraint for the application.
Effect on Model Performance
Ensures that predictions change monotonically with certain features, which is critical for some regulatory or business cases.
Valid Numerical Range
None or list of integers
Controls the randomness of the estimator.
Technical Implementation Guidance
Set to a fixed integer to ensure reproducibility of results.
Effect on Model Performance
Ensures that the results are consistent across different runs by controlling random elements in the model.
Valid Numerical Range
Any integer or None
When to use Gradient Boosting: ideal for complex datasets where boosting can iteratively improve weak learners to achieve high accuracy.
Limitations of Gradient Boosting: can be sensitive to noisy data and outliers; training may be time-consuming with many boosting stages.

Statistical Foundation

Gradient Boosting builds an ensemble sequentially, fitting each new tree to the gradient of the loss (log loss for classification) with respect to the current predictions, so each stage corrects the errors of the previous ones.

Computational Efficiency

Training complexity scales with input dimensionality and sample size, with efficient matrix operations for production deployment.

Data Requirements

Handles continuous and categorical features (after encoding) without requiring normalization; noisy labels and outliers deserve attention because boosting can amplify their influence.
Configure these parameters to optimize the Gradient Boosting for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
n_estimators | Integer | 100 | Number of boosting stages to perform.
learning_rate | Float | 0.1 | Learning rate shrinks the contribution of each tree by the learning_rate.
loss | String | deviance | Loss function to be optimized.
subsample | Float | 1.0 | The fraction of samples used for fitting the individual base learners.
criterion | String | friedman_mse | The function to measure the quality of a split.
min_samples_split | Integer | 2 | The minimum number of samples required to split an internal node.
min_samples_leaf | Integer | 1 | The minimum number of samples required to be at a leaf node.
min_weight_fraction_leaf | Float | 0.0 | The minimum weighted fraction of the sum total of weights required to be at a leaf node.
max_depth | Integer | 3 | The maximum depth of the individual regression estimators.
min_impurity_decrease | Float | 0.0 | A node will be split if this split induces a decrease of the impurity greater than or equal to this value.
init | String or Estimator | None | An estimator object that is used to compute the initial predictions.
random_state | Integer | None | Controls the randomness of the estimator.
max_features | String or Integer | None | The number of features to consider when looking for the best split.
verbose | Integer | 0 | Enable verbose output.
max_leaf_nodes | Integer | None | Grow trees with max_leaf_nodes in best-first fashion.
warm_start | Boolean | False | When set to True, reuse the solution of the previous call to fit and add more estimators to the ensemble.
validation_fraction | Float | 0.1 | Proportion of training data to set aside as validation set for early stopping.
n_iter_no_change | Integer | None | Number of iterations with no improvement to wait before early stopping.
tol | Float | 1e-4 | Tolerance for the early stopping.
ccp_alpha | Float | 0.0 | Complexity parameter used for Minimal Cost-Complexity Pruning.
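A minimal sketch, assuming scikit-learn's GradientBoostingClassifier; the values below are illustrative starting points, not tuned settings:

```python
# Illustrative sketch: GradientBoostingClassifier with core table parameters.
# Assumes scikit-learn; values are examples only.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

X, y = make_classification(n_samples=600, n_features=10, random_state=0)

gbm = GradientBoostingClassifier(
    n_estimators=200,
    learning_rate=0.1,     # lower rates usually need more estimators
    max_depth=3,           # shallow trees are typical for boosting
    subsample=0.9,         # <1.0 adds randomness and reduces variance
    random_state=0,
)
gbm.fit(X, y)
print("Training accuracy:", gbm.score(X, y))
```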

Detailed Parameter Reference

Number of boosting stages to perform.
Technical Implementation Guidance
Typically between 100 and 300; more stages can improve performance but may overfit.
Effect on Model Performance
More boosting stages generally improve model performance until diminishing returns or overfitting occur.
Valid Numerical Range
10-500
Learning rate shrinks the contribution of each tree by the learning_rate.
Technical Implementation Guidance
Commonly set to 0.1; lower values require more estimators, while higher values can lead to overfitting.
Effect on Model Performance
Controls the trade-off between the learning rate and the number of estimators; lower values reduce the risk of overfitting.
Valid Numerical Range
0.01-1.0
Loss function to be optimized.
Technical Implementation Guidance
'deviance' is standard for classification tasks; 'exponential' can be used for alternative boosting behavior.
Effect on Model Performance
Determines the optimization objective, influencing how the model penalizes errors.
Valid Numerical Range
Options: deviance, exponential
The fraction of samples used for fitting the individual base learners.
Technical Implementation Guidance
Often set to 1.0 for full sampling; lower values can introduce bias to reduce variance.
Effect on Model Performance
Reduces overfitting by using a subset of data for each estimator, though too low a value may underfit.
Valid Numerical Range
0.1-1.0
The function to measure the quality of a split.
Technical Implementation Guidance
'friedman_mse' is generally preferred for its performance improvements over mse and mae.
Effect on Model Performance
Influences the tree split quality, affecting the overall performance of the boosting process.
Valid Numerical Range
Options: friedman_mse, mse, mae
The minimum number of samples required to split an internal node.
Technical Implementation Guidance
2 for maximum flexibility; increase to 5-10 to reduce overfitting on small datasets.
Effect on Model Performance
Controls the granularity of the splits, affecting the model’s complexity and risk of overfitting.
Valid Numerical Range
2-20
The minimum number of samples required to be at a leaf node.
Technical Implementation Guidance
Typically between 1 and 5; higher values can smooth the model and reduce variance.
Effect on Model Performance
Ensures that leaf nodes have enough samples to provide stable estimates, reducing overfitting.
Valid Numerical Range
1-50
The minimum weighted fraction of the sum total of weights required to be at a leaf node.
Technical Implementation Guidance
0.0 is standard; adjust if dealing with weighted samples or imbalanced data.
Effect on Model Performance
Prevents splits that result in leaves representing an insignificant portion of the data.
Valid Numerical Range
0.0-0.5
The maximum depth of the individual regression estimators.
Technical Implementation Guidance
A depth of 3 is common for balancing model complexity and performance.
Effect on Model Performance
Limits the complexity of individual trees, preventing them from overfitting to the training data.
Valid Numerical Range
1-50
A node will be split if this split induces a decrease of the impurity greater than or equal to this value.
Technical Implementation Guidance
0.0 is standard; small positive values can be set to prevent insignificant splits.
Effect on Model Performance
Helps in controlling overfitting by ensuring that only splits with sufficient impurity reduction are considered.
Valid Numerical Range
>=0
An estimator object that is used to compute the initial predictions.
Technical Implementation Guidance
None for default behavior; provide a custom estimator if prior predictions are available.
Effect on Model Performance
Can improve convergence speed if a good initial estimator is provided.
Valid Numerical Range
None or valid estimator
Controls the randomness of the estimator.
Technical Implementation Guidance
Set to a fixed integer for reproducibility.
Effect on Model Performance
Ensures consistency of the results across multiple runs by controlling random aspects of the model.
Valid Numerical Range
Any integer or None
The number of features to consider when looking for the best split.
Technical Implementation Guidance
None for using all features; 'sqrt' or 'log2' can be used for high-dimensional data.
Effect on Model Performance
Limits the number of features for split consideration, potentially reducing overfitting and speeding up computation.
Valid Numerical Range
Options: auto, sqrt, log2, None, or positive integer
Enable verbose output.
Technical Implementation Guidance
0 for no output; increase for detailed logging during training.
Effect on Model Performance
Helps monitor the training process and debug issues by providing detailed output.
Valid Numerical Range
Non-negative integers
Grow trees with max_leaf_nodes in best-first fashion.
Technical Implementation Guidance
None for unlimited nodes; specify a number to control tree complexity.
Effect on Model Performance
Restricts the number of terminal nodes in each tree, helping to reduce overfitting.
Valid Numerical Range
None or positive integer
When set to True, reuse the solution of the previous call to fit and add more estimators to the ensemble.
Technical Implementation Guidance
False unless incremental learning is desired.
Effect on Model Performance
Allows the model to continue training from a previous state, potentially saving computation time.
Valid Numerical Range
True/False
Proportion of training data to set aside as validation set for early stopping.
Technical Implementation Guidance
0.1 is common; adjust based on the size of your training set.
Effect on Model Performance
Determines the amount of data used for early stopping, influencing the model’s ability to prevent overfitting.
Valid Numerical Range
0.0-1.0
Number of iterations with no improvement to wait before early stopping.
Technical Implementation Guidance
Set based on desired patience; None to disable early stopping (see the early-stopping sketch at the end of this parameter reference)
Effect on Model Performance
Enables early stopping if the model stops improving, potentially saving computation time and preventing overfitting.
Valid Numerical Range
Any positive integer or None
Tolerance for the early stopping.
Technical Implementation Guidance
1e-4 is standard; adjust if convergence issues are observed.
Effect on Model Performance
Defines the threshold for stopping the training process early, affecting convergence behavior.
Valid Numerical Range
Any positive float
Complexity parameter used for Minimal Cost-Complexity Pruning.
Technical Implementation Guidance
0.0 for no pruning; small positive values can help reduce overfitting.
Effect on Model Performance
Regularizes the complexity of the trees, ensuring that overly complex structures are pruned away.
Valid Numerical Range
>=0
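To close this model's reference, here is a hedged sketch of the early-stopping options (validation_fraction, n_iter_no_change, tol) described above, again assuming a scikit-learn backend:

```python
# Illustrative sketch: early stopping in GradientBoostingClassifier.
# Assumes scikit-learn; values are examples only.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=1)

gbm = GradientBoostingClassifier(
    n_estimators=1000,        # upper bound; early stopping may halt sooner
    validation_fraction=0.1,  # hold out 10% of training data internally
    n_iter_no_change=10,      # stop after 10 stages without improvement
    tol=1e-4,                 # minimum improvement that counts
    random_state=1,
)
gbm.fit(X, y)
print("Boosting stages actually fitted:", gbm.n_estimators_)
```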
When to use XGBoost: ideal for large-scale classification tasks, especially when dealing with structured and sparse data.
Limitations of XGBoost: sensitive to hyperparameter tuning and may overfit if not regularized properly.

Statistical Foundation

XGBoost implements regularized gradient boosting: it uses second-order (gradient and Hessian) approximations of the loss, applies L1/L2 penalties to leaf weights, and handles sparse inputs efficiently.

Computational Efficiency

Training complexity scales with input dimensionality and sample size, with efficient matrix operations for production deployment.

Data Requirements

Handles continuous features and sparse inputs natively without normalization; categorical variables must be encoded numerically before training.
Configure these parameters to optimize the XGBoost for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
n_estimators | Integer | 100 | Number of boosting rounds.
learning_rate | Float | 0.1 | Learning rate shrinks the contribution of each tree.
max_depth | Integer | 6 | Maximum depth of a tree.
min_child_weight | Float | 1 | Minimum sum of instance weight needed in a child.
subsample | Float | 1.0 | Subsample ratio of the training instance.
colsample_bytree | Float | 1.0 | Subsample ratio of columns when constructing each tree.
gamma | Float | 0 | Minimum loss reduction required to make a further partition.
alpha | Float | 0 | L1 regularization term on weights.
lambda | Float | 1 | L2 regularization term on weights.
random_state | Integer | None | Seed for random number generator.
objective | String | binary:logistic | Specify the learning task and objective.
booster | String | gbtree | Type of boosting model to use.
verbosity | Integer | 1 | Verbosity of printing messages.
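A minimal sketch, assuming the standard xgboost Python package (XGBClassifier) as the backend; parameter names such as reg_alpha and reg_lambda map to the alpha and lambda entries above, and the values are illustrative:

```python
# Illustrative sketch: XGBClassifier via the scikit-learn-style wrapper.
# Assumes the xgboost package; values are examples only.
from sklearn.datasets import make_classification
from xgboost import XGBClassifier

X, y = make_classification(n_samples=600, n_features=12, random_state=0)

xgb = XGBClassifier(
    n_estimators=300,
    learning_rate=0.1,
    max_depth=6,
    subsample=0.9,
    colsample_bytree=0.8,
    reg_alpha=0.0,               # L1 term (alpha in the table)
    reg_lambda=1.0,              # L2 term (lambda in the table)
    objective="binary:logistic",
    random_state=0,
)
xgb.fit(X, y)
print("Training accuracy:", xgb.score(X, y))
```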

Detailed Parameter Reference

Number of boosting rounds.
Technical Implementation Guidance
Typically between 100 and 500 for balanced performance.
Effect on Model Performance
More boosting rounds can improve performance but also increase the risk of overfitting.
Valid Numerical Range
10-1000
Learning rate shrinks the contribution of each tree.
Technical Implementation Guidance
Usually set between 0.01 and 0.3.
Effect on Model Performance
Lower values improve model robustness but require more estimators to converge.
Valid Numerical Range
0.01-1.0
Maximum depth of a tree.
Technical Implementation Guidance
Commonly between 3 and 10, depending on dataset complexity.
Effect on Model Performance
Controls model complexity; deeper trees capture more details but may overfit.
Valid Numerical Range
1-50
Minimum sum of instance weight needed in a child.
Technical Implementation Guidance
Typically 1 or greater; higher values can help prevent overfitting.
Effect on Model Performance
Ensures that nodes have sufficient information before further splitting, reducing overfitting.
Valid Numerical Range
0-10
Subsample ratio of the training instance.
Technical Implementation Guidance
Usually set around 0.8 to 1.0.
Effect on Model Performance
Reduces overfitting by using a fraction of the training data for each tree.
Valid Numerical Range
0.1-1.0
Subsample ratio of columns when constructing each tree.
Technical Implementation Guidance
Commonly between 0.7 and 1.0.
Effect on Model Performance
Helps prevent overfitting by randomly sampling features for each tree.
Valid Numerical Range
0.1-1.0
Minimum loss reduction required to make a further partition.
Technical Implementation Guidance
Set to 0 for default behavior; adjust upward to make the algorithm more conservative.
Effect on Model Performance
Controls the complexity by requiring a minimum loss reduction to split nodes.
Valid Numerical Range
0-5
L1 regularization term on weights.
Technical Implementation Guidance
0 for default; higher values can induce sparsity.
Effect on Model Performance
Adds a penalty for large coefficients, encouraging a simpler model.
Valid Numerical Range
0-5
L2 regularization term on weights.
Technical Implementation Guidance
Typically 1; adjust to reduce overfitting.
Effect on Model Performance
Helps stabilize the learning process and reduce overfitting by penalizing large weights.
Valid Numerical Range
0-5
Seed for random number generator.
Technical Implementation Guidance
Set to a fixed integer to ensure reproducibility.
Effect on Model Performance
Ensures consistent results across runs by controlling randomness.
Valid Numerical Range
Any integer or None
Specify the learning task and objective.
Technical Implementation Guidance
Choose based on the task: binary classification, multi-class classification, or regression.
Effect on Model Performance
Determines the prediction problem type and influences the loss function used.
Valid Numerical Range
Options: binary:logistic, multi:softmax, reg:squarederror
Type of boosting model to use.
Technical Implementation Guidance
gbtree is common; consider gblinear for linear models or dart for dropout boosting.
Effect on Model Performance
Impacts model performance and training speed depending on the boosting algorithm chosen.
Valid Numerical Range
Options: gbtree, gblinear, dart
Verbosity of printing messages.
Technical Implementation Guidance
1 for moderate output; adjust based on desired level of logging.
Effect on Model Performance
Controls the amount of training information printed, aiding in debugging and monitoring.
Valid Numerical Range
Non-negative integers
When to use CatBoost: ideal for datasets with a high proportion of categorical features and when minimal preprocessing is desired.
Limitations of CatBoost: may require careful tuning on numerical features and can be computationally intensive with large datasets.

Statistical Foundation

CatBoost is a gradient boosting implementation with ordered boosting and built-in target-based encoding of categorical features, which reduces prediction shift and the amount of manual preprocessing required.

Computational Efficiency

Training complexity scales with input dimensionality and sample size, with efficient matrix operations for production deployment.

Data Requirements

Accepts categorical features natively alongside continuous ones, with no one-hot encoding or normalization required; you only need to declare which columns are categorical.
Configure these parameters to optimize the CatBoost for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
iterations | Integer | 1000 | The number of boosting iterations.
learning_rate | Float | 0.03 | Learning rate.
depth | Integer | 6 | Depth of the tree.
l2_leaf_reg | Float | 3 | L2 regularization term on weights.
bagging_temperature | Float | 1.0 | Controls the Bayesian bootstrap sampling.
random_strength | Float | 1.0 | Score regularization coefficient.
border_count | Integer | 254 | The number of splits for numerical features.
random_seed | Integer | None | Random number generator seed.
boosting_type | String | Plain | Type of boosting used.
verbose | Integer | 0 | Verbosity level.
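A minimal sketch, assuming the catboost Python package (CatBoostClassifier) as the backend; categorical columns would normally be declared via the cat_features argument, and the values shown are illustrative:

```python
# Illustrative sketch: CatBoostClassifier with core table parameters.
# Assumes the catboost package; values are examples only.
from catboost import CatBoostClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=600, n_features=12, random_state=0)

cat = CatBoostClassifier(
    iterations=500,
    learning_rate=0.03,
    depth=6,
    l2_leaf_reg=3,
    random_seed=0,
    verbose=0,          # silence per-iteration logging
)
# For real data with categorical columns, pass their indices or names
# via cat_features=... so CatBoost encodes them natively.
cat.fit(X, y)
print("Training accuracy:", cat.score(X, y))
```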

Detailed Parameter Reference

The number of boosting iterations.
Technical Implementation Guidance
Typically between 500 and 1500, depending on dataset size and complexity.
Effect on Model Performance
More iterations can lead to better performance but increase the risk of overfitting and training time.
Valid Numerical Range
10-10000
Learning rate.
Technical Implementation Guidance
Commonly set between 0.01 and 0.1 to balance between convergence speed and overfitting.
Effect on Model Performance
Lower learning rates provide more robust models at the expense of longer training times.
Valid Numerical Range
0.01-1.0
Depth of the tree.
Technical Implementation Guidance
Typically between 4 and 10; deeper trees capture more complex patterns but may overfit.
Effect on Model Performance
Controls the complexity of the model; deeper trees can capture more interactions but increase overfitting risk.
Valid Numerical Range
1-16
L2 regularization term on weights.
Technical Implementation Guidance
Commonly set to 3; adjust upward to reduce overfitting in complex models.
Effect on Model Performance
Helps regularize the model by penalizing large weights, reducing overfitting.
Valid Numerical Range
0-10
Controls the Bayesian bootstrap sampling.
Technical Implementation Guidance
Typically set around 1.0; higher values introduce more randomness, which may help with overfitting.
Effect on Model Performance
Affects the randomness of sample selection, influencing model robustness and generalization.
Valid Numerical Range
0-10
Score regularization coefficient.
Technical Implementation Guidance
Usually set to 1.0; increasing can add robustness to the model by smoothing predictions.
Effect on Model Performance
Regulates the model’s sensitivity to changes in the data, potentially improving generalization.
Valid Numerical Range
0-20
The number of splits for numerical features.
Technical Implementation Guidance
Default of 254 works well in many cases; lower values may speed up training with a slight trade-off in accuracy.
Effect on Model Performance
Determines how numerical features are binned, impacting both model performance and training efficiency.
Valid Numerical Range
1-255
Random number generator seed.
Technical Implementation Guidance
Set to a fixed integer for reproducibility.
Effect on Model Performance
Ensures consistency in results across different runs by controlling randomness.
Valid Numerical Range
Any integer or None
Type of boosting used.
Technical Implementation Guidance
'Plain' is common; 'Ordered' can help reduce prediction shift in some cases.
Effect on Model Performance
Determines the boosting algorithm, which affects both model performance and training behavior.
Valid Numerical Range
Options: Ordered, Plain
Verbosity level.
Technical Implementation Guidance
0 for minimal output; increase for more detailed training logs.
Effect on Model Performance
Controls the level of logging during training, aiding in debugging and performance monitoring.
Valid Numerical Range
Non-negative integers
When to use LightGBM: ideal for large-scale data and high-dimensional feature spaces where fast training and low memory usage are crucial.
Limitations of LightGBM: may require careful tuning and can be sensitive to overfitting on small datasets.

Statistical Foundation

LightGBM grows trees leaf-wise on histogram-binned features, fitting each new tree to the gradient of the loss; leaf-wise growth and feature binning make it fast and memory-efficient on large, high-dimensional data.

Computational Efficiency

Training complexity scales with input dimensionality and sample size, with efficient matrix operations for production deployment.

Data Requirements

Handles continuous and categorical features (categoricals can be used natively or encoded) without normalization; histogram binning also makes it fairly robust to outliers.
Configure these parameters to optimize the LightGBM for your specific use case. Each parameter affects model training, performance, and prediction accuracy.
Parameter | Type | Default | Description
num_leaves | Integer | 31 | Maximum number of leaves in one tree.
learning_rate | Float | 0.1 | Learning rate.
n_estimators | Integer | 100 | Number of boosting rounds.
max_depth | Integer | -1 | Maximum depth of a tree.
min_data_in_leaf | Integer | 20 | Minimum number of data needed in a leaf.
feature_fraction | Float | 1.0 | Subsample ratio of features.
bagging_fraction | Float | 1.0 | Subsample ratio of training data.
bagging_freq | Integer | 0 | Frequency of bagging.
lambda_l1 | Float | 0.0 | L1 regularization term.
lambda_l2 | Float | 0.0 | L2 regularization term.
min_split_gain | Float | 0.0 | Minimum gain to make a split.
random_state | Integer | None | Random number generator seed.
objective | String | binary | Learning objective.
verbose | Integer | 1 | Verbosity of output.
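A minimal sketch, assuming the lightgbm Python package (LGBMClassifier) as the backend; the scikit-learn-style wrapper uses slightly different names (min_child_samples for min_data_in_leaf, subsample/subsample_freq for bagging_fraction/bagging_freq, colsample_bytree for feature_fraction), and the values are illustrative:

```python
# Illustrative sketch: LGBMClassifier via the scikit-learn-style wrapper.
# Assumes the lightgbm package; values are examples only.
from lightgbm import LGBMClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=800, n_features=15, random_state=0)

lgbm = LGBMClassifier(
    num_leaves=31,
    learning_rate=0.1,
    n_estimators=200,
    max_depth=-1,           # -1 means no depth limit
    min_child_samples=20,   # min_data_in_leaf in the table above
    subsample=0.9,          # bagging_fraction
    subsample_freq=1,       # bagging_freq (must be >=1 for subsample to apply)
    colsample_bytree=0.9,   # feature_fraction
    random_state=0,
)
lgbm.fit(X, y)
print("Training accuracy:", lgbm.score(X, y))
```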

Detailed Parameter Reference

Maximum number of leaves in one tree.
Technical Implementation Guidance
Typically between 20 and 100 for balanced models.
Effect on Model Performance
Affects the complexity of the tree; more leaves can capture more complex patterns but risk overfitting.
Valid Numerical Range
2-131072
Learning rate.
Technical Implementation Guidance
Commonly set between 0.05 and 0.2.
Effect on Model Performance
Determines the contribution of each tree; lower rates require more boosting rounds.
Valid Numerical Range
0.01-1.0
Number of boosting rounds.
Technical Implementation Guidance
Typically between 100 and 500.
Effect on Model Performance
More boosting rounds can improve performance but increase training time.
Valid Numerical Range
10-1000
Maximum depth of a tree.
Technical Implementation Guidance
Set to -1 for no limit or a positive integer to restrict depth.
Effect on Model Performance
Controls the complexity of individual trees; limiting depth can reduce overfitting.
Valid Numerical Range
-1 (no limit), or 1-50
Minimum number of data needed in a leaf.
Technical Implementation Guidance
Typically set to 20 or higher to ensure stable splits.
Effect on Model Performance
Ensures that leaves have sufficient samples to provide robust predictions.
Valid Numerical Range
1-10000
Subsample ratio of features.
Technical Implementation Guidance
Usually set around 0.8 to 1.0.
Effect on Model Performance
Helps reduce overfitting by randomly selecting a subset of features for each tree.
Valid Numerical Range
0.1-1.0
Subsample ratio of training data.
Technical Implementation Guidance
Often between 0.8 and 1.0.
Effect on Model Performance
Improves model generalization by using a fraction of data for each iteration.
Valid Numerical Range
0.1-1.0
Frequency of bagging.
Technical Implementation Guidance
Set to 0 to disable bagging, or a positive integer to specify frequency.
Effect on Model Performance
Determines how often bagging is performed; more frequent bagging can reduce overfitting.
Valid Numerical Range
0-10
L1 regularization term.
Technical Implementation Guidance
Typically 0.0; increase to add sparsity.
Effect on Model Performance
Helps reduce overfitting by penalizing large coefficients.
Valid Numerical Range
0.0-10.0
L2 regularization term.
Technical Implementation Guidance
Typically 0.0; adjust to prevent overfitting.
Effect on Model Performance
Adds stability to the model by penalizing large weights.
Valid Numerical Range
0.0-10.0
Minimum gain to make a split.
Technical Implementation Guidance
Usually set to 0.0; increase to require more significant splits.
Effect on Model Performance
Prevents insignificant splits, which can help reduce overfitting.
Valid Numerical Range
0.0-10.0
Random number generator seed.
Technical Implementation Guidance
Set to a fixed integer for reproducibility.
Effect on Model Performance
Ensures consistent results across different runs.
Valid Numerical Range
Any integer or None
Learning objective.
Technical Implementation Guidance
Choose based on task: 'binary' for binary classification, etc.
Effect on Model Performance
Determines the type of prediction problem the model will solve.
Valid Numerical Range
Options: binary, multiclass, regression
Verbosity of output.
Technical Implementation Guidance
1 for moderate output; adjust as needed.
Effect on Model Performance
Controls the level of logging during training.
Valid Numerical Range
Non-negative integers


Last updated: 3/22/2025
