Probability Distribution

A probability distribution is a way to describe how likely different outcomes are in an experiment. It tells us what outcomes are possible and how likely they are to occur. In other words, it’s a mathematical function that provides the probabilities of occurrence of different possible outcomes. Types of Probability…

Continue reading

Gini Index

In Decision Tree, the major challenge is the identification of the attribute for the root node in each level. This process is known as attribute selection. There are two popular attribute selection measures: Gini Index Information Gain Gini Index calculates the amount of probability of a specific feature that is…

Continue reading

Decision Trees

The decision tree algorithm is one of the most widely used algorithms in Machine Learning. It is a supervised learning algorithm. A decision tree uses a tree-like model to make predictions. It resembles an upside-down tree. A decision tree builds classification or regression models in the form of a tree…

Continue reading

Bootstrapping

Bootstrapping is a resampling method that involves taking repeated samples (called ‘bootstrap samples’) from a dataset with replacement. It is used to estimate the distribution of a statistic and to calculate confidence intervals and significance tests. Here is the basic procedure: Draw a Sample: Randomly select n observations from the…

Continue reading

Naive Bayes vs Logistic Regression

Naive Bayes is a linear classifier using Bayes Theorem and strong independence condition among features. Given a data set with n features represented by Naive Bayes states the probability of output: Y from features F_i is, Bayes theorem states that: Logistic regression is a linear classification method that learns the probability…

Continue reading

Naive Bayes

Naive Bayes is a very popular Supervised Classification algorithm. This algorithm is called “Naive” because it makes a naive assumption that each feature is independent of other features. It is near to impossible to find such data sets in real life. Bayes’ theorem is the base for Naive Bayes Algorithm….

Continue reading

Logistic Regression

Logistic Regression is a supervised classification algorithm that is used to predict the probability of a categorical dependent variable using a given set of independent variables. It is a predictive analysis algorithm and based on the concept of probability. The most common use of logistic regression models is in binary classification problems. Some…

Continue reading

Gradient Descent

Gradient Descent is an optimization algorithm used to find the values of the parameters of any function that minimizes the cost function. The average difference of the squares of all the predicted values of y and the actual values of y is called a Cost Function. It is also called…

Continue reading

Linear Regression

Linear regression is a supervised machine learning algorithm used for modeling the relationship between a dependent variable and one or more independent variables by fitting a linear equation. I would like to say it is the starting point of anyone’s ML journey! Linear regression is the simplest and most widely…

Continue reading