2020 – Mineetha Chandralekha

Probability Distribution

December 30, 2020 | August 16, 2023 | Statistics

A probability distribution is a way to describe how likely different outcomes are in an experiment. It tells us what outcomes are possible and how likely they are to occur. In other words, it’s a mathematical function that provides the probabilities of occurrence of different possible outcomes. Types of Probability…
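As a small illustration of the idea, the sketch below builds the probability distribution for the sum of two fair six-sided dice. The dice experiment and the function name `two_dice_distribution` are assumptions chosen for illustration; they do not come from the post itself.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def two_dice_distribution():
    """Return {sum: probability} for the sum of two fair six-sided dice."""
    # Enumerate all 36 equally likely (first die, second die) outcomes.
    counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))
    total = sum(counts.values())
    # Each probability is the share of outcomes that produce that sum.
    return {s: Fraction(c, total) for s, c in sorted(counts.items())}


dist = two_dice_distribution()
for outcome, prob in dist.items():
    print(outcome, prob)
```

The mapping lists every possible outcome (2 through 12) with its probability, and the probabilities add up to 1, which is what makes it a valid distribution.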

Gini Index

December 29, 2020 | August 23, 2023 | Machine Learning, Supervised Learning

In a decision tree, the major challenge is identifying the attribute to use for the root node at each level. This process is known as attribute selection. There are two popular attribute selection measures: Gini Index and Information Gain. Gini Index calculates the amount of probability of a specific feature that is…
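A minimal sketch of how the Gini measure can be computed, assuming the common definition of Gini impurity (one minus the sum of squared class proportions) and a size-weighted average over the groups produced by a split. The function names `gini_impurity` and `weighted_gini` are hypothetical helpers, not taken from the post.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity of a list of class labels: 1 - sum(p_k ** 2)."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def weighted_gini(groups):
    """Size-weighted Gini impurity of the groups produced by a split."""
    total = sum(len(g) for g in groups)
    if total == 0:
        return 0.0
    return sum(len(g) / total * gini_impurity(g) for g in groups)


# A split that separates the classes perfectly scores 0.0;
# a split that leaves both groups evenly mixed scores 0.5.
print(weighted_gini([["yes", "yes"], ["no", "no"]]))
print(weighted_gini([["yes", "no"], ["yes", "no"]]))
```

Under this definition, the attribute whose split gives the lowest weighted value would be the one preferred for the node.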

The Cost Function in Decision Tree

December 28, 2020 | September 19, 2023 | Machine Learning

The cost function in the context of decision trees refers to a metric used to determine the “quality” of a split at any given node. Depending on the nature of the task (classification or regression), different cost functions are used. For classification: Gini Impurity, which measures the disorder in a set…

Decision Trees

December 21, 2020 | September 19, 2023 | Machine Learning, Supervised Learning

The decision tree algorithm is one of the most widely used algorithms in Machine Learning. It is a supervised learning algorithm. A decision tree uses a tree-like model to make predictions. It resembles an upside-down tree. A decision tree builds classification or regression models in the form of a tree…

Bootstrapping

December 15, 2020 | August 19, 2023 | Statistics

Bootstrapping is a resampling method that involves taking repeated samples (called ‘bootstrap samples’) from a dataset with replacement. It is used to estimate the distribution of a statistic, to construct confidence intervals, and to perform significance tests. Here is the basic procedure: Draw a Sample: Randomly select n observations from the…
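The resampling idea can be sketched as follows. This is a minimal percentile-bootstrap example, assuming the statistic of interest is the mean; the function name `bootstrap_ci`, its parameters, and the sample data are all hypothetical and not taken from the post.

```python
import random
import statistics


def bootstrap_ci(data, stat=statistics.mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for `stat` (assumed: the mean)."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(data)
    # Each bootstrap sample draws n observations from the data with replacement.
    estimates = sorted(stat(rng.choices(data, k=n)) for _ in range(n_resamples))
    lo_idx = int((alpha / 2) * n_resamples)
    hi_idx = int((1 - alpha / 2) * n_resamples) - 1
    return estimates[lo_idx], estimates[hi_idx]


sample = [2, 4, 4, 5, 7, 9, 10, 12]  # made-up data for illustration
low, high = bootstrap_ci(sample)
print(low, high)
```

The sorted list of resampled means approximates the sampling distribution of the statistic, and the interval is read off its lower and upper percentiles.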

Naive Bayes vs Logistic Regression

November 30, 2020 | August 19, 2023 | Machine Learning

Naive Bayes is a linear classifier that uses Bayes’ theorem and a strong independence assumption among features. Given a data set with n features represented by $F_1, F_2, \dots, F_n$, Naive Bayes states that the probability of output Y given the features F_i is $P(Y \mid F_1, \dots, F_n) \propto P(Y) \prod_{i=1}^{n} P(F_i \mid Y)$. Bayes’ theorem states that $P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}$. Logistic regression is a linear classification method that learns the probability…

Naive Bayes

November 29, 2020 | August 19, 2023 | Machine Learning, Supervised Learning

Naive Bayes is a very popular supervised classification algorithm. It is called “Naive” because it makes the naive assumption that each feature is independent of the other features. It is nearly impossible to find such data sets in real life. Bayes’ theorem is the basis of the Naive Bayes algorithm…

Logistic Regression

November 20, 2020 | September 19, 2023 | Machine Learning, Supervised Learning

Logistic Regression is a supervised classification algorithm that is used to predict the probability of a categorical dependent variable using a given set of independent variables. It is a predictive analysis algorithm and is based on the concept of probability. The most common use of logistic regression models is in binary classification problems. Some…
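To make "predict the probability" concrete, here is a minimal sketch of how a binary logistic regression model turns features into a probability, assuming the standard sigmoid of a linear combination. The names `sigmoid` and `predict_proba`, and the example weights, are hypothetical; no fitting is shown.

```python
import math


def sigmoid(z):
    """Map any real number to (0, 1), written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(weights, bias, features):
    """Probability of the positive class for one observation."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)


# Made-up parameters for illustration; a real model would learn these from data.
p = predict_proba(weights=[0.8, -0.4], bias=0.1, features=[2.0, 1.0])
label = 1 if p >= 0.5 else 0
print(p, label)
```

Thresholding the probability at 0.5 is one common way to get a class label in the binary case, though the cutoff can be chosen differently depending on the problem.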

Gradient Descent

October 30, 2020 | September 19, 2023 | Machine Learning

Gradient Descent is an optimization algorithm used to find the parameter values of a function that minimize a cost function. The average of the squared differences between the predicted values of y and the actual values of y is one such cost function. It is also called…
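A minimal sketch of the idea, assuming a one-feature linear model y ≈ w·x + b and the average-of-squared-differences cost described above. The function names, learning rate, step count, and data are illustrative assumptions rather than the post's own code.

```python
def cost(w, b, xs, ys):
    """Average of the squared differences between predictions and actual y."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def gradient_descent(xs, ys, lr=0.05, steps=2000):
    """Repeatedly step w and b against the gradient of the cost."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        grad_w = 2.0 / n * sum(e * x for e, x in zip(errors, xs))
        grad_b = 2.0 / n * sum(errors)
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Made-up data that lies exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = gradient_descent(xs, ys)
print(w, b, cost(w, b, xs, ys))
```

The learning rate matters: too large a value makes the updates diverge, while too small a value makes convergence slow, so the 0.05 used here is tuned to this toy data only.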

Linear Regression

October 20, 2020 | September 19, 2023 | Machine Learning

Linear regression is a supervised machine learning algorithm used for modeling the relationship between a dependent variable and one or more independent variables by fitting a linear equation. I would like to say it is the starting point of anyone’s ML journey! Linear regression is the simplest and most widely…
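For the single-variable case, "fitting a linear equation" can be sketched with the ordinary least squares closed form. The function name `fit_line` and the data points are hypothetical, and this covers only one independent variable.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Spread of x around its mean; zero means a vertical line, which cannot be fit.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up points that lie exactly on y = 2x + 1.
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(slope, intercept)
```

With more than one independent variable the same least-squares principle applies, but the solution is usually written in matrix form or found iteratively.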
