Machine Learning – Page 3 – Mineetha Chandralekha

Transformer Architecture

September 16, 2025 October 20, 2025Machine Learning

The Transformer architecture lies at the heart of today’s large language models (LLMs) like GPT-4, Claude, and Gemini, revolutionizing how machines understand and generate text. Introduced in the 2017 paper “Attention Is All You Need” by Vaswani et al., this architecture replaced older recurrent models by offering a faster, more context-aware approach to processing…

The Transformer Revolution

September 16, 2025 September 16, 2025Machine Learning

Imagine you’re reading a long story. Halfway through, you come across the word “she.” To know who “she” is, your brain quickly looks back at earlier sentences: “Oh yes, the girl with the red umbrella!” That ability to look around and find the right connection is exactly what makes transformers…

The Magic of Tokenization: from words to numbers:

September 15, 2025 September 16, 2025Machine Learning

When you talk to an LLM, you type in words. But computers don’t understand words the way humans do. They only work with numbers. So, how does your sentence, “Good morning, how are you?” turn into something a machine can process? The answer is tokenization. Tokenization is the process of…

What is an LLM?

September 2, 2025 September 13, 2025Machine Learning

Large Language Models, or LLMs, are everywhere these days—chatbots, writing assistants, coding helpers, even customer support. But what exactly is an LLM? Every time you ask ChatGPT a question, you’re using an LLM. But how does it actually work? The easiest way to understand is to start simple. At its…

Time series analysis

August 20, 2023 September 17, 2023Machine Learning

Time series forecasting is a statistical technique used to predict future values based on historically observed data points ordered by time. Widely used in finance, economics, and business, it helps stakeholders anticipate future trends and make informed decisions. A time series is a sequence of data points, measured typically at…

Triple Exponential Smoothing

August 19, 2023 August 26, 2023Machine Learning

Triple Exponential Smoothing, commonly known as the Holt-Winters Method, extends upon Double Exponential Smoothing to address time series data that contains both a trend and a seasonal component. It incorporates three equations to capture the level, trend, and seasonality of a dataset, making it particularly useful for predicting values in…

Double Exponential Smoothing

August 17, 2023 August 26, 2023Machine Learning

Double Exponential Smoothing, also known as Holt’s Linear Exponential Smoothing, is a time series forecasting method that extends Simple Exponential Smoothing. While Simple Exponential Smoothing is best suited for time series without a trend, Double Exponential Smoothing can handle time series data with a trend but no seasonality. The primary…

Exponential Smoothing

August 17, 2023 August 26, 2023Machine Learning

Simple Exponential Smoothing (SES) is a time series forecasting method that is especially suitable for univariate data without a trend or seasonal pattern. It uses weighted averages of past observations to forecast future points. The method is ‘exponential’ because the weights decrease exponentially as observations get older. Key Concept: Smoothing…

Reinforcement Learning

July 2, 2023 September 17, 2023Machine Learning

Reinforcement Learning (RL) is a bit unique. It’s not like supervised learning where we have labeled data to guide the learning. But it’s also not unsupervised learning where the algorithm is left to find patterns on its own. In RL, we don’t give direct answers, but we do give feedback…

NLP

March 27, 2023 September 22, 2023Machine Learning

NLP, or Natural Language Processing, is a field at the intersection of computer science, artificial intelligence, and linguistics. Its goal is to enable computers to understand, interpret, and generate human languages in a way that is both meaningful and useful. Key Challenges in NLP: Ambiguity: Natural language is often ambiguous,…

Category: Machine Learning