
Unit I AI and its Subfields

  • Introduction to Artificial Intelligence (AI), History, Definition, Artificial General Intelligence (AGI), Industry Applications of AI, Challenges in AI.
  • Knowledge Engineering, Machine Learning (ML), Computer Vision, Natural Language Processing (NLP), Robotics.

Introduction to Artificial Intelligence

Artificial Intelligence (AI) is a broad and rapidly evolving field of computer science focused on designing and building machines that can perform tasks that normally require human intelligence.

These tasks include:

  • Learning – the ability to improve performance based on experience (e.g., machine learning algorithms that learn from data)
  • Problem-solving – finding solutions to complex problems (e.g., optimizing routes, solving puzzles)
  • Decision-making – making choices under uncertainty (e.g., medical diagnosis systems)
  • Perception – interpreting sensory data like images, sounds, or videos (e.g., facial recognition)
  • Language understanding – processing and generating human language (e.g., chatbots, translation tools)

AI systems aim to replicate or simulate human cognitive functions using algorithms, statistical models, and data-driven approaches.

Types of AI

  1. Narrow AI (Weak AI) – Designed for a specific task (e.g., voice assistants, recommendation systems).
  2. General AI (Strong AI) – Hypothetical systems able to perform any intellectual task a human can (see the AGI section below).
  3. Super AI – Hypothetical systems that surpass human intelligence across all domains.

Img - AI unit 1_2.jpg

AI Techniques

  • Machine Learning – Systems that learn from data without being explicitly programmed.
  • Deep Learning – A subset of machine learning using neural networks with many layers.
  • Natural Language Processing (NLP) – Enabling machines to understand and generate human language.
  • Computer Vision – Allowing machines to interpret visual information.

History of Artificial Intelligence (AI)

The history of Artificial Intelligence spans several decades of research, innovation, and milestones. Here’s a structured overview from its early ideas to present-day advancements:

  1. Early Ideas (Pre–20th Century)
  • The concept of intelligent machines can be traced back to ancient myths, stories, and automata.
  • Greek myths told of mechanical beings, and ancient philosophers such as Aristotle explored logic and reasoning, laying philosophical foundations.
  • In the 17th century, Gottfried Wilhelm Leibniz worked on symbolic logic, which influenced later computational theories.
  2. Birth of AI (1940s–1950s)
  • Alan Turing (1950): Introduced the Turing Test to assess whether a machine can exhibit intelligent behavior indistinguishable from a human.
  • John von Neumann: His work on stored-program computers provided the architecture for computational processes.
  • 1956 – Dartmouth Conference: Considered the official birth of AI as a field. Researchers like John McCarthy, Marvin Minsky, Herbert Simon, and Allen Newell gathered to explore machine intelligence.
  3. Early AI Programs (1950s–1970s)
  • Logic Theorist (1956): Created by Allen Newell and Herbert Simon; it could prove mathematical theorems.
  • General Problem Solver (1957): Another early AI program capable of solving puzzles.
  • ELIZA (1966): An early chatbot by Joseph Weizenbaum that simulated conversation.
  4. The First AI Winter (1970s–1980s)
  • Progress slowed due to limitations in computing power and unrealistic expectations.
  • Funding and interest declined because early systems couldn't handle real-world complexities.
  • AI research faced skepticism, and this period became known as the AI Winter.
  5. Expert Systems Era (1980s)
  • AI revived with Expert Systems, programs that used a set of rules to solve specific problems.
  • Example: MYCIN – A medical diagnosis system.
  • Governments and industries invested heavily, but these systems were expensive and hard to maintain.
  • The field again faced challenges, leading to a second AI Winter in the late 1980s.
  6. Machine Learning and Big Data (1990s–2000s)
  • Researchers shifted focus from rule-based systems to machine learning, where systems learn from data.
  • Support Vector Machines (SVMs), decision trees, and other algorithms gained popularity.
  • With improved computing power and access to large datasets, AI systems became more practical.
  • AI started being used in areas like speech recognition and recommendation systems.
  7. Deep Learning and Modern AI (2010s–Present)
  • Deep learning using neural networks revolutionized AI.
  • Image recognition, natural language processing, and self-driving cars became achievable.
  • Landmark achievements:
    • AlphaGo (2016): Defeated human champions in the game of Go.
    • GPT models (2018 onward): Large language models like GPT-3 and GPT-4 that understand and generate human-like text.
  • AI is now widely used in healthcare, finance, education, entertainment, and more.
  8. Current Trends and Future Outlook
  • Ethical AI – Addressing concerns about bias, privacy, and fairness.
  • Explainable AI (XAI) – Making AI decisions understandable to humans.
  • AI Governance – Creating policies and regulations to ensure responsible use.
  • Artificial General Intelligence (AGI) – Still a long-term goal, but research continues toward machines that can think, reason, and learn like humans.

Summary Timeline

| Year/Period | Key Event |
|---|---|
| Ancient times | Myths and philosophical ideas on intelligence |
| 1950 | Alan Turing’s Turing Test |
| 1956 | Dartmouth Conference – Birth of AI |
| 1960s | ELIZA chatbot, early theorem-proving programs |
| 1970s | First AI Winter due to limitations |
| 1980s | Rise of expert systems |
| Late 1980s | Second AI Winter |
| 1990s–2000s | Machine learning becomes dominant |
| 2010s–present | Deep learning breakthroughs and real-world AI apps |
| Future | Ethical AI, AGI research, global regulation efforts |

Artificial General Intelligence (AGI)

Artificial General Intelligence (AGI) refers to a type of machine intelligence that can perform any intellectual task that a human being is capable of. It is also known as strong AI or full AI. Current AI systems are limited to specific functions, whereas AGI aims to replicate human cognitive abilities such as:

  • Reasoning
  • Problem-solving
  • Learning from experience
  • Understanding language, context, and emotions
  • Adapting to new situations

AGI systems are designed to apply knowledge across different domains without needing specialized programming. They can transfer skills, learn from new experiences, and improve themselves over time, much like humans do.

Key Features of AGI:

  1. Broad Abilities: Able to perform diverse tasks without needing separate training for each.
  2. Learning and Adaptation: Learns from experience and applies it to unfamiliar situations.
  3. Human-like Understanding: Interprets language, context, and behavior intelligently.
  4. Transferability: Applies knowledge across various domains and tasks.
  5. Self-improvement: Enhances its own capabilities without external intervention.

Why is AGI Important? AGI represents the next frontier in artificial intelligence. It has the potential to revolutionize technology and society by creating machines that think, reason, and learn like humans. However, developing AGI is a major challenge due to its complexity and ethical considerations.

Differences between Artificial General Intelligence (AGI) and Narrow or Weak AI

| Component | AGI (Artificial General Intelligence) | Narrow or Weak AI |
|---|---|---|
| Definition | A machine intelligence capable of performing any intellectual task a human can do. | Designed to perform a specific task or a limited set of tasks. |
| Scope of Abilities | Broad cognitive capabilities; can handle multiple tasks without specialized programming. | Limited to one or a few tasks; cannot adapt beyond its programmed scope. |
| Learning | Learns from experience and applies knowledge across different domains. | Learns only within a narrow context; cannot generalize learning. |
| Adaptability | Can adapt to new challenges and tasks autonomously. | Cannot adapt to new tasks without human intervention or reprogramming. |
| Understanding | Understands context, language nuances, meaning, and behavior like humans. | Operates based on rules and patterns; lacks deeper understanding. |
| Transferability | Applies skills and knowledge across various fields and domains. | Restricted to specific domains; cannot transfer knowledge to unrelated tasks. |
| Self-improvement | Capable of learning and improving independently over time. | Requires external updates or modifications for improvements. |
| Interaction with Humans | Communicates naturally and intelligently, resembling human interaction. | Interaction is rigid, rule-based, and task-specific. |
| Complexity | Highly complex; mirrors human reasoning and decision-making processes. | Relatively simple and task-oriented. |
| Goal | To create machines with human-like cognitive abilities and versatility. | To solve particular problems efficiently without human-like intelligence. |

Img - AI unit 1_6.jpg

Industry Applications of AI

Artificial Intelligence (AI) is being widely applied across industries to improve efficiency, reduce costs, and enhance customer experience.

Table: Industry Applications of AI

| Industry | AI Applications |
|---|---|
| Healthcare | - Medical diagnosis and imaging analysis (e.g., identifying diseases from scans)<br>- Personalized treatment plans<br>- Drug discovery and development<br>- Virtual health assistants and chatbots for patient interaction<br>- Monitoring patient health using wearable devices |
| Finance | - Fraud detection and risk management<br>- Algorithmic trading and investment strategies<br>- Credit scoring and loan approval<br>- Customer service automation using AI-powered chatbots<br>- Predictive analytics for market trends |
| Retail & E-commerce | - Personalized product recommendations<br>- Inventory management and demand forecasting<br>- Visual search and virtual try-on features<br>- Customer service automation<br>- Price optimization |
| Manufacturing | - Predictive maintenance to avoid equipment failure<br>- Quality control and defect detection<br>- Supply chain optimization<br>- Robotics for assembly and packaging<br>- Process automation |
| Automotive | - Autonomous vehicles and driver-assist systems<br>- Traffic pattern analysis for smart navigation<br>- Predictive maintenance<br>- Enhanced safety features using sensors and AI algorithms |
| Education | - Intelligent tutoring systems that personalize learning<br>- Automated grading and assessment<br>- Virtual classrooms and AI-driven content recommendations<br>- Learning analytics for student performance tracking |
| Entertainment & Media | - Content recommendation engines (movies, music, games)<br>- Automated video editing and production<br>- AI-driven storytelling and scriptwriting tools<br>- Enhanced user experience through interactive platforms |
| Energy & Utilities | - Smart grids and energy management<br>- Forecasting energy consumption<br>- Predictive maintenance of infrastructure<br>- Optimization of renewable energy sources |
| Agriculture | - Crop monitoring using drone imagery<br>- Pest and disease detection<br>- Precision farming with sensor data<br>- Automated irrigation and yield prediction |
| Human Resources | - Resume screening and candidate selection<br>- Employee performance analytics<br>- Predictive workforce planning<br>- Training and onboarding automation |

Advantages of AI in Industry:

  • Improved decision-making through data analysis
  • Automation of routine and repetitive tasks
  • Enhanced customer satisfaction through personalization
  • Increased efficiency and reduced operational costs
  • Faster innovation and development cycles

Challenges in AI

Artificial Intelligence offers tremendous benefits, but its development and deployment face several challenges that must be addressed for safe, ethical, and effective use.

Table: Challenges in AI

| Category | Challenges |
|---|---|
| Technical Challenges | - Data Quality and Availability: AI systems require large amounts of high-quality data, which may not always be available or may be biased.<br>- Interpretability: Many AI models, especially deep learning ones, are “black boxes” whose decision-making processes are difficult to understand.<br>- Scalability: Developing AI systems that perform reliably across different environments and large datasets is complex.<br>- Robustness: AI models can be sensitive to small changes in input, leading to incorrect or unsafe outcomes.<br>- Integration: Incorporating AI into existing systems and workflows can be technically challenging. |
| Ethical Challenges | - Bias and Fairness: AI can inherit biases from training data, leading to unfair or discriminatory outcomes.<br>- Privacy: Collecting and using personal data for AI can compromise individual privacy rights.<br>- Transparency: There is a need for clear explanations of how AI makes decisions to ensure accountability.<br>- Accountability: Determining who is responsible when AI systems cause harm or make mistakes is complex. |
| Social Challenges | - Job Displacement: Automation may replace human jobs, leading to unemployment and economic disparity.<br>- Trust: Users may be reluctant to trust AI systems due to fears about reliability, control, and misuse.<br>- Security: AI systems can be vulnerable to attacks, such as adversarial inputs or data manipulation.<br>- Access Inequality: Advanced AI technologies may be accessible only to wealthier organizations or countries, increasing global inequality. |
| Regulatory Challenges | - Lack of Standards: AI development often lacks standardized frameworks, making governance and safety regulation difficult.<br>- Legal Frameworks: Existing laws may not cover AI’s unique risks, requiring new policies and legislation.<br>- Cross-border Coordination: AI impacts are global, requiring cooperation between governments and industries. |
| Environmental Challenges | - High Energy Consumption: Training large AI models demands significant computational power, contributing to carbon emissions.<br>- Sustainability: Efficient use of resources and energy management in AI systems is still a major concern. |

Knowledge Engineering

Knowledge Engineering is a branch of Artificial Intelligence (AI) and Computer Science dedicated to the design, development, and maintenance of knowledge-based systems. Its primary objective is to capture, represent, organize, and utilize human expertise and domain-specific knowledge in a machine-readable form. This structured knowledge enables intelligent systems to reason, make informed decisions, and solve complex problems within specialized domains.

Steps in Knowledge Engineering

  1. Knowledge Acquisition – The process of gathering expertise from domain specialists and converting it into a form understandable by computers. Techniques include expert interviews, surveys, observations, and analysis of existing documents or case studies.
  2. Knowledge Representation – Organizing and structuring acquired knowledge in a machine-interpretable format. Ontologies, semantic networks, rules, frames, and logic-based models are common representation methods.
  3. Knowledge Integration – Combining knowledge from different sources such as structured databases, unstructured text, and expert rules into a unified knowledge base.
  4. Inference and Reasoning – Developing algorithms and mechanisms that allow the system to draw conclusions, make inferences, and apply logical reasoning.
  5. Maintenance and Refinement – Continuously updating and improving the knowledge base to keep pace with domain changes. This ensures accuracy, relevance, and adaptability of the system over time.
  6. Verification and Validation – Checking that knowledge has been correctly captured and represented, and ensuring that the system’s results are consistent with real-world expectations and expert judgment.
  7. Deployment and Interaction – Integrating the knowledge-based system into real-world applications, and designing user-friendly interfaces that allow users to query the system and receive accurate, meaningful responses. A minimal sketch of steps 2 and 4 follows this list.
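
To make knowledge representation (step 2) and inference (step 4) concrete, here is a minimal forward-chaining sketch in Python. The if-then rules and facts are invented purely for illustration, not taken from a real expert system.

```python
# Minimal forward-chaining inference over if-then rules.
# Rules map a set of required conditions to a conclusion;
# the medical-style rules and facts below are invented for illustration.

rules = [
    ({"has_fever", "has_cough"}, "possible_flu"),
    ({"possible_flu", "high_risk_patient"}, "refer_to_doctor"),
]

facts = {"has_fever", "has_cough", "high_risk_patient"}

# Keep applying rules until no new conclusions can be drawn.
changed = True
while changed:
    changed = False
    for conditions, conclusion in rules:
        if conditions <= facts and conclusion not in facts:
            facts.add(conclusion)
            changed = True

print(facts)  # now includes 'possible_flu' and 'refer_to_doctor'
```

Real knowledge-based systems use richer representations (ontologies, frames, logic programs), but the acquire-represent-infer cycle is the same.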

Img - AI unit 1_10.jpg

Machine Learning

Machine Learning (ML) is a branch of Artificial Intelligence (AI) that allows computers to automatically learn from data and enhance their performance on tasks without being explicitly programmed. In other words, ML systems identify patterns in data and use them to make predictions or decisions.

Here’s Tom Mitchell’s widely cited definition of Machine Learning: “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.”

Components:

  • T = Task (what the program is supposed to do)
  • P = Performance measure (how we evaluate the model)
  • E = Experience (data or feedback used for learning)

Example:

  • Task (T) = Playing chess
  • Performance (P) = Win rate against opponents
  • Experience (E) = Games played

Types of Machine Learning

The following table presents the different types of machine learning and their applications.

Table: Types of Machine Learning

| Type | Definition | Example | Applications |
|---|---|---|---|
| 1. Supervised Learning | The model learns from labeled data (input–output pairs) to make predictions. | Classification, prediction (regression) | - Email spam detection<br>- Stock price prediction<br>- Medical diagnosis |
| 2. Unsupervised Learning | The model learns patterns from unlabeled data, finding structure or relationships. | Clustering | - Customer segmentation<br>- Market basket analysis<br>- Anomaly detection |
| 3. Reinforcement Learning | The model learns by trial and error, receiving rewards or penalties for actions. | Training a robot to navigate a maze | - Robotics<br>- Game AI (e.g., AlphaGo)<br>- Self-driving cars |
| 4. Semi-Supervised Learning | Uses both labeled and unlabeled data to improve learning. | Classifying web content when only some pages are labeled | - Text classification<br>- Image recognition with limited labels |
| 5. Self-Supervised Learning | The model generates labels from the input data itself to learn representations. | Predicting missing words in a sentence (used in NLP models like GPT) | - Natural Language Processing (NLP)<br>- Computer vision<br>- Speech recognition |

Classification is a type of supervised learning task where the goal is to predict the category or class label of new observations based on past data.

It involves two phases:

  1. Training Phase: The algorithm learns from a labeled dataset where each input has a known class.
  2. Prediction Phase: The trained model predicts the class for new, unseen data.

| Example | Input Features | Class Labels |
|---|---|---|
| Email spam detection | Email content, sender, subject | Spam / Not Spam |
| Disease diagnosis | Symptoms, age, test results | Disease / No Disease |
| Credit card fraud detection | Transaction amount, location | Fraud / Not Fraud |
| Handwritten digit recognition | Pixel values of the image | 0, 1, 2, …, 9 |

Common Classification Algorithms:

  • Decision Tree
  • Random Forest
  • Support Vector Machine (SVM)
  • k-Nearest Neighbors (k-NN)
  • Naive Bayesian Classification
  • Artificial Neural Networks (ANN)
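
To illustrate the two phases, here is a minimal classification sketch using scikit-learn (assuming it is installed). The decision tree and the tiny spam-style dataset, with invented features such as link count and capitalized-word count, are purely for illustration.

```python
# Minimal supervised classification sketch with scikit-learn.
# Each input is [number of links, number of capitalized words] in an email;
# the data values are made up for illustration.
from sklearn.tree import DecisionTreeClassifier

X_train = [[8, 20], [7, 15], [0, 1], [1, 2]]   # inputs with known classes
y_train = ["spam", "spam", "not spam", "not spam"]

model = DecisionTreeClassifier()
model.fit(X_train, y_train)                    # training phase

print(model.predict([[6, 18]]))                # prediction phase -> ['spam']
```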

Regression is a supervised learning task used to predict continuous numeric values from input features, such as predicting house prices from size.
Regression involves two phases: training and prediction.

  • In the training phase, the model learns from a labeled dataset where each input has a known numeric output.
  • In the prediction phase, the trained model predicts the numeric value for new, unseen inputs.

| Example | Input Features | Output (Continuous Value) |
|---|---|---|
| House price prediction | Area, location, number of rooms | Price in dollars |
| Temperature forecasting | Humidity, pressure, time | Temperature in °C |
| Stock price prediction | Historical prices, market trends | Stock price in dollars |
| Car resale value estimation | Age, mileage, brand | Price in dollars |

The most important regression algorithms are:

  1. Simple linear regression – One independent variable, one response variable
  2. Multiple linear regression – Two or more independent variables, one response variable
  3. Non-linear regression – Models relationships between variables that are not linear
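
A minimal simple linear regression sketch using scikit-learn, with an invented house-price dataset (area in square feet as the single independent variable):

```python
# Minimal simple linear regression sketch with scikit-learn.
# Toy data: house area (sq. ft.) -> price (dollars); values are invented.
from sklearn.linear_model import LinearRegression

X = [[500], [1000], [1500], [2000]]        # one independent variable (area)
y = [100000, 180000, 260000, 340000]       # continuous response (price)

model = LinearRegression()
model.fit(X, y)                            # training phase

print(model.predict([[1200]]))             # predicted price for 1200 sq. ft.
```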

Clustering

Clustering is an unsupervised learning technique that aims to group similar data points based on their features, without relying on predefined labels.

  • Input: Unlabeled data
  • Output: Groups or clusters of similar items

The algorithm examines the data, measures the similarity between data points, and assigns them to clusters so that points in the same cluster are more similar to each other than to points in other clusters.

Examples of Clustering Tasks

| Example | Input Features | Output (Clusters) |
|---|---|---|
| Customer segmentation | Age, income, buying behavior | High-value, medium-value, low-value customers |
| Document grouping | Text content, keywords | Sports, politics, technology documents |
| Image segmentation | Pixel values, color, texture | Different objects or regions in images |
| Market analysis | Purchase history, demographics | Customer groups with similar preferences |

Img - AI unit 1_13.jpg

Important Clustering Algorithms

  • K-Means Clustering
  • Hierarchical Clustering
  • DBSCAN (Density-Based Spatial Clustering)
  • Gaussian Mixture Models (GMM)
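
A minimal K-Means sketch with scikit-learn, clustering invented customer records by age and income; the choice of three clusters is an illustrative assumption.

```python
# Minimal K-Means clustering sketch with scikit-learn.
# Toy customer data: [age, annual income in $1000s]; values are invented.
from sklearn.cluster import KMeans

X = [[25, 30], [27, 32], [45, 80], [47, 85], [60, 40], [62, 38]]

kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)             # no labels given: unsupervised

print(labels)                              # cluster index for each customer
print(kmeans.cluster_centers_)             # centers of the learned clusters
```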

Artificial Neurons

An artificial neuron (also called a perceptron) is the basic computational unit of a neural network, inspired by the biological neuron in the human brain. It receives one or more inputs, processes them, and produces an output based on a function.

Structure of an Artificial Neuron:

(Inputs x₁, x₂, … xₙ → weights w₁, w₂, … wₙ → Summation → Activation Function → Output)
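
A minimal sketch of this structure in Python: a weighted sum plus a bias, passed through a step activation. The weights and bias below are hand-picked assumptions that happen to implement a logical AND gate.

```python
# A single artificial neuron: weighted sum of inputs plus a bias,
# passed through a step activation function.
def neuron(inputs, weights, bias):
    s = sum(x * w for x, w in zip(inputs, weights)) + bias  # summation
    return 1 if s >= 0 else 0                               # step activation

# Hand-picked weights/bias (illustrative assumption): behaves as an AND gate.
for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "->", neuron([x1, x2], weights=[1, 1], bias=-1.5))
```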

Differences Between Biological and Artificial Neurons

| Feature | Biological Neuron | Artificial Neuron |
|---|---|---|
| Basic Unit | Nerve cell in the brain or nervous system | Computational unit in an artificial neural network |
| Signal Type | Electrical (action potentials) and chemical (neurotransmitters) | Numerical values (real numbers) |
| Structure | Dendrites, cell body (soma), axon, synapses | Inputs, weights, bias, activation function, output |
| Input Handling | Receives signals from thousands of other neurons through dendrites | Receives multiple weighted inputs from other neurons or features |
| Processing | Non-linear integration of signals in the soma | Computes a weighted sum of inputs and applies an activation function |
| Output | Action potential transmitted via axon to other neurons | Output value sent to next layer or final result |
| Learning Mechanism | Synaptic plasticity: strengthens or weakens connections (Hebbian learning, LTP/LTD) | Adjusts weights and biases through optimization algorithms (e.g., gradient descent) |
| Communication Speed | Slower (~milliseconds per signal) | Very fast (microseconds per computation) |
| Energy Source | Metabolic energy (ATP) | Electrical energy in a computer |
| Complexity | Highly complex; can self-organize, repair, and adapt | Simpler mathematical model, fully controlled by code and data |
| Flexibility | Can handle ambiguous and incomplete information naturally | Requires training data and a defined network structure |

A single-layer perceptron can solve linearly separable problems such as AND and OR.
It cannot solve linearly inseparable problems such as XOR.
Solving XOR requires a multilayer perceptron.

Multilayer Perceptron (MLP)

A Multilayer Perceptron (MLP) is an artificial neural network composed of several layers of interconnected neurons (nodes) arranged in a feedforward structure.
It has:

  • one input layer
  • one or more hidden layers
  • one output layer

The Backpropagation algorithm is used to train multilayer perceptron networks.
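
A minimal sketch using scikit-learn's MLPClassifier: one small hidden layer learns the XOR function that no single-layer perceptron can represent. The hyperparameters are illustrative choices, not prescribed values.

```python
# An MLP learning XOR, which is not linearly separable.
# Hyperparameters are illustrative; a different random seed may be
# needed for convergence on some scikit-learn versions.
from sklearn.neural_network import MLPClassifier

X = [[0, 0], [0, 1], [1, 0], [1, 1]]
y = [0, 1, 1, 0]                              # XOR truth table

mlp = MLPClassifier(hidden_layer_sizes=(4,), activation="tanh",
                    solver="lbfgs", max_iter=2000, random_state=1)
mlp.fit(X, y)                                 # weights fit via backpropagated gradients

print(mlp.predict(X))                         # expected: [0 1 1 0]
```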

Structure of an MLP

Deep Learning

Deep Learning (DL) is a type of Machine Learning that uses multi-layered neural networks to automatically learn features from large and complex data such as images, audio, and text, without manual feature engineering.

The following table represents different deep learning architectures:

Img - AI unit 1_15.jpg

Reinforcement Learning (RL)

Reinforcement Learning is a branch of machine learning in which an agent learns to make decisions by interacting with an environment, aiming to maximize cumulative rewards. Unlike supervised learning, it does not rely on labeled data; instead, the agent improves its performance through trial and error.

The following table represents key components in reinforcement learning:

| Component | Description |
|---|---|
| Agent | The learner or decision maker. |
| Environment | The world the agent interacts with. |
| State (s) | The current situation of the agent in the environment. |
| Action (a) | Choices the agent can make in a given state. |
| Reward (r) | Feedback received after taking an action; can be positive or negative. |

Img - AI unit 1_16.jpg

Key Components of Reinforcement Learning

| Component | Description |
|---|---|
| Policy (π) | Strategy used by the agent to decide actions based on states. |
| Value Function (V or Q) | Estimates the expected reward from a state (V) or state–action pair (Q). |
| Model | Optional; predicts the next state and reward given the current state and action. |

Working Procedure of Reinforcement Learning

  1. The agent observes the current state.
  2. Based on its policy, it selects an action (a).
  3. The environment responds with a reward (r) and the next state (s’).
  4. The agent updates its policy or value function to improve future decisions.
  5. Repeat until the agent learns an optimal policy.
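
The sketch below walks this loop on an invented toy problem: a five-state corridor where the agent is rewarded for reaching the rightmost state, learned with tabular Q-Learning (introduced in the next subsection). All hyperparameters are illustrative assumptions.

```python
# Minimal tabular Q-Learning sketch on an invented 5-state corridor:
# states 0..4; actions: 0 = left, 1 = right; reward +1 for reaching state 4.
import random

n_states, n_actions = 5, 2
Q = [[0.0] * n_actions for _ in range(n_states)]
alpha, gamma, epsilon = 0.5, 0.9, 0.1   # learning rate, discount, exploration

def step(state, action):
    nxt = max(0, state - 1) if action == 0 else min(n_states - 1, state + 1)
    reward = 1.0 if nxt == n_states - 1 else 0.0   # goal: reach state 4
    return nxt, reward

for episode in range(200):
    s = 0
    while s != n_states - 1:
        # step 2: select an action with an epsilon-greedy policy
        a = random.randrange(n_actions) if random.random() < epsilon else Q[s].index(max(Q[s]))
        # step 3: the environment returns the reward and the next state
        s2, r = step(s, a)
        # step 4: move Q(s, a) toward r + gamma * max_a' Q(s', a')
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
        s = s2

# Learned policy: best action per non-terminal state (all 1 = go right).
print([max(range(n_actions), key=lambda a: Q[s][a]) for s in range(n_states - 1)])
```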

Types of Reinforcement Learning

  1. Model-Free RL – The agent learns without knowing the environment’s dynamics. Examples:
    • Q-Learning
    • SARSA
  2. Model-Based RL – The agent tries to learn a model of the environment and uses it for planning.

Popular RL Algorithms

| Category | Algorithm |
|---|---|
| Value-Based | Q-Learning, Deep Q-Networks (DQN) |
| Policy-Based | REINFORCE, Policy Gradient |
| Actor-Critic | A3C, PPO, DDPG |

Applications of RL:

  • Gaming: AlphaGo, Chess, Atari games
  • Robotics: Robot navigation, manipulation
  • Finance: Portfolio management, trading
  • Healthcare: Treatment planning, drug discovery
  • Autonomous Vehicles: Self-driving car decision making.

Img - AI unit 1_17.jpg

Computer Vision

Computer Vision is a field of artificial intelligence and computer science that enables computers to interpret, analyze, and understand visual information from the world, such as images or videos, in a way similar to human vision.
The goal is to automate tasks that the human visual system can perform.

Key Tasks in Computer Vision

| Task | Description |
|---|---|
| Image Classification | Assigning a label to an entire image (e.g., cat, dog). |
| Object Detection | Identifying and locating objects in an image with bounding boxes. |
| Image Segmentation | Dividing an image into meaningful regions (semantic or instance segmentation). |
| Face Recognition | Identifying or verifying a person from facial images. |
| Optical Character Recognition (OCR) | Converting printed or handwritten text into machine-readable text. |
| Pose Estimation | Detecting human body keypoints and posture. |
| Image Generation / Enhancement | Tasks like super-resolution, image inpainting, and style transfer. |

Object Detection is a computer vision task focused on not only identifying the class of objects in an image or video but also locating them. Unlike image classification, which assigns a single label to the entire image, object detection provides both the category of each object and its spatial location using bounding boxes.

Example Use Cases of Object Detection

| Application Area | Example Use Case | Companies / Countries Using It |
|---|---|---|
| Autonomous Vehicles | Detecting pedestrians, vehicles, traffic signs | Tesla Autopilot (USA), Waymo (USA), Baidu Apollo (China) |
| Security & Surveillance | Monitoring public spaces for intruders | Airports & banks globally, Hikvision (China), Dahua (China) |
| Retail & Inventory Management | Product detection, automated checkout | Amazon Go (USA), Walmart smart shelves (USA), JD.com (China) |
| Healthcare & Medical Imaging | Detecting tumors or abnormalities in scans | IBM Watson Health (USA), Zebra Medical Vision (Israel), Aidoc (Israel) |
| Industrial Automation & Robotics | Detecting defects, sorting objects | Siemens (Germany), FANUC (Japan), ABB Robotics (Sweden/Switzerland) |
| Agriculture | Crop disease detection, yield estimation | Drone-based monitoring in USA, Netherlands, India |
| Augmented Reality & Gaming | Overlaying virtual elements on real objects | Pokémon Go (Global), IKEA Place (Global) |
| Wildlife & Environmental Monitoring | Counting animals, detecting poaching | African safari reserves, WWF projects, global camera traps |

Face Recognition

Face Recognition is a computer vision technology that identifies or verifies a person by analyzing facial features from an image or video. It matches the detected face against a database to recognize the individual.

Components of Face Recognition

| Component | Description |
|---|---|
| Face Detection | Locating a face within an image or video frame. |
| Feature Extraction | Measuring unique facial characteristics like distance between eyes, nose shape, or jawline. |
| Face Matching / Recognition | Comparing extracted features with a database to identify or verify a person. |

Applications of Face Recognition

  • Security & Surveillance: Airport security, public safety monitoring
  • Smartphones & Devices: Unlocking phones using Face ID
  • Banking & Payments: Biometric authentication for transactions
  • Social Media: Automatic tagging of people in photos
  • Law Enforcement: Identifying suspects or missing persons

Scene Understanding

Scene Understanding is a computer vision task where a system interprets an entire scene in an image or video, identifying objects, relationships, spatial layout, and context to understand what is happening.
It goes beyond object detection to analyze the overall environment and interactions.

Components of Scene Understanding

| Component | Description |
|---|---|
| Object Detection | Identifying individual objects within the scene. |
| Semantic Segmentation | Classifying each pixel in the image according to object type or region (e.g., road, sky, car). |
| Instance Segmentation | Differentiating multiple instances of the same object type. |
| Contextual Understanding | Recognizing relationships and interactions between objects (e.g., a person riding a bicycle). |
| Scene Classification | Determining the overall type of scene (e.g., beach, city street, forest). |

Applications of Scene Understanding

  • Autonomous Vehicles: Understanding traffic scenes, predicting pedestrian and vehicle behavior
  • Robotics: Navigation and manipulation in complex environments
  • Surveillance: Detecting unusual or suspicious activities in public spaces
  • Augmented Reality: Accurately overlaying virtual objects in real-world scenes
  • Smart Cities: Monitoring urban environments for traffic and crowd analysis

Medical Imaging

Medical Imaging refers to techniques and processes used to create visual representations of the interior of the body for clinical analysis, diagnosis, and treatment planning. AI and computer vision enhance medical imaging by automatically analyzing images, detecting anomalies, and assisting healthcare professionals.

Natural Language Processing (NLP)

Natural Language Processing (NLP) is a branch of artificial intelligence and linguistics that enables computers to comprehend, interpret, generate, and interact with human language, effectively bridging the gap between human communication and machine understanding.

Text Pre-processing

Text pre-processing is an essential step in NLP. Common techniques include:

  • Lowercasing: Converting text to lowercase to ensure consistency and avoid case-related duplication (e.g., computer and Computer).
  • Tokenization: Splitting text into individual words or tokens (e.g., ‘and’, ‘the’).
  • Stop Word Removal: Eliminating common, uninformative words that don’t add meaning to the analysis.
  • Punctuation Removal: Removing punctuation marks that are often irrelevant in many NLP tasks.
  • Numerical and Special Character Removal: Removing numbers and other non-alphabetic characters based on the analysis need.
  • Whitespace Trimming: Removing unnecessary spaces, tabs, or line breaks.
  • Lemmatization and Stemming: Reducing words to their base or root form (e.g., running → run), consolidating related words.
  • Spell Checking and Correction: Identifying and correcting spelling errors.
  • Handling Contractions and Abbreviations: Expanding contractions (e.g., can’t → cannot) and standardizing abbreviations.
  • Handling HTML Tags: Removing or stripping HTML tags in text data.
  • Text Normalization: Standardizing text formats, such as converting dates to a consistent format.
  • Removing or Masking Personal Identifiable Information (PII): Replacing or removing sensitive information like names, addresses, or social security numbers for privacy and compliance.
  • Removing URLs and Email Addresses: Eliminating URLs and email addresses that may not be relevant for analysis.
  • Text Segmentation: Splitting text into segments or paragraphs as required by analysis tasks.
  • Sentence and Document Length Normalization: Ensuring uniform sentence/document lengths for tasks such as text classification.
  • Encoding and Decoding: Converting text between different character encodings whenever necessary.
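
A minimal sketch of a few of these steps (lowercasing, punctuation removal, tokenization, stop word removal) in plain Python; the tiny stop word set is an illustrative subset, not a standard list.

```python
# Minimal text pre-processing pipeline in plain Python.
import re

STOP_WORDS = {"the", "is", "a", "an", "and", "of", "to"}  # illustrative subset

def preprocess(text):
    text = text.lower()                      # lowercasing
    text = re.sub(r"[^a-z\s]", " ", text)    # punctuation/number removal
    tokens = text.split()                    # tokenization (on whitespace)
    return [t for t in tokens if t not in STOP_WORDS]  # stop word removal

print(preprocess("The Computer is learning, and THE computer improves!"))
# -> ['computer', 'learning', 'computer', 'improves']
```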

Important Tasks in Natural Language Processing

| Task | Description |
|---|---|
| Text Classification | Categorizing text into predefined labels (e.g., spam detection, sentiment analysis). |
| Named Entity Recognition (NER) | Identifying entities such as names, locations, dates, and organizations in text. |
| Part-of-Speech (POS) Tagging | Determining the grammatical role of each word in a sentence. |
| Machine Translation | Translating text from one language to another (e.g., English → French). |
| Question Answering | Extracting answers from text based on a given question. |
| Text Summarization | Producing concise summaries from longer documents. |
| Sentiment Analysis | Detecting emotions or opinions expressed in text. |
| Speech Recognition & Generation | Converting speech to text (ASR) or text to speech (TTS). |
| Dialogue Systems / Chatbots | Understanding and generating human-like conversational responses. |

Text Classification

Text classification is the process of assigning predefined labels or categories to a given piece of text.

  • In sentiment analysis, text is classified as positive, negative, or neutral based on the expressed opinion.
  • In topic classification, text is categorized into specific subjects or domains, making it easier to organize and manage information.

A simple sentiment analysis model takes text as input and outputs a label (e.g., positive, negative, neutral).
Although basic models exist, real-world applications often require more advanced techniques.

Examples of Sentiment Analysis:

“Very good, solid, good balance, comfortable… loved it” → Positive

Example of Topic Classification: Categorizing a news article under topics like Politics, Sports, or Technology for better organization.
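
A minimal sentiment classification sketch using scikit-learn: a bag-of-words representation feeding a Naive Bayes classifier. The four training reviews are invented, and a real system would need far more data.

```python
# Minimal sentiment classifier: bag-of-words + Naive Bayes (scikit-learn).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

texts = ["loved it, very comfortable", "solid and good balance",
         "terrible quality, broke fast", "very bad, do not buy"]
labels = ["positive", "positive", "negative", "negative"]

model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(texts, labels)                           # training phase

print(model.predict(["good balance, loved it"]))   # -> ['positive']
```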

Named Entity Recognition (NER)

Named Entity Recognition (NER) is a technique in Natural Language Processing (NLP) that focuses on identifying and extracting specific entities from text, such as:

  • names of people
  • organizations
  • locations
  • dates
  • times
  • other predefined categories

NER plays a key role in information extraction and enhances the overall contextual understanding of text by machines.

Examples of NER

  1. “The company TCS was founded in 1968.” NER identifies TCS as an organization and 1968 as a date.
  2. “The meeting is going to be held at 10:00 AM today.” NER identifies 10:00 AM as a time and today as a date.
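
A minimal NER sketch using the spaCy library, assuming it is installed along with its small English model (installed via python -m spacy download en_core_web_sm):

```python
# Minimal NER sketch with spaCy's small pretrained English model.
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("The company TCS was founded in 1968.")

for ent in doc.ents:
    print(ent.text, ent.label_)   # e.g., TCS ORG / 1968 DATE
```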

Applications of NER

  • Information Extraction: Retrieves details such as people, organizations, and locations for tasks like building knowledge bases or generating reports.
  • Machine Translation: Improves translation accuracy by correctly recognizing named entities in the source text.
  • Text Summarization: Enhances summarization by identifying key entities and ensuring they are properly represented.

NER is essential in many NLP systems, including:

  • chatbots
  • sentiment analysis tools
  • search engines

It is widely used in domains requiring structured insights from unstructured text.

Question Answering (QA)

Question Answering (QA) systems are designed to automatically provide answers to questions posed in natural language.
These systems analyze the query, search for relevant information, and generate appropriate responses. QA systems can retrieve answers from:

  • databases
  • documents
  • knowledge bases
  • real-time sources

Types of QA Tasks

  1. Extractive QA – The system extracts the exact answer directly from the given text.
     Example: “What is the capital of India?” Answer: New Delhi
  2. Abstractive QA – The system generates answers in its own words, not limited to the original text.
     Example: “What is the meaning of life?” Answer: a reflective or philosophical response.

Classifications Based on Scope

  • Closed-domain QA: Focuses on questions within a specific field (e.g., medicine, law).
  • Open-domain QA: Handles questions across diverse topics without domain restrictions.
  • Knowledge-base QA: Uses structured knowledge sources (e.g., DBpedia, Freebase) to answer fact-based queries.

Applications of QA Systems

  • Search engines: Provide direct answers to user queries.
  • Virtual assistants: Power systems like Amazon Alexa and Google Assistant.
  • Knowledge bases: Help users query structured information (product catalogs, medical databases).

QA systems are also used in platforms like Quora and in Kaggle competitions (e.g., identifying duplicate questions).
Modern large language models (LLMs) like ChatGPT combine retrieval and generative approaches to produce high-quality answers.

Machine Translation (MT)

Machine Translation is the process of automatically converting text or speech from one language to another.
It works by understanding the input language, creating an intermediate form, and then producing the translated output in the target language.

Three Main Approaches to Machine Translation:

  1. Rule-based MT: Uses grammar rules and dictionaries of both languages to perform translation.
  2. Statistical MT: Learns translation patterns by analyzing large collections of parallel texts in two languages.
  3. Neural / AI-based MT: Uses artificial neural networks to learn complex language patterns and produce more natural translations.

Example: Governments (in India, for instance) promote machine translation to support digital empowerment and digital inclusion. Initiatives like Bhasha Daan encourage citizens to contribute open-source language datasets, and global tools like Google Translate support more than 120 languages.

Text Generation

Text Generation is the process of automatically creating meaningful and natural-sounding text.
It can handle simple tasks (product descriptions) as well as advanced ones (story writing, language modelling).

Approaches to Text Generation

  1. Rule-based methods: Follow predefined grammar rules.
  2. Statistical methods: Use patterns learned from large text datasets.
  3. Neural / AI-based methods: Use deep learning models to produce fluent, human-like text.

Common Uses of Text Generation

  • Chatbots: AI programs simulating conversations with humans; widely used in customer support.
  • Content creation: Automatically generating articles, blogs, or social media posts to help produce engaging content quickly.

Text Summarization

Text Summarization is the process of creating short and clear summaries of long texts while keeping the main ideas and important details. It helps people quickly understand large amounts of information.

Types of Summarization

  1. Extractive Summarization
  • Picks important sentences or phrases directly from the original text.
  • Keeps the exact wording but may sound less smooth or natural.

Img - AI unit 1_24.jpg

  2. Abstractive Summarization
  • Creates new sentences that may not exist in the original text.
  • Rewrites the content in a shorter and more natural way.
  • Produces more human-like summaries, but is harder to do.

Like machine translation, summarization can use rules, statistical methods, or AI-based techniques; a minimal extractive sketch follows.
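
A minimal extractive summarization sketch in plain Python: sentences are scored by the frequency of the words they contain, and the top-scoring ones are kept in their original order. The scoring scheme is a simple illustrative choice, not a production method.

```python
# Minimal extractive summarizer: frequency-based sentence scoring.
import re
from collections import Counter

def summarize(text, n_sentences=2):
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())   # sentence split
    freq = Counter(re.findall(r"[a-z']+", text.lower()))   # word frequencies
    # Score each sentence by the total frequency of its words.
    scores = [(sum(freq[w] for w in re.findall(r"[a-z']+", s.lower())), i, s)
              for i, s in enumerate(sentences)]
    # Keep the n best sentences, restored to their original order.
    top = sorted(sorted(scores, reverse=True)[:n_sentences], key=lambda t: t[1])
    return " ".join(s for _, _, s in top)

print(summarize("AI is changing industry. AI systems learn from data. "
                "Many tools exist. Data quality matters for AI systems."))
```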

Common Uses of Text Summarization

  • News summarization: Quick summaries of news articles for easy reading.
  • Document summarization: Short versions of long research papers, reports, or documents.
  • Social media summarization: Condensing long posts or discussions into key points.
  • Content curation: Combining multiple sources into a single concise overview.

Text summarization helps people save time and easily find important information in large volumes of text.

Robotics

Robotics is a branch of science and engineering focused on designing, building, and using robots.
Robots are programmable machines that can perform tasks automatically or with human guidance.

Robotics integrates knowledge from:

  • mechanical engineering
  • electrical engineering
  • computer science
  • artificial intelligence

Key Components of Robotics

  1. Sensors – Help robots sense surroundings (e.g., cameras, microphones, touch sensors).
  2. Actuators – Parts that move the robot (e.g., motors, wheels, arms).
  3. Control System – The “brain” of the robot; processes information and makes decisions.
  4. Power Supply – Provides energy for the robot to operate.

Types of Robots

  • Industrial Robots: Used in factories for manufacturing, welding, and assembly.
  • Service Robots: Used in sectors like healthcare, cleaning, and customer service.
    Img - AI unit 1_25.jpg
  • Military Robots: Used for surveillance, bomb disposal, and defense.
  • Humanoid Robots: Designed to look or behave like humans.
  • Autonomous Robots: Self-driving cars and drones that operate with minimal human input.

Applications of Robotics

  • Manufacturing: Automating repetitive tasks in industries.
  • Healthcare: Assisting in surgeries, rehabilitation, and patient care.
  • Exploration: Space rovers (e.g., on Mars) and deep-sea robots.
  • Agriculture: Automated harvesting, planting, and monitoring crops.
  • Household: Robotic vacuum cleaners and personal assistants.
  • World domination: a popular conspiracy theory claims that robots will one day take over the world from their human creators.

Robotics has advanced rapidly with AI and machine learning, making robots more intelligent, adaptive, and capable of working alongside humans.

Comparison of Humanoid Robots

| Robot | Developer | Year | Height | Purpose | Mobility | AI / Interaction |
|---|---|---|---|---|---|---|
| ASIMO | Honda | 2000 | 130 cm | Research, assistance | Walks, runs, climbs stairs | Recognizes faces, voices, interacts with humans |
| Atlas | Boston Dynamics | 2013 | 150–180 cm | Research, disaster response | Walks, runs, jumps, parkour | Limited AI; focus on navigation and manipulation |
| Pepper | SoftBank Robotics | 2014 | 120 cm | Customer service, social interaction | Wheels, limited movement | Recognizes emotions, talks, guides people |
| Nao | SoftBank Robotics | 2006 | 58 cm | Education, research | Walks, dances, gestures | Programmable for interaction and teaching |
| Sophia | Hanson Robotics | 2016 | 165 cm | Social interaction, AI research | Limited mobility | Conversational AI, facial expressions, emotion recognition |
| iCub | Italian Institute of Technology | 2004 | 104 cm | Cognitive research | Walks, manipulates objects | Learns via exploration and interaction |
| Robonaut 2 (R2) | NASA / GM | 2011 | 180 cm | Space station assistance | Works in microgravity | Teleoperated with some autonomy |
