AI in a nutshell

ML allows computers to learn tasks directly from data(datasets)

Machine learning can be said to be a subfield of AI, which itself is a subfield of computer science.
Key concept:

Deep learning is a subfield of machine learning, which itself is a subfield of AI, which itself is a subfield of computer science.
Key concept:

Taxonomy of AI:

A taxonomy is a scheme for classifying many things that may be special cases of one another.
CS --> AI --> ML --> DL --> Data Science
! 200
The taxonomy scheme is composed of Euler diagrams overlapping with each others, similar to what happens with CS, AI, ML, DL, and DS

Key terminology:

This happens in 2 phases:

attachments/Screenshot 2025-06-04 at 17.30.29.png

  1. Training dataset (set of data), passing it to a learning algorithm, the result of the algorithm is a ML model.
    attachments/Screenshot 2025-06-04 at 17.33.24.png
  2. Once we have the ML model, we can feed it new data to make predictions or solving problems.

Inference: Using a linear model to make predictions

Linear model : ! 150

When we feed data to a ML algorithm, the quality of our prediction is determined by how well the parameters are aligned with reality.

How to pick parameters for accurate predictions?

This happens during the training phase (Phase 1)
In training, we want to minimise the discrepancy between our model predictions and real-world data.

How to do it? We quantify this discrepancy, also called the loss function.
! 300

Lost function equation

By taking the difference between these actual values and the model prediction of values.

As the goal is to find parameter values that gives us the smallest loss, we can take a mathematical approach like this:
! 300

More ML techniques: Different ways to learn parameters from data

attachments/Screenshot 2025-06-04 at 18.01.22.png

All these approaches are traditional Machine Learning Algorithms that can be used in different contexts.

The main challenge is to select the correct input variable (feature) for the model's prediction.

The process of picking the right input variable is called feature engineering.

Another way to implement AI is through Deep Learning:

One of these is the Neural Networks that learn optimal features on their own. (Remember that finding the right input feature was the main goal in traditional ML )

! 400
Deep Neural Network doing image recognition

The key technique in Deep Learning is neural networks, which are a series of operations that can approximate any function.

! 400

Neuron

We can see that on the right, the equation looks a lot like the linear model. But the key difference is the activation function (g), which is a nonlinear function.

Build up a network of Neurons:

! 100
We combine a set of datapoints as a neuron
!100
Combine multiple neuron with multiple inputs to form a layer
! 200
Combine multiple layers together to form a network = Neural Network

Type of Neurons:

! 200

Type of activation functions:

! 250

Type of Layers:

! 300

Type of Network architectures:

! 250

Training Neural Networks:
Powered by Forestry.md