Why views.py Usually Does NOT Import urls.py in Django (Beginner Confusion Explained)

This topic is empty.

Viewing 1 post (of 1 total)

Author

Posts
May 23, 2026 at 9:38 am #6630
Rajeev Bagra
Keymaster
A beginner learning Kaggle’s Model Validation tutorial may naturally expect that:
```
val_predictions  ==  val_y
```
because both appear to represent the same thing:
- house prices
- dependent variable (y)
- target values
So many learners wonder:

If both contain house prices, then why are there two separate variables?

The Actual Difference

Although both relate to the same target variable, they represent completely different things:

Variable Meaning

val_y Actual correct prices from dataset

val_predictions Prices guessed by the machine learning model

The Initial Kaggle Code

In the Kaggle tutorial, learners typically see code like this:
```
# get predicted prices on validation data
val_predictions = iowa_model.predict(val_X)

# print the top few validation predictions
print(val_predictions[:5])

# print the top few actual prices from validation data
print(val_y.head())
```
This often creates confusion because both outputs look like house prices.

What Actually Happens During Validation

Machine learning models are first trained using:
```
train_X
train_y
```
After training, the model is tested using validation data:
```
val_X
val_y
```
Here:
- val_X contains house features
- val_y contains the real house prices
Now the model is asked:

“Can you predict the prices for these houses?”

That prediction step happens here:
```
val_predictions = iowa_model.predict(val_X)
```
Important Beginner Insight

Think of:
- val_y as the answer sheet
- val_predictions as the student’s answers
The entire purpose of validation is to compare them.

Example

Suppose the real prices are:
```
val_y
```
House Actual Price

1 200000

2 150000

3 300000

Now suppose the model predicts:
```
val_predictions
```
House Predicted Price

1 210000

2 140000

3 310000

Notice:
- they are close
- but not identical
That difference is the prediction error.

Why Validation Exists

Without validation, we would never know:
- whether the model predicts accurately
- how close predictions are to reality
- whether the model generalizes well
Validation compares:
```
predicted values  vs  actual values
```
Machine Learning Is About Approximation

A beginner often unconsciously assumes:

“If the model predicts house prices, then the predictions should automatically equal the real prices.”

But machine learning is actually:

an attempt to estimate unknown outputs as closely as possible.

If predictions were always identical to actual values:
- there would be no prediction problem
- there would be no uncertainty
- machine learning would not even be necessary
How Kaggle Measures Error

One common metric used in the tutorial is MAE (Mean Absolute Error):

MAE = \frac{1}{n}\sum_{i=1}^{n}|y_i-\hat{y}_i|

where:
- y_i = actual values
- \hat{y}_i = predicted values
Smaller MAE means:
- predictions are closer to reality
- the model performs better
Another Important Observation

Notice the syntax difference:
```
print(val_predictions[:5])
print(val_y.head())
```
Why not use .head() for both?

Because:

Variable Data Type

val_predictions NumPy array

val_y Pandas Series

So:
- - NumPy arrays commonly use slicing:
    val_predictions[:5]
[code lang=text]
<ul>
<li>Pandas objects commonly use:
[/code]
```
val_y.head()
```
Core Takeaway

The important relationship is:
```
val_predictions = model guesses
val_y            = real answers
```
Machine learning validation exists to compare those two things.

The closer they are:
- the better the model
- the smaller the prediction error
- the more reliable the model becomes
That comparison is one of the central ideas behind model validation in machine learning.
Author

Posts

Variable	Meaning
`val_y`	Actual correct prices from dataset
`val_predictions`	Prices guessed by the machine learning model

House	Actual Price
1	200000
2	150000
3	300000

House	Predicted Price
1	210000
2	140000
3	310000

Variable	Data Type
`val_predictions`	NumPy array
`val_y`	Pandas Series

Viewing 1 post (of 1 total)

You must be logged in to reply to this topic.

Additional menu

The Actual Difference

The Initial Kaggle Code

What Actually Happens During Validation

Important Beginner Insight

Example

Why Validation Exists

Machine Learning Is About Approximation

How Kaggle Measures Error

Another Important Observation

Core Takeaway