Building LSTMs with PyTorch and Lightning AI Part 4: Training Step and Initial Predictions

wpnews.pro

cd /news/machine-learning/building-lstms-with-pytorch-and-ligh… · home › topics › machine-learning › article

[ARTICLE · art-41209] src=dev.to ↗ pub=2026-06-26T19:20Z topic=machine-learning verified=true sentiment=· neutral

Building LSTMs with PyTorch and Lightning AI Part 4: Training Step and Initial Predictions

A developer building LSTMs with PyTorch and Lightning AI implemented the training_step function and ran initial predictions without training. The model predicted Company A's stock price reasonably close to the observed value but Company B's prediction was far off, indicating the need for training.

read2 min views1 publishedJun 26, 2026

In the previous article, we finished the LSTM cell, explored the forward method and the Adam optimizer for the model.

In this article, we will explore the training_step()

function, and try to run the model without training.

The training_step()

function takes a batch of training data from one of the two companies, along with the index of that batch.

It then uses the forward()

function to make a prediction for that training example.

def training_step(self, batch, batch_idx):
    input_i, label_i = batch
    output_i = self.forward(input_i[0])
    loss = (output_i - label_i)**2

Next, it calculates the loss, which is the squared residual between the predicted value and the observed value.

We can also log the loss to easily track how it changes during training.

Lightning provides the log()

function for this purpose. It automatically stores the logs in a lightning_logs

directory.

We can log other values as well, such as the predictions for Company A and Company B.

Finally, we return the loss.

def training_step(self, batch, batch_idx):
    input_i, label_i = batch
    output_i = self.forward(input_i[0])
    loss = (output_i - label_i)**2

    self.log("train_loss", loss)

    if label_i == 0:
        self.log("out_0", output_i)
    else:
        self.log("out_1", output_i)

    return loss

So far, we have implemented the following:

lstm_unit()

.forward()

method to perform a forward pass through the unrolled LSTM.configure_optimizers()

.training_step()

.Now let's try using the model.

model = LSTMByHand()

print("\nComparing observed and predicted values")

print(
    "Company A: Observed = 0, Predicted =",
    model(torch.tensor([0., 0.5, 0.25, 1.])).detach()
)

print(
    "Company B: Observed = 1, Predicted =",
    model(torch.tensor([1., 0.5, 0.25, 1.])).detach()
)

Here, we pass a tensor containing the stock prices for Days 1 through 4. The model then predicts the value for Day 5.

The model returns both the prediction and its associated computation graph. We call .detach()

to remove the computation graph and retrieve only the prediction.

Running the code produces the following output:

Comparing observed and predicted values
Company A: Observed = 0, Predicted = tensor(-0.2321)
Company B: Observed = 1, Predicted = tensor(-0.2360)

The prediction for Company A is reasonably close to the observed value.

However, the prediction for Company B is quite far from the expected value.

In the next article, we will train the model to improve these predictions.

AI agents write code fast. They also silently remove logic, change behavior, and introduce bugs -- without telling you. You often find out in production.

git-lrc fixes this. It hooks into git commit and reviews every diff before it lands. 60-second setup. Completely free.

Any feedback or contributors are welcome! It's online, source-available, and ready for anyone to use.

Give it a ⭐ star on Github

source & further reading

dev.to — original article How a .NET dev built an AI assistant Has Anyone Measured How LLM Output Quality Degrades Across Multiple Compactions? what i learned on day 1 of a 3D reconstruction internship

~/api · this article 200

$curl api.wpnews.pro/v1/news/building-lstms-with-pyto…

Read original on dev.to → dev.to/rijultp/building-lstms-with-pytorch-and-l…

mentioned entities

PyTorch

Lightning AI

Adam optimizer

LSTM

Company A

Company B

git-lrc

HexmosTech

metadata

slugbuilding-lstms-with-pytorch-and-lightning-ai-part-4-training-step-and-initial

topic#machine-learning

secondary2 topics

sentimentneutral

canonicaldev.to

navigation

← prevXprize Founder Insists All the N…

next →How a .NET dev built an AI assis…

── more in #machine-learning 4 stories · sorted by recency

dev.to · 26 Jun · #machine-learning

Cat Dog Classification CNN Project

dev.to · 24 Jun · #machine-learning

Building LSTMs with PyTorch and Lightning AI Part 3: Finishing the LSTM Cell

dev.to · 26 Jun · #machine-learning

AI & Machine Learning Servers: The Hidden Infrastructure Powering the AI Revolution

dev.to · 26 Jun · #machine-learning

what i learned on day 1 of a 3D reconstruction internship

── more on @pytorch 3 stories trending now

wpnews · 19 Oct · #developer-tools

Windows Script to clean up and remove all ASUS software

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Nov · #developer-tools

Custom Zig Test Runner, better ouput, timing display, and support for special "tests:beforeAll" and "tests:afterAll" tests

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required