tech

PyTorch

Python deep learning framework - dynamic graphs, Pythonic API, strong research community

TL;DR

What: An open-source machine learning framework based on Python.

Why: Dynamic computation graphs, Pythonic API, strong research community, production ready.

Quick Start

Install:

pip install torch torchvision

Hello PyTorch:

import torch

# Create tensors
x = torch.tensor([1, 2, 3])
y = torch.tensor([4, 5, 6])
z = x + y
print(z)  # tensor([5, 7, 9])

# Check GPU
print(torch.cuda.is_available())

Cheatsheet

Operation	Code
Create tensor	`torch.tensor([1, 2, 3])`
Zeros/ones	`torch.zeros(3, 3)`
Random	`torch.rand(3, 3)`
Shape	`x.shape`
Reshape	`x.view(2, 3)`
To GPU	`x.to('cuda')`
Gradient	`x.requires_grad_(True)`

Gotchas

Neural network

import torch.nn as nn

class SimpleNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(784, 128)
        self.fc2 = nn.Linear(128, 10)
        self.relu = nn.ReLU()

    def forward(self, x):
        x = self.relu(self.fc1(x))
        return self.fc2(x)

model = SimpleNN()

Training loop

import torch.optim as optim

criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.parameters(), lr=0.001)

for epoch in range(epochs):
    for inputs, labels in dataloader:
        optimizer.zero_grad()
        outputs = model(inputs)
        loss = criterion(outputs, labels)
        loss.backward()
        optimizer.step()

GPU usage

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = model.to(device)
inputs = inputs.to(device)

Save and load

# Save
torch.save(model.state_dict(), 'model.pth')

# Load
model.load_state_dict(torch.load('model.pth'))
model.eval()

Next Steps

PyTorch Documentation - Official docs
PyTorch Tutorials - Step-by-step
PyTorch Examples - Code examples
Hugging Face - Pre-trained models