Predict Airfoil Lift with PyTorch

Replace a wind tunnel with a neural network that predicts lift in milliseconds.

High School Aerodynamics 2–4 weeks

Last reviewed: March 2026

Overview

Calculating the lift produced by a wing has traditionally required either expensive wind-tunnel testing or time-consuming computational fluid dynamics simulations. Machine learning offers a third path: train a neural network on thousands of known airfoil/lift-coefficient pairs, and it can make new predictions in microseconds. This is called a surrogate model, and it is one of the hottest techniques in modern aerospace design.

In this project you will use the UIUC Airfoil Coordinate Database—a collection of thousands of real airfoil profiles used on everything from gliders to fighter jets—to build a dataset of shape parameters and lift coefficients computed at a fixed angle of attack. You will design a simple feedforward neural network in PyTorch, train it on 80% of the data, and validate it on the remaining 20%.

PyTorch is the framework of choice for AI research at organizations like OpenAI, Meta, and NASA's Jet Propulsion Laboratory. Learning it now—even for a project this focused—gives you a genuine head start on the skills aerospace companies are actively hiring for in their ML engineering teams.

What You'll Learn

✓ Describe what a surrogate model is and explain when ML is preferable to direct simulation.
✓ Preprocess numerical tabular data for neural network training (normalization, train/test split).
✓ Define a feedforward neural network in PyTorch using <code>nn.Module</code>.
✓ Write a training loop with a loss function, optimizer, and validation step.
✓ Interpret R² score and MAE as regression metrics for a physics-based prediction task.

Step-by-Step Guide

Set up PyTorch and acquire data

Install PyTorch by following the instructions at pytorch.org for your OS (CPU-only is fine). Also install pandas, numpy, scikit-learn, and matplotlib. Download the UIUC Airfoil Database coordinate files from m-selig.ae.illinois.edu. Alternatively, use the pre-processed NACA airfoil dataset available on Kaggle that includes Cl values computed by XFOIL.

Build and explore the feature set

For each airfoil, extract geometric features: maximum camber, maximum camber position, maximum thickness, and thickness position (for NACA 4-digit series these come directly from the four digits). Your target variable is Cl (lift coefficient) at a fixed angle of attack—use 5° as a reasonable subsonic cruise condition. Create a pandas DataFrame with these five columns, drop any rows with NaN, and plot histograms of each feature to understand the distributions.

Normalize data and create PyTorch Datasets

Use StandardScaler from scikit-learn to normalize all features and the target to zero mean and unit variance (fit only on training data, then transform both sets). Convert your numpy arrays to PyTorch tensors with torch.tensor(..., dtype=torch.float32). Wrap them in a TensorDataset and create DataLoader objects with batch_size=32 and shuffle=True for the training set.

Define and initialize the neural network

Create a class LiftPredictor(nn.Module) with an __init__ that defines three linear layers: input→64, 64→32, 32→1. Add ReLU activations after the first two layers. In the forward method, pass input through each layer in sequence. Initialize the model, define nn.MSELoss() as your criterion, and use optim.Adam(model.parameters(), lr=1e-3) as optimizer.

Train the model and monitor loss

Write a training loop that runs for 100 epochs. In each epoch, iterate over batches: zero gradients, forward pass, compute loss, backward pass, optimizer step. After each epoch, compute validation loss without gradient tracking using torch.no_grad(). Plot training and validation loss curves. If validation loss rises while training loss continues to fall, you are overfitting—try adding a Dropout layer with rate 0.2.

Evaluate and interpret results

Un-scale predictions and targets back to original units using the scaler's inverse_transform. Compute R² score and mean absolute error with scikit-learn metrics. Create a scatter plot of predicted vs. actual Cl with a dashed diagonal line for reference. Calculate how long a single prediction takes vs. running XFOIL—typical speedup is 1000× or more. Document which airfoil shapes your model struggles with most and why.

Career Connection

See how this project connects to real aerospace careers.

Go Further

Extend the model to predict drag coefficient Cd as a second output, turning it into a multi-output regressor.
Train on data at multiple angles of attack (−5° to 15°) so the model learns the full polar curve, not just one operating point.
Use SHAP (SHapley Additive exPlanations) to identify which geometric feature has the largest influence on predicted lift.
Compare your neural network to a Gaussian Process surrogate model using scikit-learn and discuss the trade-offs in interpretability and prediction uncertainty.

Related Projects

High School Visualize Airflow Over a Wing with OpenFOAM Run a real computational fluid dynamics simulation and watch the airflow come alive. View Project → High School Predict Engine Failure with Kaggle Data Use real NASA sensor data to predict when a jet engine is about to fail. View Project →

← Back to All Projects More High School → High School Projects