This is the PyTorch-based source code for the paper *Generative AI for fast and accurate statistical computation of fluids*.
GenCFD is a PyTorch-based implementation designed for training and evaluating conditional score-based diffusion models for Computational Fluid Dynamics (CFD) tasks. These generative AI models enable fast, accurate, and robust statistical computation for simulating both two-dimensional and three-dimensional turbulent fluid flows.
To set up a virtual environment and install the necessary dependencies for this project, follow these steps:

1. Create a virtual environment:

   ```bash
   python3 -m venv venv
   ```

2. Activate the virtual environment:

   ```bash
   source venv/bin/activate
   ```

3. Install dependencies (with the virtual environment active):

   ```bash
   pip install -r requirements.txt
   ```

4. Optional: install GenCFD as a library. If you would like to use GenCFD and its functions as a library, you can install it directly from the repository instead of step 3:

   ```bash
   pip install git+https://github.com/camlab-ethz/GenCFD.git@main
   ```
Train a model from the root directory by using:
```bash
python3 -m GenCFD.train.train_gencfd \
    --dataset <DATASET_NAME> \
    --model_type <MODEL_NAME> \
    --save_dir <DIRECTORY_PATH> \
    --num_train_steps <INT>
```
`<DATASET_NAME>` should be a valid dataset from the Datasets section. `<MODEL_NAME>` should be either `PreconditionedDenoiser` for the two-dimensional case or `PreconditionedDenoiser3D` for the three-dimensional case.

- 3D models have approximately 70M parameters at (64, 64, 64) resolution.
- Recommended: a GPU with 32 GB of memory (batch size 5) or 24 GB of memory (batch size 4).
After training, a JSON file with model settings is saved in the output directory for use during inference.
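As an illustration of how that settings file might be picked up later, here is a small, hypothetical stdlib-only helper (the exact JSON file name and its keys depend on the training run; `load_model_settings` is not part of GenCFD):

```python
import json
from pathlib import Path

def load_model_settings(save_dir):
    """Load every JSON settings file found in a training output directory.

    Hypothetical helper: training writes a JSON file with model settings
    into --save_dir; the exact file name depends on the run, so we simply
    collect all JSON files in the directory.
    """
    settings = {}
    for cfg_path in Path(save_dir).glob("*.json"):
        with cfg_path.open() as f:
            settings[cfg_path.name] = json.load(f)
    return settings
```

At inference time, `--model_dir` should point at the same directory so these stored settings can be read back.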
For faster training, compile the model and run it in parallel:

```bash
torchrun --nproc_per_node=<INT> \
    -m GenCFD.train.train_gencfd \
    --world_size <INT> \
    --dataset <DATASET_NAME> \
    --model_type <MODEL_NAME> \
    --save_dir <DIRECTORY_PATH> \
    --num_train_steps <INT> \
    --compile
```
The `--world_size` flag determines the size of the process group associated with a communicator; it must match the number of processes (trainers) used for parallelization, which is set via `--nproc_per_node`. Another tip for fast training is to choose an appropriate number of data-loading workers, specified through the `--worker` flag.
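As a sketch of why the two values must match: `torchrun` spawns `--nproc_per_node` processes and exports the group size to each of them through the `WORLD_SIZE` environment variable, which `torch.distributed` reads when initializing the process group. A minimal stdlib-only illustration (`check_world_size` is a hypothetical helper, not a GenCFD function):

```python
import os

def check_world_size(expected: int) -> int:
    """Verify that the group size exported by torchrun matches the
    value passed on the command line (e.g. via --world_size)."""
    world_size = int(os.environ.get("WORLD_SIZE", "1"))
    if world_size != expected:
        raise ValueError(
            f"--world_size ({expected}) must equal --nproc_per_node "
            f"({world_size} processes were spawned)"
        )
    return world_size

# torchrun --nproc_per_node=4 ... would set WORLD_SIZE=4 in each process;
# we simulate that here for illustration.
os.environ["WORLD_SIZE"] = "4"
check_world_size(4)
```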
Run inference with:
```bash
python3 -m GenCFD.eval.evaluate_gencfd \
    --dataset <DATASET_NAME> \
    --model_type <MODEL_NAME> \
    --model_dir <DIRECTORY_PATH> \
    --compute_metrics \
    --monte_carlo_samples <INT> \
    --visualize \
    --save_gen_samples \
    --save_dir <DIRECTORY_PATH>
```
Run inference in parallel:

```bash
torchrun --nproc_per_node=<INT> \
    -m GenCFD.eval.evaluate_gencfd \
    --world_size <INT> \
    --dataset <DATASET_NAME> \
    --model_type <MODEL_NAME> \
    --model_dir <DIRECTORY_PATH> \
    --compute_metrics \
    --monte_carlo_samples <INT> \
    --visualize \
    --save_gen_samples \
    --save_dir <DIRECTORY_PATH>
```
Here, too, the number of processes spawned must be the same for both `--world_size` and `--nproc_per_node`. The model can also be compiled in either the parallel or the sequential setup.
- `--compute_metrics`: computes evaluation metrics (e.g., mean and standard deviation) using Monte Carlo simulations.
- `--visualize`: generates a single inference sample for visualization.
- `--save_gen_samples`: saves randomly selected samples drawn from a uniform distribution.
The number of sampling steps (`--sampling_steps`) for the Euler-Maruyama method should preferably be greater than 30 for convergence.
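For intuition on why more steps help: the Euler-Maruyama method discretizes an SDE dX = f(X) dt + g dW with finite time steps, and its discretization error shrinks as the step count grows. The following is a self-contained toy sketch on an Ornstein-Uhlenbeck process, not GenCFD's actual reverse-diffusion sampler:

```python
import math
import random

def euler_maruyama(x0, drift, diffusion, t_end, steps, rng):
    """Integrate dX = drift(x) dt + diffusion * dW with Euler-Maruyama."""
    dt = t_end / steps
    x = x0
    for _ in range(steps):
        dw = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment ~ N(0, dt)
        x = x + drift(x) * dt + diffusion * dw
    return x

# Toy Ornstein-Uhlenbeck SDE: dX = -X dt + 0.1 dW, X(0) = 1.
# With enough steps the endpoint stays close to exp(-1) ~ 0.368.
rng = random.Random(0)
x_T = euler_maruyama(1.0, lambda x: -x, 0.1, t_end=1.0, steps=100, rng=rng)
```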
The following table summarizes key arguments that can help optimize memory usage or fine-tune model performance.
- **Action arguments**: simply add the flag (e.g., `--track_memory`); there is no need to specify `True` or `False`.
- **Boolean flags**: require explicit specification of either `True` or `False` (e.g., `--use_mixed_precision True`).
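The distinction can be mimicked with a small `argparse` sketch (illustrative only; GenCFD's actual parser may differ, and `str2bool` is a hypothetical helper):

```python
import argparse

def str2bool(v: str) -> bool:
    """Parse an explicit True/False string for boolean flags."""
    if v.lower() in ("true", "1", "yes"):
        return True
    if v.lower() in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected True or False, got {v!r}")

parser = argparse.ArgumentParser()
# Action argument: presence alone means True.
parser.add_argument("--track_memory", action="store_true")
# Boolean flag: requires an explicit True/False value.
parser.add_argument("--use_mixed_precision", type=str2bool, default=True)

args = parser.parse_args(["--track_memory", "--use_mixed_precision", "False"])
```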
A compiled version of the model can be used by adding the `--compile` flag. The compiler works without issues on the following GPUs:

- NVIDIA GeForce RTX 3090
- NVIDIA GeForce RTX 4090
- NVIDIA Tesla V100-SXM2 (32 GB)
- NVIDIA A100

If the compiler emits warnings, you can always suppress them.
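One stdlib-only way to silence such warnings, assuming they are surfaced through Python's `warnings` machinery (compiler diagnostics printed directly to stderr would need a different approach):

```python
import warnings

# Suppress a noisy warning category before running the compiled model.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("ignore", UserWarning)
    warnings.warn("torch.compile fallback", UserWarning)  # would normally print
# Ignored warnings are never recorded, so nothing was emitted.
```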
| Argument | Type | Default | Scope | Description |
|---|---|---|---|---|
| `--nproc_per_node` | int | 1 | Both | Number of processes for training or evaluation with Distributed Data Parallel (DDP) across multiple devices. |
| `--world_size` | int | 1 | Both | To enable DDP training and parallelized evaluation, use the same integer value as for `--nproc_per_node`. |
| `--compile` | action | False | Both | Compiles the model for faster training. |
| `--dataset` | string | `<DATASET_NAME>` | Both | Dataset to use for training or evaluation. A list of available datasets is in the Datasets section. |
| `--save_dir` | string | `<DIRECTORY_PATH>` | Both | Directory to save models and metrics. If it doesn't exist, it is created automatically. The path is relative to the root directory. |
| `--model_type` | string | `PreconditionedDenoiser` | Both | Model type to use. For 2D, options include `PreconditionedDenoiser`; for 3D, `PreconditionedDenoiser3D` is recommended. |
| `--normalize_qk` | bool | False | Both | Should be used for the Nozzle3D dataset to stabilize training and backpropagation. Applies an L2 norm to the key and query matrices in the axial self-attention layer. |
| `--padding_method` | str | `circular` | Both | Padding method used for the dataset. Default is `circular`, but it can be set to `zeros` for datasets like Nozzle3D where circular padding is not appropriate. |
| `--batch_size` | int | 5 | Both | Number of samples per batch for the dataloader. |
| `--consistent_weight` | float | 0.0 | Train | Weight for a variance loss in the diffusion model that regularizes training by controlling consistency. |
| `--num_train_steps` | int | 10_000 | Train | Number of training steps. Increase for more training epochs or higher accuracy. |
| `--track_memory` | action | False | Train | If set, monitors memory usage for each training step. |
| `--use_mixed_precision` | bool | True | Train | Enables mixed-precision computation for faster training with less memory. Set to `False` for full precision (default `torch.float32`). |
| `--metric_aggregation_steps` | int | 500 | Train | Computes metrics (e.g., loss and its standard deviation) every specified number of training steps. |
| `--save_every_n_steps` | int | 5000 | Train | Saves a checkpoint of the model and optimizer every n steps. |
| `--checkpoints` | bool | True | Train | If `False`, disables checkpoint storage during training. |
| `--num_blocks` | int | 4 | Train | Number of convolution blocks used in the model for each layer. |
| `--compute_metrics` | action | False | Eval | If set, computes evaluation metrics over multiple samples for statistical accuracy. |
| `--visualize` | action | False | Eval | If set, generates a single visualized inference sample from the dataset for quick inspection of model output. The sample is drawn from a uniform distribution. |
| `--sampling_steps` | int | 100 | Eval | Number of steps for the Euler-Maruyama method to solve the SDE during inference. Higher values generally improve convergence. |
| `--monte_carlo_samples` | int | 100 | Eval | Number of Monte Carlo samples for metric computation. Increase for more precise statistical results. |
| `--save_gen_samples` | action | False | Eval | If set, stores the generated and ground-truth results for randomly selected samples drawn from a uniform distribution. |
The table below provides a description of each dataset along with the corresponding flag argument for selection during training or evaluation.
| Dataset | Type | Description | Use Case | Additional Flags used for Training and Evaluation |
|---|---|---|---|---|
| `ShearLayer3D` | Train | Cylindrical shear flow dataset | 3D model | `--compile` |
| `TaylorGreen3D` | Train | Taylor-Green dataset | 3D model | `--compile` |
| `Nozzle3D` | Train | 3D nozzle dataset | 3D model | `--compile --batch_size 4 --padding_method zeros --normalize_qk True --consistent_weight 0.5` |
| `ConditionalShearLayer3D` | Eval | Perturbed cylindrical shear flow dataset | 3D model | `--compile --compute_metrics --save_gen_samples` |
| `ConditionalTaylorGreen3D` | Eval | Perturbed Taylor-Green dataset | 3D model | `--compile --compute_metrics --save_gen_samples` |
| `ConditionalNozzle3D` | Eval | Perturbed nozzle dataset with only 1 macro perturbation and 4000 micro perturbations | 3D model | `--compile --compute_metrics --padding_method zeros --normalize_qk True --save_gen_samples` |
We would like to extend our deepest gratitude to the Google Research team for their groundbreaking work and open-source contributions. This project builds upon their foundational models and research, which have been instrumental in advancing the development of GenCFD.
For more details, please refer to their original work: swirl-dynamics.
If you use this code or find it helpful in your research, please cite the following paper:
```bibtex
@misc{molinaro2025generativeaifastaccurate,
      title={Generative AI for fast and accurate statistical computation of fluids},
      author={Roberto Molinaro and Samuel Lanthaler and Bogdan Raonić and Tobias Rohner and Victor Armegioiu and Stephan Simonis and Dana Grund and Yannick Ramic and Zhong Yi Wan and Fei Sha and Siddhartha Mishra and Leonardo Zepeda-Núñez},
      year={2025},
      eprint={2409.18359},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2409.18359},
}
```