Skip to content

A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks

License

Notifications You must be signed in to change notification settings

Mohamedelrefaie/DrivAerNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

DrivAerNet++

Our new preprint: DrivAerNet++ paper here

DrivAerNet Paper: here

We present DrivAerNet++, the largest and most comprehensive multimodal dataset for aerodynamic car design. DrivAerNet++ comprises 8,000 diverse car designs modeled with high-fidelity computational fluid dynamics (CFD) simulations. The dataset includes diverse car configurations such as fastback, notchback, and estateback, with different underbody and wheel designs.

Design Parameters

Design parameters for the generation of the DrivAerNet++ dataset. Several geometric parameters with significant impact on aerodynamics were selected and varied within a specific range. These parameter ranges were chosen to avoid values that are either difficult to manufacture or not aesthetically pleasing.

Shape Variation

DrivAerNet++ covers all conventional car designs. The dataset encompasses various underbody and wheel designs to represent both internal combustion engine (ICE) and electric vehicles (EV). This extensive coverage allows for comprehensive studies on the impact of geometric variations on aerodynamic performance. By including a diverse set of car shapes, DrivAerNet++ facilitates the exploration of aerodynamic effects across different vehicle types, supporting both academic research and industrial applications.

Each 3D car geometry is parametrized with 26 parameters that completely describe the design. To create diverse car designs, we used two main morphing methods: morphing boxes and direct morphing. For a detailed description of the design parameters, their ranges, lower and upper bounds, please refer to the paper.

DrivAerNet_params-ezgif com-crop

Dataset Contents & Modalities

  • Parametric Models: Parametric models with tabular data, allowing extensive exploration of automotive design variations.
  • Point Cloud Data: Point cloud data for each car design.
  • 3D Car Meshes: Detailed 3D meshes of each car design, suitable for various machine learning applications.
  • CFD Simulation Data: High-fidelity CFD simulation data for each car design, including 3D volumetric fields, surface fields, and streamlines.
  • Aerodynamic Coefficients: Key aerodynamic metrics such as drag coefficient (Cd), lift coefficient (Cl), and more.

DatasetContents

Dataset Annotations

In addition to the CFD simulation data, our dataset includes detailed annotations for various car components (29 labels), such as wheels, side mirrors, and doors. These annotations are instrumental for a range of machine learning tasks, including classification, semantic segmentation, and object detection. The comprehensive labeling can also facilitate automated CFD meshing processes by providing precise information about different car components. By incorporating these labels, our dataset enhances the utility for developing and testing advanced algorithms in automotive design and analysis.

DrivAerNet_ClassLabels_new

Computational Cost

Running the high-fidelity CFD simulations for DrivAerNet++ required substantial computational resources. The simulations were conducted on the MIT Supercloud, leveraging parallelization across 60 nodes, totaling 2880 CPU cores, with each CFD case using 256 cores and 1000 GBs of memory. The full dataset requires 39 TB of storage space. The simulations took approximately 3 Γ— 10⁢ CPU-hours to complete.

Applications

DrivAerNet++ supports a wide array of machine learning applications, including but not limited to:

  • πŸš€ Data-driven design optimization: Optimize car designs based on aerodynamic performance.
  • 🧠 Generative AI: Train generative models to create new car designs based on performance or aesthetics.
  • 🎯 Surrogate models: Predict aerodynamic performance without full CFD simulations.
  • πŸ”₯ CFD simulation acceleration: Speed up simulations using machine learning and multi-GPU techniques.
  • πŸ“‰ Reduced Order Modeling: Create data-driven reduced-order models for efficient & fast aerodynamic simulations.
  • πŸ’Ύ Large-Scale Data Handling: Efficiently store and manage large datasets from high-fidelity simulations.
  • πŸ—œοΈ Data Compression: Implement high-performance lossless compression techniques.
  • 🌐 Part and shape classification: Classify car categories or components to enhance design analysis.
  • πŸ”§ Automated CFD meshing: Automate the meshing process based on car components to streamline simulations.

Results Replication

DrivAerNet++ serves as a valuable benchmark dataset due to its size and diversity. It provides extensive coverage of various car designs and configurations, making it ideal for testing and validating machine learning models in aerodynamic design. We provide the train, test, and validation splits in the following folder: train_val_test_splits.

To replicate the results based on the parametric tabular data, please refer to the ParametricModels directory and the code in AutoML_parametric.py.

To replicate the results based on geometric deep learning methods, please refer to the code saved in the DeepSurrogates directory.

image

Drag values for the 8k car designs can be found Here

Training Using Point Clouds

You can access the preprocessed point clouds for training at the following link:
DrivAerNet++ Processed Point Clouds (100k)

Training Using STL Files / Graphs

For training using the STL files (for graph-based methods or point sampling at a different resolution), please download the STLs from the following link:
DrivAerNet++ STL Files

The full data will be released after the review process on Harvard Dataverse.

Datasets Comparison

DrivAerNet++ stands out as the largest and most comprehensive dataset in the field of car design.

image

Contributing

We welcome contributions to improve the dataset or project. Please submit pull requests for review.

Maintenance and Support

Maintained by the DeCoDE Lab at MIT. Report issues via GitHub issues.

License

The code is distributed under the MIT License. The DrivAerNet++ dataset is distributed under the Creative Commons Attribution-NonCommercial (CC BY-NC) license. Full terms for the dataset license here.

Additional Resources

  • Tutorials: Link
  • Technical Documentation: [Link]

Previous Version

To replicate the code and experiments from the first version of DrivAerNet, please refer to the folder: DrivAerNet_v1. This includes 4,000 car designs based on the fastback category. The dataset can be accessed here.

Citations

To cite this work, please use the following reference:

@article{elrefaie2024drivaernet++,
  title={DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks},
  author={Elrefaie, Mohamed and Morar, Florin and Dai, Angela and Ahmed, Faez},
  journal={arXiv preprint arXiv:2406.09624},
  year={2024}
}

To cite the first version of DrivAerNet, please use the following reference:

@article{elrefaie2024drivaernet,
  title={DrivAerNet: A Parametric Car Dataset for Data-Driven Aerodynamic Design and Graph-Based Drag Prediction},
  author={Elrefaie, Mohamed and Dai, Angela and Ahmed, Faez},
  journal={arXiv preprint arXiv:2403.08055},
  year={2024}
}