Interpreting batch size parameter in constraints

Hi all,

Regarding the batch size parameter that is used in Modulus constraints, I’ll use an example from the aneurysm tutorial:

from the python script:

outlet = PointwiseBoundaryConstraint(
    nodes=nodes,
    geometry=outlet_mesh,
    outvar={"p": 0},
    batch_size=cfg.batch_size.outlet,
)

and from the config file:
batch_size:
  outlet: 650

I understand (I think) that this batch size is the number of points sampled on the constraint (in this case the outlet boundary). The points are randomly sampled on the boundary. Are they randomly sampled once, and the same points used in all training epochs; or are they re-sampled periodically? If the former, is there a way to force resampling during training? This would allow greater coverage over the boundary while keeping the number of points (and memory usage) down.

On a related note: if I want to sample a very large number of points in the volume (PointwiseInteriorConstraint) I’m likely to run out of memory if all the points are used at once. Is there a way to force mini-batch sampling of that large number of points?

Likely the answer to one of these questions also answers the other. I couldn’t derive a satisfactory answer by perusing either the documentation or the source.

Thanks.

Hi @uribarri

Thanks for your interest in Modulus. The batch size is the number of points used per iteration for that particular constraint. There are actually two parameters in the constraints that govern the total number of points used in one epoch (side note: Modulus is centered around iterations, not epochs):

batch_size : int
        Batch size used in training.
batch_per_epoch : int = 1000
        If `fixed_dataset=True` then the total number of points generated
        to apply this constraint on is `batch_per_epoch * batch_size`.

The second parameter tells Modulus how many batches make up an epoch of this constraint. By default it's 1000, so in the example you gave the total number of points used to train the model on the outlet is 650k (650 * 1000).
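For concreteness, here is a minimal sketch of how the two parameters combine in the outlet constraint from the aneurysm example (the explicit batch_per_epoch value just restates the default):

# Sketch: same outlet constraint, with batch_per_epoch written out explicitly.
# 650 points per iteration * 1000 batches per epoch = 650k total outlet points.
outlet = PointwiseBoundaryConstraint(
    nodes=nodes,
    geometry=outlet_mesh,
    outvar={"p": 0},
    batch_size=cfg.batch_size.outlet,  # 650 in the config above
    batch_per_epoch=1000,              # default; total points = 650 * 1000
)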

The comment in the docstring answers the second part of your question. The fixed_dataset parameter, when set to False, will cause the constraint to continuously re-sample points from the geometry object during training. This approach isn't used often, since for most cases the entire training dataset can be stored in memory.
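As an illustration, a minimal sketch of an interior constraint that re-samples its points during training. This assumes PointwiseInteriorConstraint accepts the same fixed_dataset parameter shown in the docstring above; the interior_mesh geometry, the outvar keys, and cfg.batch_size.interior are placeholders modeled on the aneurysm example:

# Sketch (assumptions noted above): keep the per-iteration batch small, but
# re-sample new interior points continuously instead of reusing a fixed set.
interior = PointwiseInteriorConstraint(
    nodes=nodes,
    geometry=interior_mesh,              # hypothetical interior geometry
    outvar={"continuity": 0, "momentum_x": 0, "momentum_y": 0, "momentum_z": 0},
    batch_size=cfg.batch_size.interior,  # points held in memory per iteration
    fixed_dataset=False,                 # re-sample points rather than reuse them
)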

Hi @ngeneva ,

Thanks for your answer, which addressed all my questions. I was able to try out your suggestions and they seemed to work. I appreciate it.

On a related note: when fixed_dataset=False, the re-sampling of the training points generates a lot of messages in the console, such as those below, which makes following the progress of the training very difficult. Is there a way to suppress those status messages?

Thanks.

Spherical Inside/Outside took: 0.000417707
[OptiX][ 4][  DISK CACHE]: Closed database: "/var/tmp/OptixCache_root/optix7cache.db"
[OptiX][ 4][  DISK CACHE]:     Cache data size: "29.2 KiB"
Timing for computeSignedField: 0.0308755s
Timing for computeSignedDistanceField: 0.0472712s
NUM Triangles: 52664
Timing for Build CPAT Model (build bvh): 0.0154761s
Timing for cpatResultsToArrays: 1.112e-05s
Timing for cpatDistanceField: 0.0160666s
Timing for computeDistanceField: 0.0160728s
[OptiX][ 4][       KNOBS]: All knobs on default.

[OptiX][ 4][  DISK CACHE]: Opened database: "/var/tmp/OptixCache_root/optix7cache.db"
[OptiX][ 4][  DISK CACHE]:     Cache data size: "29.2 KiB"
Timing for optixAccelBuild: 0.000104571s
Timing for buildAccelStructure: 0.00059272s
[OptiX][ 4][   DISKCACHE]: Cache hit for key: ptx-6835-key2d969543357fdcfff15750597ae49b71-sm_86-rtc1-drv510.47.03
[OptiX][ 4][COMPILE FEEDBACK]: 
[OptiX][ 4][COMPILE FEEDBACK]: Info: Pipeline has 1 module(s), 2 entry function(s), 5 trace call(s), 0 continuation callable call(s), 0 direct callable call(s), 9 basic block(s) in entry functions, 256 instruction(s) in entry functions, 7 non-entry function(s), 53 basic block(s) in non-entry functions, 627 instruction(s) in non-entry functions

Hi @uribarri

Great, glad things are cleared up for you. Those messages come from PySDF, which samples points via OptiX. At present there is no way to suppress them through Modulus.
