Bayesian DLMFG 2.0

sumitsinha · Nov 19, 2020 · d909bfe
1 parent d64b147
commit d909bfe
Showing 1 changed file with 62 additions and 91 deletions.
diff --git a/README.md b/README.md
@@ -2,7 +2,7 @@
 <img align="right" src="http://www.thebiponline.co.uk/bip/wp-content/uploads/2013/08/University-of-Warwick-WMG.png" alt="WMG" width="100">
 </a>
 
-# Bayesian Deep Learning for Manufacturing (dlmfg)
+# Bayesian Deep Learning for Manufacturing 2.0 (dlmfg)
 ## Object Shape Error Response (OSER)
 
 
@@ -18,24 +18,53 @@
 
 ***
 ## Overview 
-The open source **Bayesian Deep Learning for Manufacturing (dlmfg) Library** is built using a **TensorFlow**, **Tensorflow Probablity** and **Keras** back end to build Bayesian and deterministic deep learning models such as **3D Convolutional Neural Network** to enable **root cause analysis** and **quality control** in Sheet Metal Assembly Manufacturing Systems. The library can be used across various domains such as assembly systems, stamping, additive manufacturing and milling where the key problem is **Object Shape Error Detection and Estimation**. The library is build using Object Oriented Programming to enable extension and contribution from other related disciplines within the artificial intelligence community as well as the manufacturing community.
+The open source **Bayesian Deep Learning for Manufacturing (dlmfg) Library** uses a **TensorFlow**, **TensorFlow Probability** and **Keras** back end to build:  
 
+* Bayesian deep learning models such as **Bayesian 3D Convolutional Neural Network and Bayesian 3D U-Net** to enable **root cause analysis** in Manufacturing Systems. 
+
+* Deep reinforcement learning models such as **Deep Deterministic Policy Gradients** to enable **control and correction** in Manufacturing Systems. 
+
+
+The library can be used across various domains such as assembly systems, stamping, additive manufacturing and milling where the key problem is **Object Shape Error Detection and Estimation**. The library is built using object-oriented programming so that it can be extended by contributors from related disciplines in the artificial intelligence and manufacturing communities.
+
+## Video 
+[**A video of the work can be found here**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/tree/master/resources/video)
+
+## Published Work 
 The published work can be found below: 
+
 * [**3D convolutional neural networks to estimate assembly process parameters using 3D point-clouds**](https://www.researchgate.net/publication/333942071_3D_convolutional_neural_networks_to_estimate_assembly_process_parameters_using_3D_point-clouds)
 
-* [**Deep learning enhanced digital twin for remote laser welding of aluminium structures**](https://www.sciencedirect.com/science/article/abs/pii/S0007850620301323)
+* [**Deep learning enhanced digital twin for closed loop quality control**](https://www.sciencedirect.com/science/article/abs/pii/S0007850620301323)
+
+*  **Object Shape Error Response using Bayesian 3D Convolutional Neural Network for Assembly Systems with Compliant Parts (to appear online)**
+
 
-*Other follow up papers under review*
+*  **Object Shape Error Response using Bayesian 3D U-Net for Multi-Station Assembly Systems with Non-Ideal Compliant Parts (In Review)**
 
-The library consists of the following items:
 
-1. **Datasets** -  Sheet Metal Assembly Datasets consisting of input i.e. Cloud of Point data with deviations of each node and output i.e. the process parameter variations for each of Cloud of Point. This open source dataset is the **first** dataset for sheet metal assembly manufacturing system and can be leveraged for building, training and benchmarking deep learning models for application in root cause analysis and quality control of manufacturing systems.
-2. **Library Modules** - Various python modules using a TensorFlow/Keras backend for model training, deployment, visualization, sampling, active learning, transfer learning, measurement characteristics generation. More details can be found in the documentation of the library. 
+*  **Scalable and Interpretable Learning for Object Shape Errors in Multi-Station Assembly Systems using Bayesian Deep Learning (To be submitted)**
+
+
+*Other follow up papers are planned*
+
 
 ## Documentation
 The complete documentation and other ongoing research can be found here: [**Documentation and Research**](https://sumitsinha.github.io/Deep_Learning_for_Manufacturing/html/index.html).
 
 
+## Highlights and New Additions
+
+1. [**Bayesian 3D U-Net**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/blob/master/core/bayes_unet_hybrid_train.py) model integrating Bayesian layers and attention blocks for uncertainty quantification and improved decoder performance, using the attention blocks' *where to look* capability, with multi-task outputs that estimate both real-valued (regression) and categorical (classification) values. The decoder is used to obtain real-valued segmentation maps.
+2.  [**Deep Reinforcement Learning**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/tree/master/deep_reinforcement_learning) using deep deterministic policy gradient (DDPG) and a custom-made multi-physics manufacturing environment to build agents that correct manufacturing systems
+3.  [**Closed Loop Sampling**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/blob/master/core/dynamic_adaptive_model_train.py) for faster model training and convergence using epistemic uncertainty of the Bayesian CNN models
+4.  [**Matlab Python Integration**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/tree/master/cae_simulations) to enable low latency connection between multi-physics manufacturing environments (Matlab) and TensorFlow based DDPG agents
+5.  [**Multi-Physics Manufacturing System Simulations**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/tree/master/cae_simulations/cae_matlab) to generate custom datasets for various fault scenarios using Variation Response Method (VRM) kernel
+6.  [**Uncertainty guided continual learning**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/tree/master/continual_learning) to enable lifelong/incremental training across multiple case studies
+7.  [**Exploratory notebooks**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/tree/master/assembly_eda_studies) for various case studies
+8.  [**Datasets for Industrial multi-station case studies**](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/tree/master/pre_trained_models) for training and benchmarking deep learning models 
+
+
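The closed loop sampling idea in item 3 can be sketched as follows. This is a minimal illustration and not the library's implementation: `epistemic_score`, `select_next_samples` and the input format are hypothetical, and it assumes each candidate setting has already been scored with several stochastic forward passes of a Bayesian model. Candidates whose predictions disagree most across passes (highest epistemic variance) are picked for the next round of simulation and training.

```python
from statistics import pvariance


def epistemic_score(mc_predictions):
    """Mean per-output variance across stochastic forward passes.

    mc_predictions: list of passes, each a list of predicted outputs
    (hypothetical format, assumed for this sketch).
    """
    n_outputs = len(mc_predictions[0])
    per_output = [
        pvariance([single_pass[j] for single_pass in mc_predictions])
        for j in range(n_outputs)
    ]
    return sum(per_output) / n_outputs


def select_next_samples(candidates, k):
    """Return the indices of the k candidates with the highest epistemic score."""
    scores = [epistemic_score(mc) for mc in candidates]
    ranked = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


# Three candidate process-parameter settings, three passes each, two outputs.
candidates = [
    [[0.10, 0.20], [0.11, 0.19], [0.10, 0.21]],  # passes agree closely
    [[0.10, 0.90], [0.60, 0.20], [0.30, 0.50]],  # passes disagree strongly
    [[0.40, 0.40], [0.45, 0.35], [0.50, 0.30]],  # moderate disagreement
]
chosen = select_next_samples(candidates, k=2)
```

Ranking by variance is only one possible acquisition rule; it is used here because it needs nothing beyond the Monte Carlo predictions themselves.
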
 ## Installation
 The library can be cloned using: 
 
@@ -50,60 +79,14 @@ The library consists of the following two key datasets:
 1.	[**3D Cloud of Point data with node deviations and process parameters for Single Part Car Halo Reinforcement**](https://sumitsinha.github.io/Deep_Learning_for_Manufacturing/html/case_study_halo.html) – Obtained due to variations in the Measurement Station locators and Stamping Process
 2.	[**3D Cloud of Point data with node deviations and process parameters for Two part assembly for Car Door Inner and Hinge Reinforcement**](https://sumitsinha.github.io/Deep_Learning_for_Manufacturing/html/case_study_inner_rf.html) – Obtained due to variations in the Assembly System locators and joining tools.
 
-## Deterministic Models 
 
-The basic 3D CNN model for single stage systems termed as **PointDevNet** has the following layers:
-
-```python
-
-> model.add(Conv3D(32,kernel_size=(5,5,5),strides=(2,2,2),activation='relu',input_shape=(voxel_dim,voxel_dim,voxel_dim,deviation_channels)))
-> model.add(Conv3D(32, kernel_size=(4,4,4),strides=(2,2,2),activation='relu'))
-> model.add(Conv3D(32, kernel_size=(3,3,3),strides=(1,1,1),activation='relu')) 
-> model.add(MaxPool3D(pool_size=(2,2,2)))
-> model.add(Flatten())
-> model.add(Dense(128,kernel_regularizer=regularizers.l2(0.02),activation='relu'))
-> model.add(Dense(self.output_dimension, activation=final_layer_avt))
-
-```
-The Multi-head CNN model having multiple CNN heads depending on the number of data sources in multi-stage systems termed as **MH-PointDevNet** has the following layers:
+## Bayesian 3D CNN Model Architecture
+Motivated by recent developments in Bayesian deep neural networks, Bayesian models that treat parameters as distributions have been built using TensorFlow Probability. Aleatoric uncertainty is modelled using multivariate normal distributions as outputs, while epistemic uncertainty is modelled using distributions over the model parameters by means of Flipout layers.
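
The split between the two uncertainty types can be illustrated with a short sketch. It assumes the model is evaluated several times on the same input (each pass samples new weights in the Flipout layers) and that every pass returns a predictive mean and variance for one output. The function name and the numbers are illustrative only. By the law of total variance, the spread of the means across passes gives the epistemic part and the average predicted variance gives the aleatoric part.

```python
from statistics import mean, pvariance


def decompose_uncertainty(pass_means, pass_variances):
    """Split the predictive variance of one output into two parts.

    pass_means:     predictive mean from each stochastic forward pass
    pass_variances: predictive variance from each stochastic forward pass
    """
    epistemic = pvariance(pass_means)   # disagreement between weight samples
    aleatoric = mean(pass_variances)    # average noise predicted by the model
    return epistemic, aleatoric, epistemic + aleatoric


# Illustrative values for four forward passes on a single input.
pass_means = [1.0, 1.2, 0.8, 1.0]
pass_variances = [0.04, 0.05, 0.03, 0.04]
epistemic, aleatoric, total = decompose_uncertainty(pass_means, pass_variances)
```

With these values the epistemic part is about 0.02 and the aleatoric part about 0.04, so most of the predicted spread comes from noise in the data rather than from uncertainty about the weights.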
 
+The Bayesian 3D CNN model for a single-station system has the following layers:
+![Bayesian 3D CNN Model](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/blob/master/model_architecture/bayes_3d_cnn.png)
 
 ```python
-
-data_in=[None] * self.heads
-conv_3d_1=[None] * self.heads
-conv_3d_2=[None] * self.heads
-conv_3d_3=[None] * self.heads
-max_pool=[None] * self.heads
-flat=[None] * self.heads
-conv_3d_dropout_2=[None] * self.heads
-
-for i in range(self.heads):
-	data_in[i]=tf.keras.layers.Input(shape=(voxel_dim,voxel_dim,voxel_dim,deviation_channels))
-			conv_3d_1[i]=tf.keras.layers.Convolution3D(32,kernel_size=(5,5,5),strides=(2,2,2),activation=tf.nn.relu)(data_in[i])
-			conv_3d_2[i]=tf.keras.layers.Convolution3D(32, kernel_size=(4,4,4),strides=(2,2,2),activation=tf.nn.relu)(conv_3d_1[i])
-			conv_3d_dropout_2[i]=tf.keras.layers.Dropout(0.1)(conv_3d_2[i])
-			conv_3d_3[i]=tf.keras.layers.Convolution3D(32, kernel_size=(3,3,3),strides=(1,1,1),activation=tf.nn.relu)(conv_3d_dropout_2[i])
-			max_pool[i]=tf.keras.layers.MaxPooling3D(pool_size=[2, 2, 2])(conv_3d_3[i])
-			flat[i]=tf.keras.layers.Flatten()(max_pool[i])
-
-merge = tf.keras.layers.concatenate(flat)
-dropout_merge=tf.keras.layers.Dropout(0.2)(merge)
-hidden_1=tf.keras.layers.Dense(128,kernel_regularizer=tf.keras.regularizers.l2(l=self.regularizer_coeff),activation=tf.nn.relu)(dropout_merge)
-hidden_2=tf.keras.layers.Dense(64,kernel_regularizer=tf.keras.regularizers.l2(l=self.regularizer_coeff),activation=tf.nn.relu)(hidden_1)
-output=tf.keras.layers.Dense(self.output_dimension)(hidden_2)
-
-model=tf.keras.Model(inputs=data_in,outputs=output)
-
-```
-## Bayesian Models 
-Motivated by the recent development of Bayesian Deep Neural Networks Bayesian CNN considering parameters to be distributions have been build using TensorFlow Probability. The Aleatoric uncertainty have been modelled using Multi-variate normal distributions as outputs while the epistemic distributions have been modelled using distributions on model parameters by using Flip-out layers.
-
-The Bayesian version of the single station system has the following layers:
-![Bayesian 3D CNN Model](./model_architecture/bayes_3d_cnn.png)
-
-```python
-
 > negloglik = lambda y, rv_y: -rv_y.log_prob(y)
 > model = tf.keras.Sequential([
 tf.keras.layers.InputLayer(input_shape=(voxel_dim,voxel_dim,voxel_dim,deviation_channels)), tfp.layers.Convolution3DFlipout(32, kernel_size=(5,5,5),strides=(2,2,2),activation=tf.nn.relu),
@@ -120,19 +103,18 @@ model.compile(optimizer=tf.keras.optimizers.Adam(),loss=negloglik,metrics=[tf.ke
 ```
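
To make the loss concrete, the sketch below evaluates the same negative log-likelihood by hand for a multivariate normal with diagonal covariance, using only the standard library. It illustrates what `-rv_y.log_prob(y)` computes when the output distribution is a diagonal Gaussian, which is an assumption here because the full model definition is not shown. The function name is hypothetical.

```python
import math


def diag_gaussian_negloglik(y, loc, scale_diag):
    """Negative log-density of y under N(loc, diag(scale_diag**2))."""
    k = len(y)
    # Squared distance from the mean, measured in standard deviations.
    quad = sum(((yi - mi) / si) ** 2 for yi, mi, si in zip(y, loc, scale_diag))
    # Log-determinant term: sum of log standard deviations.
    log_det = sum(math.log(si) for si in scale_diag)
    return 0.5 * quad + log_det + 0.5 * k * math.log(2.0 * math.pi)


# Two process parameters with unit standard deviation (illustrative values).
nll_at_mean = diag_gaussian_negloglik([0.5, -0.2], [0.5, -0.2], [1.0, 1.0])
nll_off_mean = diag_gaussian_negloglik([1.5, -0.2], [0.5, -0.2], [1.0, 1.0])
```

Moving one target a full standard deviation away from the predicted mean raises the loss by exactly 0.5, which is how this loss penalises errors relative to the predicted (aleatoric) scale.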
 
 
-## Encoder Decoder 3D U-Net Models
+## Bayesian 3D U-Net Model Architecture
 To scale to multi-station systems with both categorical and continuous process parameters, and to predict point clouds (object shape error) at previous stations, an attention-based 3D U-Net architecture is used.
 
-![Bayesian 3D CNN Model](./model_architecture/unet_3d_cnn.png)
+![Bayesian 3D U-Net Model](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/blob/master/model_architecture/unet_3d_cnn.png)
 
 **Down-sampling Kernel**
-![Bayesian 3D CNN Model](./model_architecture/down_sample.png)
+![Down-sampling Kernel](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/blob/master/model_architecture/down_sample.png)
 
 **Attention based Up-Sampling Kernel**
-![Bayesian 3D CNN Model](./model_architecture/up_sample.png)
+![Attention based Up-Sampling Kernel](https://github.com/sumitsinha/Deep_Learning_for_Manufacturing/blob/master/model_architecture/up_sample.png)
 
 ```python
-
 		def attention_block(x, g, inter_channel):
 
 		    theta_x = Conv(inter_channel, [1,1,1], strides=[1,1,1])(x)
@@ -146,21 +128,25 @@ For scaling to multi-station systems consisting of both categorical and continuo
 		    att_x = multiply([x, rate])
 		    return att_x
 
+		input_size=(voxel_dim,voxel_dim,voxel_dim,deviation_channels)
+		inputs = Input(input_size)
+		x = inputs
+
 		# Down sampling
 		for i in range(depth):
 			out_channel = 2**i * filter_root
 
 			# Residual/Skip connection
-			res = Conv(out_channel, kernel_size=1, padding='same', use_bias=False, name="Identity{}_1".format(i))(x)
+			res = tfp.layers.Convolution3DFlipout(out_channel, kernel_size=1, kernel_divergence_fn=kl_divergence_function,padding='same', name="Identity{}_1".format(i))(x)
 
 			# First Conv Block with Conv, BN and activation
-			conv1 = Conv(out_channel, kernel_size=3, padding='same', name="Conv{}_1".format(i))(x)
+			conv1 = tfp.layers.Convolution3DFlipout(out_channel, kernel_size=3, kernel_divergence_fn=kl_divergence_function,padding='same', name="Conv{}_1".format(i))(x)
 			#if batch_norm:
 				#conv1 = BatchNormalization(name="BN{}_1".format(i))(conv1)
 			act1 = Activation(activation, name="Act{}_1".format(i))(conv1)
 
 			# Second Conv block with Conv and BN only
-			conv2 = Conv(out_channel, kernel_size=3, padding='same', name="Conv{}_2".format(i))(act1)
+			conv2 = tfp.layers.Convolution3DFlipout(out_channel, kernel_size=3, padding='same', kernel_divergence_fn=kl_divergence_function,name="Conv{}_2".format(i))(act1)
 			#if batch_norm:
 				#conv2 = BatchNormalization(name="BN{}_2".format(i))(conv2)
 
@@ -175,18 +161,20 @@ For scaling to multi-station systems consisting of both categorical and continuo
 			else:
 				x = act2
 
-		#Regression Outputs
-		feature_vector=Conv(self.output_dimension-categorical_outputs, 1, padding='same', activation=final_activation, name='Process_Parameter_output_regression')(x)
-		process_parameter_regression=GlobalAveragePooling3D(name='Regression_Outputs')(feature_vector)
+		feature_vector_reg=Conv(reg_kccs, 1, padding='same', activation=final_activation, name='Process_Parameter_Reg_output')(x)
+		process_parameter_reg=GlobalAveragePooling3D()(feature_vector_reg)
 
-		#Classification Outputs
-		feature_vector_categorical=Conv(categorical_outputs, 1, padding='same', activation=final_activation, name='Process_Parameter_output_classification')(x)
-		process_parameter_cont=GlobalAveragePooling3D()(feature_vector_categorical)
-		process_parameter_classification=Activation('sigmoid',name='Classification_Outputs')(process_parameter_cont)
+		feature_vector_cla=Conv(categorical_kccs, 1, padding='same', activation=final_activation, name='Process_Parameter_Cla_output')(x)
+		process_parameter_cla=GlobalAveragePooling3D()(feature_vector_cla)
 
-		#feature_vector=Flatten()(x)
-		#process_parameter=Dense(self.output_dimension)(feature_vector)
+		#feature_categorical=Flatten()(feature_vector)
+		#reg_output=tfp.layers.DenseFlipout(output_dimension,kernel_divergence_fn=kl_divergence_function)(process_parameter)
 
+		#Process Parameter Outputs
+		reg_distrbution=tfp.layers.DistributionLambda(lambda t:tfd.MultivariateNormalDiag(loc=t[..., :reg_kccs], scale_diag=aleatoric_tensor),name="regression_outputs")(process_parameter_reg)
+		cla_distrbution=Activation('sigmoid', name="classification_outputs")(process_parameter_cla)
+		#cla_distrbution=tfp.layers.DenseFlipout(categorical_kccs, kernel_divergence_fn=kl_divergence_function,activation=tf.nn.sigmoid,name="classification_outputs")(process_parameter_cla)
+
 		# Upsampling
 		for i in range(depth - 2, -1, -1):
 			out_channel = 2**(i) * filter_root
@@ -216,26 +204,9 @@ For scaling to multi-station systems consisting of both categorical and continuo
 			resconnection = Add(name="upAdd{}_1".format(i))([res, up_conv2])
 
 			x = Activation(activation, name="upAct{}_2".format(i))(resconnection)
-
 ```
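
The effect of the attention block can be shown on a toy example. The sketch below follows the usual additive attention gate (ReLU of the combined skip and gating signals, then a sigmoid rate that scales the skip features); the middle of `attention_block` is not shown in this diff, so that structure is an assumption. The 1x1x1 convolutions are reduced to hypothetical scalar weights, and `x` and `g` are flat lists of single-channel voxel values at the same resolution.

```python
import math


def attention_gate(x, g, w_theta, w_phi, w_psi):
    """Scale each skip-connection value in x by an attention rate in (0, 1)."""
    gated = []
    for xi, gi in zip(x, g):
        combined = max(0.0, w_theta * xi + w_phi * gi)      # ReLU of summed signals
        rate = 1.0 / (1.0 + math.exp(-w_psi * combined))    # sigmoid attention rate
        gated.append(xi * rate)                             # same role as multiply([x, rate])
    return gated


x = [1.0, 2.0, 3.0]    # skip-connection features
g = [0.0, 1.0, -5.0]   # gating signal from the coarser level
att_x = attention_gate(x, g, w_theta=1.0, w_phi=1.0, w_psi=1.0)
```

Voxels where the gating signal cancels the skip feature (the third one here) are damped the most, which is the *where to look* behaviour the decoder relies on.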
 
 
-## Modules
-The library consists of following key modules. More details about the classes, objects and methods can be found in the documentation:
-
-* **Core** - key functions for model training, testing, data study and model deployment
-* **Dynamic Learning** - key program to dynamically using the CAE simulation of the system
-* **Adaptive and Dynamic Learning** - key program to dynamically learn from the CAE simulation of the system by adaptively generating samples using the uncertainty estimates of Bayesian Models
-* **CAE Simulations** - Matlab based library to generate close to real samples of manufacturing systems using Multi-Physics Finite Element (FE) based models 
-* **Transfer Learning** - Key functions to test state of art models (Voxnet, 3D-Unet) across different domains present in pre-trained networks and determine there application in manufacturing problems. This can also be leveraged for within domain transfer learning to check how transferable are models trained from data from one case study to another case study
-* **Utilities** - key functions for model benchmarking, Voxel constructions and constructing mixture density network 
-* **KMC** - key functions for generating key measurement characteristics for various process parameters within a manufacturing system
-* **Visualisation** - key functions to visualise model training and cloud of point or voxelized data
-* **Resources** - Resources required for model building such as Voxel mapping for Cloud-of-point data and nominal Nodes for the assembly geometry
-* **Trained Models** - Consists of a file structure created for each case study that is run. This file structure consists of training and testing logs, trained models and plots
-* **Pre-Trained Models** - Consists of state of the art pre-trained 3D CNN models such as Voxnet (Object Detection) and 3D U-Net (Medical Scan Segmentation)
-* **Configuration** - specify the configuration parameters for different case studies and other parametrised applications such as model training, model construction, Sampling for active learning, transfer learning etc.
-
 ## Verification and Validation
 Details of verification and validation of the model on an actual system can be found here: [**Real System Implementation**](https://sumitsinha.github.io/Deep_Learning_for_Manufacturing/html/real_system_implementation.html)