J. Eng. Dai, H., Khalil, E. B., Zhang, Y., Dilkina, B., and Song, L. (2017).
doi: 10.1007/s00366-010-0190-x, Kirsch, U. Once in 10-episode training, the performance of is tested for prescribed loading and boundary conditions. Learning combinatorial optimization algorithms over graphs, in Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS'17, (Long Beach, CA), 63516361. Topologies at steps 37, 60, 84, 100, 144, and 145 in the removal sequence are illustrated in Figure 11B. Because the optimization problem (Equation 3) contains constraint functions, the cost function F used in GA is defined using the penalty term as: where 1 and 2 are penalty coefficients for stress and displacement constraints; both are set to be 1000 in this study. arXiv:1702.05532. doi: 10.1007/s00158-008-0237-4, Hajela, P., and Lee, E. (1995). Construct. Mastering the game of go without human knowledge. When a function approximator is not utilized, the action value is updated using state s, chosen action a, observed next state s and reward r as: where > 0 is a learning rate that has an effect on convergence of the training. Arch. 13, 258266. Methods Appl.
Automatic design of optimal structures. infomax embeddings demos 27, 193200. doi: 10.1002/2475-8876.12059.
The number of training episodes is set as 5,000.
As shown in Figure 8B, the agent utilizes an reasonable policy to eliminate obviously unnecessary members connecting to supports at first, non-load-bearing members around the supports next, and members in the load path at last. Knowl. This is an evidence that the agent is capable of detecting the load path among members, and we estimate that this capability is mainly due to graph embedding because it extracts member features considering truss connectivity. Data Eng. Example 3: 6 6-grid truss (V = 0.1858 [m3]). The statistical data with respect to the maximum test score for each training are as follows; the average is 43.38, the standard deviation is 0.16, and the coefficient of variation is only 3.80 103. Comput. doi: 10.1007/s00158-019-02214-w. Mitchell, M. (1998). doi: 10.1007/s00158-012-0877-2, Hagishita, T., and Ohsaki, M. (2009).
Example 2: 3 2-grid truss (V = 0.0340 [m3]). The trainable parameters are optimized by a back-propagation method to minimize the loss function computed by estimated action value and observed reward.
KH and MO approved the final version of the manuscript, and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. Figure 4 plots the history of cumulative rewards in the test simulation recorded at every 10 episodes. Note that the nodes highlighted in blue are pin-supported, those in yellow are loaded. Similarly to loading condition L1 in Figure 5, several symmetric topologies are observed during the removal process, and the sub-optimal topology is a well-converged solution that does not contain unnecessary members.
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. The agent is trained and its performance is tested for a simple planar truss in section 4.1. Although the use of CNN-based convolution method is difficult to apply to trusses as they cannot be handled as pixel-wise data, the convolution is successfully implemented for trusses by introducing graph embedding, which has been extended in this paper from the standard node-based formulation to a member(edge)-based formulation. Eng. Training workflow utilizing RL and graph embedding. embeddings The one just before the terminal state is a sub-optimal truss; however, instability exists at the loaded node. Solids Struct. convolution hierarchical unsupervised embedding Combining machine learning models via adaptive ensemble weighting for prediction of shear capacity of reinforced-concrete deep beams. Note that the total CPU time t[s] for obtaining this removal sequence of members includes initialization of the truss structure, import of the trained RL agent, and computing the removal sequence. In the same manner as neural networks, a back-propagation method (Rumelhart et al., 1986), which is a gradient based method to minimize the loss function, can be used for solving Equation (11). accuracy embedding embedding The left two corners 1 and 7 are pin-supported and rightward and downward unit loads are separately applied at the bottom-right corner 43, as shown in Figure 11A in the loading condition L1. The training method for tuning the parameters is described below. Imagenet classification with deep convolutional neural networks, in Proceedings of the 25th International Conference on Neural Information Processing Systems - Vol. pytorch nodes pbg embedding billions starship AIAA J.
Note that the connection between the members in each pair is an unstable node, and must be fixed to generate a single long member. doi: 10.1007/s00158-004-0480-2, Papadrakakis, M., Lagaros, N. D., and Tsompanakis, Y. Symmetry properties in structural optimization: Some extensions. 8, 301304. 6, 679684.
-relaxed approach in structural topology optimization. Keywords: topology optimization, binary-type approach, machine learning, reinforcement learning, graph embedding, truss, stress and displacement constraints, Citation: Hayashi K and Ohsaki M (2020) Reinforcement Learning and Graph Embedding for Binary Truss Topology Optimization Under Stress and Displacement Constraints. In this study, = 0.99 is adopted, because cumulative reward indicating the amount of reduction of structural volume as a result of the action is much more important than the instant reward. The agent is trained using a 72-member truss with 4 4 grids. (1986). Each grid is a square whose side length is 1 m. The intersection of bracing members is not connected. Yu, Y., Hur, T., and Jung, J. This algorithm is terminated if the best cost function value fb is not updated for ns = 10 consecutive generations. Background information of deep learning for structural engineering.
Adv. embedding gram Since ^ is also computed using {1, , 6}, the action value Q(^,i) is dependent on = {1, , 9}. Gradient-based learning applied to document recognition, in Proceedings of the IEEE, 22782324. (2017). The parameters are tuned using a method based on 1-step Q-learning method, which is a frequently used RL method. To reduce the required capacity of a storage device, 1,000 sets of observed transitions (s, a, s, r) are stored at the maximum. COURSERA: Neural Netw. During the test, nodes 1 and 5 are pin-supported, and loads are applied at node 23 in positive x and negative y direction separately as different loading conditions, which is denoted as loading condition L1. The performance is also tested for other different trusses in sections 4.24.4 without re-training the results in section 4.1. 44, 315341. saved after the 5,000-episode training is regarded as the best parameters. Table 3.
Built Environ. (2018). JP18K18898. Multidiscip. Appl. 1, 419430. doi: 10.1109/TNN.1998.712192, Tamura, T., Ohsaki, M., and Takagi, J. It is notable that the agent was able to optimize the structure with the unforeseen boundary conditions which the agent has never experienced during the training. Comput. One of the loads applied at node 4 is an irregular case where pin-supports and the loaded node aligns on the same straight line.
It is verified from the numerical examples that the trained agent acquired a policy to reduce total structural volume while satisfying the stress and displacement constraints. Multidiscip. The size of embedded member feature nf is 100. Optim. Figure 7 shows the initial GS. Struct. arXiv:1704.01212. Multidiscip.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. (1989). Another approach may be to incorporate a rule-based method to create a hybrid optimization agent. A branch and bound algorithm for topology optimization of truss structures. Tieleman, T., and Hinton, G. (2012). Rev. Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., et al. Example 1: 4 4-grid truss (V = 0.0853 [m3]). Prayogo, D., Cheng, M.-Y., Wu, Y.-W., and Tran, D.-H. (2019). Although stress and displacement bounds have the same value and , respectively, for each member and DOF in this study, it should be noted that each member could have a different stress bound and each DOF could have a different displacement bound for each load case, which provides a versatility to the proposed method. embedding skip The upper-bound displacement for each boundary condition is computed by multiplying 100 to the maximum absolute value of displacement among the all DOFs of the initial GS with the same loading and boundary conditions; hence, varies depending on the structure and the loading and boundary conditions. 32, 33413357. Best scored removal process of members for loading condition L1 of Example 1; (A) initial GS, (B) removal sequence to the terminal state. However, in order to create a more reliable agent, it is necessary to implement the training with various topology, geometry, and loading and boundary conditions. Load and support conditions are randomly provided according to a rule so that the agent can be trained to have good performance for various boundary conditions.
Figure 9. From these results, the agent is confirmed to behave well for a different loading condition. 25, 121129. Methods Eng. Mech. likert embedding plotting metadata graphing Figure 8. Loading condition L2 of Example 3; (A) initial GS, (B) removal sequence of members. Figure 11. Figure 4. Machine learning for combinatorial optimization of brace placement of steel frames. Ohsaki, M. (1995). The agent trained in Example 1 is reused for a smaller 3 -grid truss without re-training. Nakamura, S., and Suzuki, T. (2018). It forms a very simple truss composed of six pairs of members connecting linearly. Optimising the load path of compression-only thrust networks through independent sets. embedding nonlinear investigating dimensionality efficacy classifying schemes dominant plotting Optim. Topping, B., Khan, A., and Leite, J. Blast-induced ground vibration prediction using support vector machine. Cambridge, MA: MIT Press. In the second boundary condition B2, the bottom center nodes 4 and 7 are pin-supported and upper tip nodes 3 and 12 are subjected to outward unit loads along x axis as shown in Figure 9A. Math. Struct. It implies that the agent possesses generalized performance for a complex structural optimization task. embedding Optim. Although it takes a long time for the training, the trained agent requires very low computational cost compared with GA at the application stage. The upper-bound stress is 200 N/mm2 for both tension and compression for all examples. Optim. Deepwalk: online learning of social representations. Enhanced artificial intelligence for ensemble approach to predicting high performance concrete compressive strength. Multidiscip. Furthermore, the robustness of the proposed method is also investigated by implementing 2,000-episode training using different random seeds for 20 times. GA algorithm is run for 10 times with different initial solutions that are generated randomly, and only the best result that yields a solution with the least total structural volume is provided in the GA column in Table 3. Right tip nodes are candidates to apply loading, and a horizontal or a vertical load with the fixed magnitude of 1.0 kN is applied at a randomly chosen node. doi: 10.1016/0045-7949(94)00617-C, Ohsaki, M., and Hayashi, K. (2017). The number of loading conditions is fixed as nload= 2, and accordingly, the sizes of inputs from nodes and members are 5 and 6, respectively. Mach. Genetic algorithms in truss topological optimization. Minimum weight design of elastic redundant trusses under multiple static loading conditions. 29, 190197. The boundary conditions are given at the beginning of each episode. Similarly to the boundary condition B1, the agent eliminates members that do not bear forces as shown in Figure 9B. Although the agent is applied to a larger-scale truss, a sparse optimal solution is successfully obtained. The initial cross-sectional area is 1,000 mm2, and the elastic modulus is 2.0 105 N/mm2 for all members of all examples.