
Seismic Hazard Visualization from Big Simulation Data: Construction of a Parallel Distributed Processing System for Ground Motion Simulation Data
Author(s) -
Takahiro Maeda,
Hiroyuki Fujiwara
Publication year - 2016
Publication title -
journal of disaster research
Language(s) - English
Resource type - Journals
eISSN - 1883-8030
pISSN - 1881-2473
DOI - 10.20965/jdr.2016.p0265
Subject(s) - computer science , data processing , seismic hazard , earthquake simulation , visualization , data mining , computation , real time computing , hazard , benchmark (surveying) , geographic information system , data processing system , database , algorithm , seismology , geology , remote sensing , geodesy , chemistry , organic chemistry
We have developed a data mining system of parallel distributed processing system which is applicable to the large-scale and high-resolution numerical simulation of ground motion by transforming into ground motion indices and their statistical values, and then visualize their values for the seismic hazard information. In this system, seismic waveforms at many locations calculated for many possible earthquake scenarios can be used as input data. The system utilizes Hadoop and it calculates the ground motion indices, such as PGV, and statistical values, such as maximum, minimum, average, and standard deviation of PGV, by parallel distributed processing with MapReduce. The computation results are being an output as GIS (Geographic Information System) data file for visualization. And this GIS data is made available via the Web Map Service (WMS). In this study, we perform two benchmark tests by applying three-component synthetic waveforms at about 80,000 locations for 10 possible scenarios of a great earthquake in Nankai Trough to our system. One is the test for PGV calculation processing. Another one is the test for PGV data mining processing. A maximum of 10 parallel processing are tested for both cases. We find that our system can hold the performance even when the total tasks is larger than 10. This system can enable us to effectively study and widely distribute to the communities for disaster mitigation since it is built with data mining and visualization for hazard information by handling a large number of data from a large-scale numerical simulation.