Journal "Software Engineering"
a journal on theoretical and applied science and technology
ISSN 2220-3397

Issue N11 2024 year

DOI: 10.17587/prin.15.600-608
Methodology of a Comprehensive Approach to the Analysis, Structuring, and Aggregation of Data Based on the Requirements of Resource-Intensive Applications
P. A. Sechenykh, Junior Researcher1, Senior Teacher2, p-sechenyh@mail.ru,
1 Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences (FRC CSC RAS), Moscow, 119333, Russian Federation,
2 Moscow Aviation Institute, Moscow, 125993, Russian Federation
Corresponding author: Polina A. Sechenykh, Junior Researcher1, Senior Teacher2,
1 Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences (FRC CSC RAS), Moscow, 119333, Russian Federation
2 Moscow Aviation Institute, Moscow, 125993, Russian Federation, E-mail: p-sechenyh@mail.ru
Received on September 09, 2024
Accepted on October 07, 2024

The work presents a methodology for in-depth data description for resource-intensive applied tasks, particularly in the field of materials science. It consists of three basic stages. In the first stage, quantitative and semantic requirements are formalized, and working datasets are defined, which can be classified as reference, structured, instrumental, or experimental-computational, depending on the source. These datasets are stored in a specialized research area for further investigation. The second stage involves the analysis and hierarchical structuring of the accumulated information content. This allows for the refinement and comparison of data, storage of different sets, and the application of various processing tools. In the third stage, the data is aggregated and represented as objects in the domain area according to the database schema of the application support system. The steps outlined in the proposed methodology enable full utilization of the specialized research area of informational content for building object models of application scenarios, structuring the domain data representation schema, and forming event-information forms. The practical application of this methodology in the examined subject area clarified the requirements, criteria, and filters for information search, data description, parameter cataloging, and their verification when solving a number of practical tasks. The software implementation of this approach could be oriented towards a web platform for local and cloud technologies, allowing remote collaborative access to content requirements and solution catalogs with role-based access for different categories of researchers and users.

Keywords: semantic requirements, data analysis, structuring of information content, BPMN diagrams, cataloging, domain model
pp. 600—608
For citation:
Sechenykh P. A. Methodology of a Comprehensive Approach to the Analysis, Structuring, and Aggregation of Data Based on the Requirements of Resource-Intensive Applications, Programmnaya Ingeneria, 2024, vol. 15, no. 11, pp. 600—608. DOI: 10.17587/prin.15.600-608. (in Russian).
References:
  1. Kimball R., Caserta J. The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data, John Wiley & Sons, 2004, 528 p.
  2. Abgaryan K. K. Multiscale modeling in material science, Moscow, MAKS Press, 2017, 284 p. (in Russian).
  3. BPMN Specification, available at: https://www.bpmn.org (date of access 01.07.2024).
  4. Sechenykh P. A. Mathematical modeling of the metrical parameters of hexagonal close-packed metalls. Izvestiya Vysshikh Uchebnykh Zavedenii. Materialy Elektronnoi Tekhniki — Materials of Electronics Engineering, 2022, vol. 25, no. 4, pp. 283—287. DOI: 10.17073/1609-3577-2022-4-283-287 (in Russian).
  5. Abgaryan K. K., Gavrilov E. S. Informational support of the multiscale modeling integration platform, Sistemy i Sredstva Informatiki, 2019, vol. 29, no. 1, pp. 53—62 (in Russian).
  6. Crystallography Open Database, available at: http://www.crystallography.net (date of access 01.07.2024).
  7. NIST Inorganic Crystal Structure Database, available at: https://icsd.nist.gov/(date of access 01.07.2024).
  8. Sechenykh P. A. Deep specification of the structural properties of crystalline compounds in the information support system for materials science problems, Highly Available Systems? 2023, vol. 19, no. 4, рр. 51—62. DOI: 10.18127/j20729472-202304-04 (in Russian).
  9. NOU "INTUIT", available at: https://intuit.ru/studies/courses/611/467/lecture/28793?page=8#keyword161(date of access 29.08.2023).
  10. Hahn T. International Tables for Crystallography. vol. A, Springer, 2005, 911 p.
  11. Huheey J. E. Inorganic Chemistry. Principles of structure and reactivity / J. E. Huheey. New York, 1983.
  12. WebElements, available at: https://www.webelements.com (date of access 20.09.2022).
  13. NSM Archive — Physical Properties of Semiconductors, available at: http://www.matprop.ru (date of access 01.07.2024).
  14. Materials Studio, available at: https://www.3ds.com/prod-ucts/biovia/materials-studio (date of access 01.07.2024).
  15. Overview — USPEX, available at: https://uspex-team.org/ru/uspex/overview (date of access 01.07.2024).
  16. ToposPro, available at: https://topospro.com/ (date of access 01.07.2024).
  17. Chemical Abstracts Service, available at: https://www.cas.org/ (date of access 01.07.2024).
  18. Ulrichsweb — Global Serial Directory, available at: https://ulrichsweb.serialssolutions.com/. (date of access 01.07.2024).
  19. VASP — Vienna Ab initio Simulation Package, available at: https://www.vasp.at/ (date of access 01.07.2024).
  20. SIESTA — Spanish Initiative for Electronic Simulations with Thousands of Atoms, available at: https://siesta-project.org (date of access 01.07.2024).
  21. Sechenykh P. A., Abgaryan K. K. Mathematical modeling of the crystal structure of metal oxides, Mater. I Mezhdunar. konf. Matematicheskoe modelirovaniye v materialovedenii elektronnykh komponentov» (MMMEK-2019), Moscow, 2019, MAKS Press, 2019, pp. 74—76. DOI: 10.29003/m682.MMMSEC-2019.