Improving collective I/O performance with machine learning supported auto-tuning

dc.contributor.authorBagbaba, Ayse
dc.date.accessioned2021-11-02T09:27:23Z
dc.date.available2021-11-02T09:27:23Z
dc.date.issued2020de
dc.description.abstractCollective Input and output (I/O) is an essential approach in high performance computing (HPC) applications. The achievement of effective collective I/O is a nontrivial job due to the complex interdependencies between the layers of I/O stack. These layers provide the best possible I/O performance through a number of tunable parameters. Sadly, the correct combination of parameters depends on diverse applications and HPC platforms. When a configuration space gets larger, it becomes difficult for humans to monitor the interactions between the configuration options. Engineers has no time or experience for exploring good configuration parameters for each problem because of long benchmarking phase. In most cases, the default settings are implemented, often leading to poor I/O efficiency. I/O profiling tools can not tell the optimal default setups without too much effort to analyzing the tracing results. In this case, an auto-tuning solution for optimizing collective I/O requests and providing system administrators or engineers the statistic information is strongly required. In this paper, a study of the machine learning supported collective I/O auto-tuning including the architecture and software stack is performed. Random forest regression model is used to develop a performance predictor model that can capture parallel I/O behavior as a function of application and file system characteristics. The modeling approach can provide insights into the metrics that impact I/O performance significantly.en
dc.identifier.isbn978-1-7281-7445-7
dc.identifier.urihttp://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-117811de
dc.identifier.urihttp://elib.uni-stuttgart.de/handle/11682/11781
dc.identifier.urihttp://dx.doi.org/10.18419/opus-11764
dc.language.isoende
dc.relationinfo:eu-repo/grantAgreement/EC/H2020/721865de
dc.relation.uridoi:10.1109/IPDPSW50202.2020.00138de
dc.rightsinfo:eu-repo/semantics/openAccessde
dc.subject.ddc004de
dc.titleImproving collective I/O performance with machine learning supported auto-tuningen
dc.typeconferenceObjectde
ubs.bemerkung.extern© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.de
ubs.fakultaetZentrale Einrichtungende
ubs.institutHöchstleistungsrechenzentrum Stuttgart (HLRS)de
ubs.konferenznameIEEE International Parallel and Distributed Processing Symposium (34th, 2020, Online)de
ubs.publikation.noppnyesde
ubs.publikation.source2020 IEEE International Parallel and Distributed Processing Symposium Workshops : IPDPSW. Piscataway, NJ : IEEE, 2020. - ISBN 978-1-7281-7445-7, S. 814-821de
ubs.publikation.typKonferenzbeitragde

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
Bagbaba_iWAPT2020.pdf
Size:
554.63 KB
Format:
Adobe Portable Document Format
Description:
MPIIO auto-tuning; Bagbaba iWAPT 2020

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.39 KB
Format:
Item-specific license agreed upon to submission
Description: