| Literature DB >> 35610234 |
Matej Petković1,2, Luke Lucas3, Jurica Levatić4, Martin Breskvar4, Tomaž Stepišnik5,4, Ana Kostovska5,4, Panče Panov5,4, Aljaž Osojnik4, Redouane Boumghar6, José A Martínez-Heras7, James Godfrey6, Alessandro Donati6, Sašo Džeroski4, Nikola Simidjievski5,4,8, Bernard Ženko4, Dragi Kocev9,10.
Abstract
We present six datasets containing telemetry data of the Mars Express Spacecraft (MEX), a spacecraft orbiting Mars operated by the European Space Agency. The data consisting of context data and thermal power consumption measurements, capture the status of the spacecraft over three Martian years, sampled at six different time resolutions that range from 1 min to 60 min. From a data analysis point-of-view, these data are challenging even for the more sophisticated state-of-the-art artificial intelligence methods. In particular, given the heterogeneity, complexity, and magnitude of the data, they can be employed in a variety of scenarios and analyzed through the prism of different machine learning tasks, such as multi-target regression, learning from data streams, anomaly detection, clustering, etc. Analyzing MEX's telemetry data is critical for aiding very important decisions regarding the spacecraft's status and operation, extracting novel knowledge, and monitoring the spacecraft's health, but the data can also be used to benchmark artificial intelligence methods designed for a variety of tasks.Entities:
Year: 2022 PMID: 35610234 PMCID: PMC9130140 DOI: 10.1038/s41597-022-01336-z
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Illustrations of how we calculate the descriptive features. (a) The solar aspect angles give the orientation of the spacecraft. The angle between the line Sun-MEX and the normal vector to the front side of the cube (), is shown. (b) A conceptual illustration of the elliptical orbit of MEX with Mars as a focal point. The two features and give the approximate position of MEX in the orbit. In this example, they give the (normalized) time since the last passing through the pericenter and the (normalized) time until the next passing through the apocenter. The sum of the values of the two features is always 1.0. Note that the illustration is not to scale. (c) An illustration of the preprocessing of the electrical currents. The known measurements on the interval (blue dots) and the first measurement before and after this interval (green dots) define the linearly interpolated curve from which the values at the different boundaries (red dots) are taken. The area under that curve (blue-shaded area), divided by the length of the interval Δt, is the average value of the electrical current for the given time interval.
Summary of the provided datasets at each time resolution: Number of examples, number of features per group, the number of targets, proportion of missing values and dataset size (measured in megabytes (MB)).
| resolution (min) | examples | targets | features | proportion of missing values | size [MB] | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| dmop | evt | ftl | influx | lt | total | |||||
| 1 | 3957119 | 33 | 380 | 2 | 23 | 182 | 2 | 589 | 0.094% | 17892 |
| 5 | 791424 | 33 | 380 | 2 | 23 | 182 | 2 | 589 | 0.094% | 3616 |
| 10 | 395712 | 33 | 380 | 2 | 23 | 182 | 2 | 589 | 0.095% | 1817 |
| 15 | 263808 | 33 | 380 | 2 | 23 | 147 | 2 | 554 | 0.100% | 1110 |
| 30 | 131904 | 33 | 380 | 2 | 23 | 112 | 2 | 519 | 0.110% | 503 |
| 60 | 65952 | 33 | 380 | 2 | 23 | 77 | 2 | 484 | 0.120% | 225 |
Fig. 2Top: A sample of the real values of the electrical current (in ) running through four MEX power lines located at different parts of the spacecraft for a selected time window (first week of January 2009), at a 15 min resolution. Bottom: Comparison of value distributions (in ) at different time-resolutions (1 min, 15 min and 60 min) to the unprocessed raw data of four MEX power lines (b) NPWD2372, (c) NPWD2791, (d) NPWD2721, and (e) NPWD2771) illustrated in (a). We can see that, in general, the prepossessed data have expected properties: For fast-changing power-lines the modes are joined at higher time-resolutions, whereas for slower it remain similar. From a data-analysis perspective, this further justifies the need of analysing MEX’s behaviour at different time-resolutions.
Fig. 3Distributions of a descriptive energy-influx feature panels@influx at different time resolutions (1 min, 15 min and 60 min) in terms of (a) box-plots of actual values and (b) density plot of normalized values of the influx (). panels@influx denotes the influx on the spacecraft’s solar panels. We observe that, as expected, while influx values vary in magnitude between time-resolutions, their distribution properties remain similar. Moreover, (c) on a macro-scale (period 2009-2014) the energy-influx measured at the solar panels, panels@influx, depends on the value of the solar constant. However, on a (d) micro-scale (12-13th February 2009), the same influx depends more on the angle and occurrence of (pen)umbras. Such behavior is expected for this feature. For visualisation purposes, the values depicted in subfigures (b), (c) and (d) are normalized. Values in (b) are normalized to the min-max interval of the 1min dataset, while (c) and (d) to [0, 1] interval.
| Measurement(s) | electric current |
| Technology Type(s) | current readings in spacecraft housekeeping telemetry |
| Sample Characteristic - Environment | outer space |