Checking one-second data
1. Data checking principle
A data-check routine for one second data should make use of one-minute (preferably checked data) and one-second step_1 data. Contents of these data sets are compared, and, for this purpose, partly recalculated based on INTERMAGNET and IAGA recommended techniques. A quick test will investigate data from a randomly chosen month and is sufficient to check the principal correctness underlying routines, formats and meta information. A full test will perform a detailed data check for all data. A report with all check details should be obtained and made accessible to the data submitter.
2. Issues to be checked
-
Submitted files and formats: Are all requested files available and are they submitted in correct and readable formats (IAGA-2002, IAF, IMCDF).
-
Meta information: Do all files contain the requested meta information and is this meta information consistent between all different files.
-
Data content: Are the IAF one-minute, one-hour, and daily averages consistent and obtained using IM filter procedures. Are F values provided for one-minute and one-second, and are they independent measures of the field. Are delta F variations within acceptable limits. IMCDF: Is the leap second table up-to-date.
-
Data consistency: Can one-minute data be reproduced by filtering one-second data using IM recommended procedures on filtering and outlier treatment.
-
Baseline variation: Are periodic measurements available. Are residuals and amplitudes in an acceptable range.
-
Eventually: Is the spectral data content as expected for a natural geomagnetic signal or are there any significant technical contributions. Noise level.
3. Step wise performance of the data check
(as performed with MagPy 0.9.6)
-
directories and existence of files
-
file names, access and basic header information
-
data content and consistency of minute data
-
second data and consistency with minute data (IM recommendations)
-
basevalues and adopted baseline variation
-
yearly means, meta information
-
activity - "not included"
The results of each step are summarized and rated with a grade from 1 to 6. Grades of 5 and above indicate "significant" issues regarding IM recommendations to be considered. A detailed description on how to use MagPy for these tests can be found in the manual (http://cobs.zamg.ac.at/data/index.php/en/downloads/category/1-magnetism).
4. Issues and suggestions
-
Currently one second data is uploaded in two different formats using, however, various different packing routines. Should we suggest a common submission format? For IAGA2002 I would recommend daily zipped IAGA files. For IMCDF monthly files are already requested.
-
threshold for differences between minute data and IM filtered one-second data: how much difference do we tolerate?
5. Recommendations for analysis
- To avoid downloading all files, one could mount the ftp directories on the local system. However, most files are uploaded as various different archives.
- Run only quick tests first. Only for data passing quick test, run full test.
- When using MagPy eventually skip step 5 (baseline) and 6 (meta, yearmean) for quick test, if minute data has been accepted
- Some issues are easy to be solved, others give indications for underlying procedures.
Example:
- if filtered second data differs from minute data only for few days, this indicates that most likely the outlier treatment differs from the IAGA 90% rule
- if the difference of filtered second data and minute data is evenly distributed with values exceeding the numerical uncertainty then most likely a different filter procedure (window, type) was used
- if the difference is not evenly distributed and some "function" is observable, then eventually different baseline treatments might be the cause
6. Example analysis:
Please note, the following table contains only a test analysis and is not related to any decision from the data committee. All test are also used as a run-time test for MagPy. Error messages related to certain files structures (e.g. step 5 failure related to BLV files or step 6 related to yearmean files) do not necessary indicate that the file structure is wrong. MagPy eventually just cannot interpret the format correctly yet. With Python3 MagPy >= 0.9.6 is required for the analysis.
2016
| Obs |
Format (second) |
QuickTest |
Observed issues |
QuickTest result |
FullTest |
Issues |
Summary |
| WIC |
IMCDF |
1,1,1,1,1,3,1 |
Location and AltitudeDiff |
second data OK |
|
|
|
| UPS |
IMCDF daily |
1,5,1,4, |
second data in daily cdfs, minor filter differences for two days, step5 failed |
Monthly files obligatory? |
|
|
|
| TUC |
IAGA (year zip) |
1, |
iaga second data not readable |
|
|
|
|
| SJG |
IAGA (year zip) |
1,1,5,1,3,3,2 |
inconsistency in IAF daily mean 2016-11-21, yearly means differ between yearmean and blv files |
second data OK |
|
|
|
| SIT |
IAGA (monthly zip) |
1,1,3,4,3,3,2 |
hourly IAF complete?, filtered one-second differs slightly from one-minute |
second data OK, check z comp |
|
|
|
| SHU |
IAGA (year zip) |
1,1,3,4,3,3,2 |
hourly IAF complete?, filtered one-second differs slightly from one-minute |
second data OK, check z comp |
|
|
|
| NEW |
|
|
|
|
|
|
|
| MMB |
IMCDF daily |
1,5, |
second data in daily cdfs, IAF data not readable |
Monthly files obligatory? |
|
|
|
| MCQ |
|
|
|
|
|
|
|
| MAW |
IMCDF |
1 |
no leap second information |
|
|
|
|
| LYC |
|
|
|
|
|
|
|
| LRM |
IMCDF |
1 |
no leap second information |
|
|
|
|
| KNY |
|
|
|
|
|
|
|
| KDU |
|
|
|
|
|
|
|
| KAK |
|
|
|
|
|
|
|
| HRN |
|
|
|
|
|
|
|
| HON |
|
|
|
|
|
|
|
| HLP |
|
|
|
|
|
|
|
| HER |
|
|
|
|
|
|
|
| GNG |
|
|
|
|
|
|
|
| FRN |
|
|
|
|
|
|
|
| FRD |
|
|
|
|
|
|
|
| EBR |
IMCDF |
1,3,1,1, |
file names not as expected for ImagCDF, step5 failed |
second data OK |
|
|
|
| DED |
|
|
|
|
|
|
|
| CTA |
|
|
|
|
|
|
|
| CSY |
|
|
|
|
|
|
|
| CNB |
|
|
|
|
|
|
|
| CMO |
|
|
|
|
|
|
|
| CKI |
|
|
|
|
|
|
|
| BSL |
|
|
|
|
|
|
|
| BRW |
|
|
|
|
|
|
|
| BOU |
|
|
|
|
|
|
|
| BEL |
|
|
|
|
|
|
|
| BDV |
|
|
|
|
|
|
|
| ASP |
|
|
|
|
|
|
|
| ABK |
|
|
|
|
|
|
|
Checking one-second data
1. Data checking principle
A data-check routine for one second data should make use of one-minute (preferably checked data) and one-second step_1 data. Contents of these data sets are compared, and, for this purpose, partly recalculated based on INTERMAGNET and IAGA recommended techniques. A quick test will investigate data from a randomly chosen month and is sufficient to check the principal correctness underlying routines, formats and meta information. A full test will perform a detailed data check for all data. A report with all check details should be obtained and made accessible to the data submitter.
2. Issues to be checked
Submitted files and formats: Are all requested files available and are they submitted in correct and readable formats (IAGA-2002, IAF, IMCDF).
Meta information: Do all files contain the requested meta information and is this meta information consistent between all different files.
Data content: Are the IAF one-minute, one-hour, and daily averages consistent and obtained using IM filter procedures. Are F values provided for one-minute and one-second, and are they independent measures of the field. Are delta F variations within acceptable limits. IMCDF: Is the leap second table up-to-date.
Data consistency: Can one-minute data be reproduced by filtering one-second data using IM recommended procedures on filtering and outlier treatment.
Baseline variation: Are periodic measurements available. Are residuals and amplitudes in an acceptable range.
Eventually: Is the spectral data content as expected for a natural geomagnetic signal or are there any significant technical contributions. Noise level.
3. Step wise performance of the data check
(as performed with MagPy 0.9.6)
directories and existence of files
file names, access and basic header information
data content and consistency of minute data
second data and consistency with minute data (IM recommendations)
basevalues and adopted baseline variation
yearly means, meta information
activity - "not included"
The results of each step are summarized and rated with a grade from 1 to 6. Grades of 5 and above indicate "significant" issues regarding IM recommendations to be considered. A detailed description on how to use MagPy for these tests can be found in the manual (http://cobs.zamg.ac.at/data/index.php/en/downloads/category/1-magnetism).
4. Issues and suggestions
Currently one second data is uploaded in two different formats using, however, various different packing routines. Should we suggest a common submission format? For IAGA2002 I would recommend daily zipped IAGA files. For IMCDF monthly files are already requested.
threshold for differences between minute data and IM filtered one-second data: how much difference do we tolerate?
5. Recommendations for analysis
Example:
- if filtered second data differs from minute data only for few days, this indicates that most likely the outlier treatment differs from the IAGA 90% rule
- if the difference of filtered second data and minute data is evenly distributed with values exceeding the numerical uncertainty then most likely a different filter procedure (window, type) was used
- if the difference is not evenly distributed and some "function" is observable, then eventually different baseline treatments might be the cause
6. Example analysis:
Please note, the following table contains only a test analysis and is not related to any decision from the data committee. All test are also used as a run-time test for MagPy. Error messages related to certain files structures (e.g. step 5 failure related to BLV files or step 6 related to yearmean files) do not necessary indicate that the file structure is wrong. MagPy eventually just cannot interpret the format correctly yet. With Python3 MagPy >= 0.9.6 is required for the analysis.
2016