SDC-Report_v1

Input Data

The data set consists of 691 observations

Information on selected important (key) variables

  • Categorical key variable(s): DISTRICT | URB_RUR | S3B_CROP_PRODUCT_ID
  • Continuous key variable(s): S3BQ7C
  • Weight variable: HH_Final_weight
  • householdID: HOUSEHOLD_ID
  • strataVariable(s): not defined

Modifications

  • Modifications on categorical key variables: FALSE
  • Modifications on continuous key variables: TRUE
  • Modifications using PRAM: FALSE
  • Local suppressions: TRUE

Disclosure risk:

Frequency Analysis for Categorical Key Variables

Number of observations violating

  • 2-Anonymity: 0 (original dataset: 27)
  • 3-Anonymity: 0 (original dataset: 59)

Percentage of observations violating

  • 2-Anonymity: 0.000% (original dataset: 3.907%)
  • 3-Anonymity: 0.000% (original dataset: 8.538%)

Disclosure Risk for Categorical Variables

Expected Percentage of Reidentifications:

  • modified data: 0.034% (~ 0.236 observations)
  • original data: 0.178% (~ 1.229 observations)

10 combinations of categories with highest risks

DISTRICT URB_RUR S3B_CROP_PRODUCT_ID risk fk Fk hier_risk
MOYAMBA Urban Gari 0.008 3 194.4658 0.0103135
KARENE Urban Foo foo 0.004 3 390.2900 0.0038286
TONKOLILI Urban Foo foo 0.003 3 457.3164 0.0032693
BOMBALI Urban Foo foo 0.002 4 575.9685 0.0023096
MOYAMBA Urban Foo foo 0.002 6 545.1751 0.0103135
KAILAHUN Urban Foo foo 0.002 3 711.1293 0.0021049
KAMBIA Urban Gari 0.002 3 715.9302 0.0033296
PORTLOKO NA Rice flour 0.002 3 833.5657 0.0082564
KAILAHUN Urban Rice flour 0.002 4 745.5009 0.0020501
MOYAMBA NA Coconut oil 0.001 3 1022.2921 0.0014651

Disclosure Risk Continuous Scaled Variables

The (distance-based) disclosure risk for continous key variables is between 0.000% and 99.711% in the modified data.

In the original data, the risk is assumed to be approximately 100.000%.

Hierarchical risk

  • modified data: 0.484 (0.070%)
  • original data: 2.888 (0.418%)

Data Utility

Frequencies Categorical Key Variables:

Variable: DISTRICT

Categories Original data Modified data
KAILAHUN 135 133
KENEMA 117 116
KONO 21 16
BOMBALI 14 13
FALABA 4 1
KOINADUGU 5 2
TONKOLILI 47 44
KAMBIA 34 31
KARENE 29 29
PORTLOKO 44 42
BO 43 43
BONTHE 18 18
MOYAMBA 137 133
PUJEHUN 29 29
WESTERN RURAL 4 3
WESTERN URBAN 0 NA
NA 10 38

Variable: URB_RUR

Categories Original data Modified data
Rural 568 564
Urban 96 70
NA 27 57

Variable: S3B_CROP_PRODUCT_ID

Categories Original data Modified data
Maize flour 25 25
Rice flour 67 67
Polished, glazed, parboiled or converted 46 46
Processed or preserved fruit and vegetab 14 14
Palm oil 246 246
Coconut oil 4 3
Gari 127 127
Foo foo 115 115
Tobacco products (cigars, chewing tobacco) 4 4
Other (specify) 43 43
NA NA 1

Local Suppressions

The table below shows for each categorical key variable the number (1st row) and the percentages (2nd row) of suppressed cells.

DISTRICT URB_RUR S3B_CROP_PRODUCT_ID
Number of Suppression 28 30 1
Percentage 4.052 4.342 0.145

Data Utility of Continuous Scaled Key Variables

Univariate summary of variable S3BQ7C

Original Modified Difference
Min. 1.000 1.0 0.000
1st Qu. 20.000 20.0 0.000
Median 400.000 400.0 0.000
Mean 2057.757 797.6 1260.157
3rd Qu. 750.000 750.0 0.000
Max. 45642.857 4384.3 41258.557

Information Loss Criteria

  • Criteria IL1: 145.427%
  • Difference of Eigenvalues in modified data: 0.000% (0.00% in original data)

Boxplot of Differences

R-Code

Session-Info

About the R-Version

  • Version: R version 4.5.2 (2025-10-31 ucrt)
  • Platform: x86_64-w64-mingw32

Locales

LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8

Attached base packages

stats | graphics | grDevices | utils | datasets | methods | base

Other attached packages

labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)

Packages loaded via Namespace (but not attached)

tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)

Disclaimer

R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).

Please use the issue-tracker on github to report any issues:


This report was generated on Tue, 07/04/2026 at 23:04:35.