SDC-Report_v1

Input Data

The data set consists of 9430 observations

Information on selected important (key) variables

  • Categorical key variable(s): DISTRICT | URB_RUR | Crop_ID
  • Continuous key variable(s): production_KGS | DEST_1 | DEST_2 | DEST_3 | DEST_4 | DEST_5 | DEST_6 | DEST_7 | DEST_8 | DEST_9 | DEST_10 | DEST_11 | DEST_12
  • Weight variable: HH_Final_weight
  • householdID: HOUSEHOLD_ID
  • strataVariable(s): not defined

Modifications

  • Modifications on categorical key variables: FALSE
  • Modifications on continuous key variables: TRUE
  • Modifications using PRAM: FALSE
  • Local suppressions: TRUE

Disclosure risk:

Frequency Analysis for Categorical Key Variables

Number of observations violating

  • 2-Anonymity: 0 (original dataset: 4)
  • 3-Anonymity: 0 (original dataset: 20)

Percentage of observations violating

  • 2-Anonymity: 0.000% (original dataset: 0.042%)
  • 3-Anonymity: 0.000% (original dataset: 0.212%)

Disclosure Risk for Categorical Variables

Expected Percentage of Reidentifications:

  • modified data: 0.012% (~ 1.118 observations)
  • original data: 0.015% (~ 1.382 observations)

10 combinations of categories with highest risks

DISTRICT URB_RUR Crop_ID risk fk Fk hier_risk
MOYAMBA Urban Banana 0.007 3 208.9014 0.0071292
MOYAMBA Rural Cocoa 0.004 3 349.2412 0.0043916
TONKOLILI Urban Krain Krain 0.004 3 404.5985 0.0047751
MOYAMBA Urban Okra 0.003 3 432.9942 0.0049036
KAMBIA Urban Oil Palm 0.003 3 451.9014 0.0041282
FALABA Urban Oil Palm 0.003 3 470.8783 0.0033514
TONKOLILI Urban Sweet Peper 0.003 3 527.2700 0.0028862
TONKOLILI NA Coffee 0.003 3 547.1005 0.0043672
KAMBIA Urban Krain Krain 0.003 3 555.9193 0.0029694
FALABA Urban Millets 0.003 3 561.7391 0.0045446

Disclosure Risk Continuous Scaled Variables

The (distance-based) disclosure risk for continous key variables is between 0.000% and 96.490% in the modified data.

In the original data, the risk is assumed to be approximately 100.000%.

Hierarchical risk

  • modified data: 3.905 (0.041%)
  • original data: 5.018 (0.053%)

Data Utility

Frequencies Categorical Key Variables:

Variable: DISTRICT

Categories Original data Modified data
KAILAHUN 1653 1653
KENEMA 979 978
KONO 1359 1358
BOMBALI 300 299
FALABA 334 333
KOINADUGU 125 125
TONKOLILI 1041 1041
KAMBIA 417 415
KARENE 280 279
PORTLOKO 662 661
BO 699 699
BONTHE 138 137
MOYAMBA 729 728
PUJEHUN 512 512
WESTERN RURAL 98 97
WESTERN URBAN 0 NA
NA 104 115

Variable: URB_RUR

Categories Original data Modified data
Rural 7800 7796
Urban 1314 1308
NA 316 326

Variable: Crop_ID

Categories Original data Modified data
Rice 3171 3171
Maize 581 581
Millets 73 73
Chilli Peper 196 196
Cucumber 179 179
Okra 159 159
Sweet Peper 110 110
Krain Krain 61 61
Potato leaves 82 82
Cocoa 904 904
Coffee 186 186
Kola 64 64
Banana 95 95
Groundnut 962 962
Soya Beans 73 73
Sesame(benie) 105 105
Oil Palm 713 713
Cassava 1076 1076
Yams 76 76
Broad beans 136 136
Other Cereal Crops 0 NA
Other Vegetable Crops 125 125
Other Fruits and Nuts Crops 133 133
Other Oil Seeds Crops 25 25
Other Tuber/Root Crops 87 87
Other Leguminous Crops 50 50
Other Industrial Crops 8 8

Local Suppressions

The table below shows for each categorical key variable the number (1st row) and the percentages (2nd row) of suppressed cells.

DISTRICT URB_RUR Crop_ID
Number of Suppression 11 10 0
Percentage 0.117 0.106 0.000

Data Utility of Continuous Scaled Key Variables

Univariate summary of variable production_KGS

Original Modified Difference
Min. 2.14682 2.1 0.0468201
1st Qu. 391.17077 391.2 -0.0292313
Median 978.05923 978.1 -0.0407654
Mean 2794.73627 2794.7 0.0362667
3rd Qu. 2312.01184 2312.0 0.0118408
Max. 97567.16406 97567.2 -0.0359375

Univariate summary of variable DEST_1

Original Modified Difference
Min. 0.0507396 0.1 -0.0492604
1st Qu. 17.8921967 17.9 -0.0078033
Median 47.5138321 47.5 0.0138321
Mean 197.4092338 193.6 3.8092338
3rd Qu. 129.3279877 129.3 0.0279877
Max. 7770.9951172 4765.6 3005.3951172

Univariate summary of variable DEST_2

Original Modified Difference
Min. 0.00106 0.0 1.060000e-03
1st Qu. 38.13836 38.1 3.836290e-02
Median 115.20111 115.2 1.110800e-03
Mean 357.56677 337.1 2.046677e+01
3rd Qu. 315.24680 315.2 4.679570e-02
Max. 20299.00977 4937.2 1.536181e+04

Univariate summary of variable DEST_3

Original Modified Difference
Min. 0.0000000 0.0 0.0000000
1st Qu. 0.0000000 0.0 0.0000000
Median 0.6833843 0.7 -0.0166157
Mean 40.5362756 38.7 1.8362756
3rd Qu. 21.9112706 21.9 0.0112706
Max. 7080.8574219 1610.1 5470.7574219

Univariate summary of variable DEST_4

Original Modified Difference
Min. 0.0066955 0.0 0.0066955
1st Qu. 9.8233783 9.8 0.0233783
Median 26.4751396 26.5 -0.0248604
Mean 71.9891663 71.9 0.0891663
3rd Qu. 70.3893700 70.4 -0.0106300
Max. 2763.4555664 1990.4 773.0555664

Univariate summary of variable DEST_5

Original Modified Difference
Min. 0.0016407 0.0 0.0016407
1st Qu. 23.7860074 23.8 -0.0139926
Median 62.8803272 62.9 -0.0196728
Mean 164.9122620 161.2 3.7122620
3rd Qu. 153.3523598 153.4 -0.0476402
Max. 4727.1367188 2488.9 2238.2367188

Univariate summary of variable DEST_6

Original Modified Difference
Min. 0.0044637 0.0 0.0044637
1st Qu. 19.0588002 19.1 -0.0411998
Median 53.2134895 53.2 0.0134895
Mean 170.8510254 160.4 10.4510254
3rd Qu. 140.9511375 141.0 -0.0488625
Max. 5864.0458984 2046.4 3817.6458984

Univariate summary of variable DEST_7

Original Modified Difference
Min. 0.022235 0.0 0.0222350
1st Qu. 26.471132 26.5 -0.0288677
Median 73.139065 73.1 0.0390648
Mean 211.224804 203.4 7.8248041
3rd Qu. 185.135243 185.1 0.0352425
Max. 7077.048828 3112.8 3964.2488281

Univariate summary of variable DEST_8

Original Modified Difference
Min. 0.0242283 0.0 0.0242283
1st Qu. 14.8200226 14.8 0.0200226
Median 41.4326363 41.4 0.0326363
Mean 117.7030881 112.0 5.7030881
3rd Qu. 106.4675636 106.5 -0.0324364
Max. 3833.5402832 1448.1 2385.4402832

Univariate summary of variable DEST_9

Original Modified Difference
Min. 0.0045273 0.0 0.0045273
1st Qu. 11.0084879 11.0 0.0084879
Median 31.6716042 31.7 -0.0283958
Mean 94.8996744 93.4 1.4996744
3rd Qu. 85.9215946 85.9 0.0215946
Max. 2780.9445801 1584.3 1196.6445801

Univariate summary of variable DEST_10

Original Modified Difference
Min. 9.058730e-02 0.1 -0.0094127
1st Qu. 1.926946e+01 19.3 -0.0305436
Median 8.848460e+01 88.5 -0.0153961
Mean 4.582137e+02 421.3 36.9136681
3rd Qu. 3.203775e+02 320.4 -0.0224518
Max. 1.905757e+04 6168.4 12889.1664062

Univariate summary of variable DEST_11

Original Modified Difference
Min. 6.283340e-02 0.1 -0.0371666
1st Qu. 4.414354e+01 44.1 0.0435442
Median 1.542056e+02 154.2 0.0056122
Mean 6.364878e+02 577.1 59.3877739
3rd Qu. 4.541685e+02 454.2 -0.0314972
Max. 2.420275e+04 7270.1 16932.6539063

Univariate summary of variable DEST_12

Original Modified Difference
Min. 0.0049418 0.0 0.0049418
1st Qu. 37.0617895 37.1 -0.0382105
Median 104.9920883 105.0 -0.0079117
Mean 272.9425542 265.7 7.2425542
3rd Qu. 268.9802399 269.0 -0.0197601
Max. 7223.1406250 3318.7 3904.4406250

Information Loss Criteria

  • Criteria IL1: 33620.890%
  • Difference of Eigenvalues in modified data: -135.046% (0.00% in original data)

Boxplot of Differences

R-Code

Session-Info

About the R-Version

  • Version: R version 4.5.2 (2025-10-31 ucrt)
  • Platform: x86_64-w64-mingw32

Locales

LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8

Attached base packages

stats | graphics | grDevices | utils | datasets | methods | base

Other attached packages

labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)

Packages loaded via Namespace (but not attached)

tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)

Disclaimer

R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).

Please use the issue-tracker on github to report any issues:


This report was generated on Fri, 10/04/2026 at 10:37:59.