SDC-Report_v1

Input Data

The data set consists of 1255 observations

Information on selected important (key) variables

  • Categorical key variable(s): DISTRICT | URB_RUR | S2B_CROP_ID
  • Continuous key variable(s): S2BQ2_Qty_sold_kg | S2BQ3B
  • Weight variable: HH_Final_weight
  • householdID: HOUSEHOLD_ID
  • strataVariable(s): not defined

Modifications

  • Modifications on categorical key variables: FALSE
  • Modifications on continuous key variables: TRUE
  • Modifications using PRAM: FALSE
  • Local suppressions: TRUE

Disclosure risk:

Frequency Analysis for Categorical Key Variables

Number of observations violating

  • 2-Anonymity: 0 (original dataset: 3)
  • 3-Anonymity: 0 (original dataset: 11)

Percentage of observations violating

  • 2-Anonymity: 0.000% (original dataset: 0.239%)
  • 3-Anonymity: 0.000% (original dataset: 0.876%)

Disclosure Risk for Categorical Variables

Expected Percentage of Reidentifications:

  • modified data: 0.015% (~ 0.194 observations)
  • original data: 0.028% (~ 0.353 observations)

10 combinations of categories with highest risks

DISTRICT URB_RUR S2B_CROP_ID risk fk Fk hier_risk
KAILAHUN Urban Coffee 0.010 6 115.0861 0.0103194
KENEMA Rural Coffee 0.001 7 1041.4632 0.0011966
KENEMA Rural Other Fruits and Nuts Crops 0.001 8 1049.2550 0.0030853
BONTHE Rural Oil Palm 0.001 5 1179.4662 0.0010587
BOMBALI Rural Oil Palm 0.001 6 1199.0135 0.0011202
KARENE Rural Oil Palm 0.001 6 1213.5774 0.0009878
KARENE NA Oil Palm 0.001 6 1213.5774 0.0009878
BO Rural Other Fruits and Nuts Crops 0.001 6 1232.4847 0.0009727
PORTLOKO NA Oil Palm 0.001 5 1347.9396 0.0009265
PORTLOKO Rural Oil Palm 0.001 5 1347.9396 0.0009265

Disclosure Risk Continuous Scaled Variables

The (distance-based) disclosure risk for continous key variables is between 0.000% and 94.263% in the modified data.

In the original data, the risk is assumed to be approximately 100.000%.

Hierarchical risk

  • modified data: 0.376 (0.030%)
  • original data: 0.611 (0.049%)

Data Utility

Frequencies Categorical Key Variables:

Variable: DISTRICT

Categories Original data Modified data
KAILAHUN 556 556
KENEMA 279 279
KONO 296 296
BOMBALI 4 3
FALABA 4 NA
KOINADUGU 0 NA
TONKOLILI 33 32
KAMBIA 1 NA
KARENE 5 5
PORTLOKO 5 5
BO 12 12
BONTHE 2 2
MOYAMBA 39 39
PUJEHUN 11 11
WESTERN RURAL 0 NA
WESTERN URBAN 0 NA
NA 8 15

Variable: URB_RUR

Categories Original data Modified data
Rural 1102 1102
Urban 124 120
NA 29 33

Variable: S2B_CROP_ID

Categories Original data Modified data
Rice 0 NA
Maize 0 NA
Millets 0 NA
Chilli Peper 0 NA
Cucumber 0 NA
Okra 0 NA
Sweet Peper 0 NA
Krain Krain 0 NA
Potato leaves 0 NA
Cocoa 622 622
Coffee 104 104
Kola 34 34
Banana 0 NA
Groundnut 0 NA
Soya Beans 0 NA
Sesame(benie) 0 NA
Oil Palm 441 441
Cassava 0 NA
Yams 0 NA
Broad beans 0 NA
Other Cereal Crops 0 NA
Other Vegetable Crops 0 NA
Other Fruits and Nuts Crops 41 41
Other Oil Seeds Crops 11 11
Other Tuber/Root Crops 0 NA
Other Leguminous Crops 0 NA
Other Industrial Crops 0 NA
NA 2 2

Local Suppressions

The table below shows for each categorical key variable the number (1st row) and the percentages (2nd row) of suppressed cells.

DISTRICT URB_RUR S2B_CROP_ID
Number of Suppression 7 4 0
Percentage 0.558 0.319 0.000

Data Utility of Continuous Scaled Key Variables

Univariate summary of variable S2BQ2_Qty_sold_kg

Original Modified Difference
Min. 0.0080 0.0 0.0080
1st Qu. 65.0000 65.0 0.0000
Median 130.0000 130.0 0.0000
Mean 341.9813 179.5 162.4813
3rd Qu. 260.0000 260.0 0.0000
Max. 32175.0000 466.2 31708.8000

Univariate summary of variable S2BQ3B

Original Modified Difference
Min. 1.732350e-02 0.0 1.732350e-02
1st Qu. 3.940000e+02 394.0 0.000000e+00
Median 1.000000e+03 1000.0 0.000000e+00
Mean 4.209047e+03 1950.0 2.259047e+03
3rd Qu. 2.487862e+03 2487.9 -3.816160e-02
Max. 8.211570e+05 8331.4 8.128256e+05

Information Loss Criteria

  • Criteria IL1: 3550.843%
  • Difference of Eigenvalues in modified data: -255.575% (0.00% in original data)

Boxplot of Differences

R-Code

Session-Info

About the R-Version

  • Version: R version 4.5.2 (2025-10-31 ucrt)
  • Platform: x86_64-w64-mingw32

Locales

LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8

Attached base packages

stats | graphics | grDevices | utils | datasets | methods | base

Other attached packages

labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)

Packages loaded via Namespace (but not attached)

tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)

Disclaimer

R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).

Please use the issue-tracker on github to report any issues:


This report was generated on Wed, 01/04/2026 at 21:25:18.