Input Data
The data set consists of 1255 observations.
Information on selected important (key) variables
- Categorical key variable(s): DISTRICT | URB_RUR | S2B_CROP_ID
- Continuous key variable(s): S2BQ2_Qty_sold_kg | S2BQ3B
- Weight variable: HH_Final_weight
- householdID: HOUSEHOLD_ID
- strataVariable(s): not defined
Modifications
- Modifications on categorical key variables: FALSE
- Modifications on continuous key variables: TRUE
- Modifications using PRAM: FALSE
- Local suppressions: TRUE
Disclosure risk:
Frequency Analysis for Categorical Key Variables
Number of observations violating
- 2-Anonymity: 0 (original dataset: 3)
- 3-Anonymity: 0 (original dataset: 11)
Percentage of observations violating
- 2-Anonymity: 0.000% (original dataset: 0.239%)
- 3-Anonymity: 0.000% (original dataset: 0.876%)
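The counts and percentages above can be read as follows: a record violates k-anonymity when its combination of categorical key values is shared by fewer than k records in the sample. The sketch below illustrates that reading on a small set of hypothetical toy records, then recomputes the reported percentages from the counts and the 1255 observations. The function names `count_violations` and `pct` are illustrative assumptions, not part of sdcMicro, and the sketch ignores details such as sdcMicro's handling of missing values.

```python
from collections import Counter

def count_violations(records, k):
    """Count records whose key combination occurs fewer than k times."""
    freq = Counter(records)  # sample frequency of each key combination
    return sum(1 for r in records if freq[r] < k)

# Hypothetical toy records: (DISTRICT, URB_RUR, S2B_CROP_ID)
toy = [
    ("KENEMA", "Rural", "Cocoa"),
    ("KENEMA", "Rural", "Cocoa"),
    ("KENEMA", "Rural", "Cocoa"),
    ("KONO", "Rural", "Coffee"),
    ("KONO", "Rural", "Coffee"),
    ("BO", "Urban", "Kola"),
]
violations_2 = count_violations(toy, 2)  # only the single BO record
violations_3 = count_violations(toy, 3)  # the BO record plus both KONO records

def pct(count, n):
    """Share of observations, in percent, rounded as in the report."""
    return round(100 * count / n, 3)

n_obs = 1255
pct_2 = pct(3, n_obs)   # original data, 2-anonymity
pct_3 = pct(11, n_obs)  # original data, 3-anonymity
print(violations_2, violations_3, pct_2, pct_3)
```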
Disclosure Risk for Categorical Variables
Expected Percentage of Reidentifications:
- modified data: 0.015% (~ 0.194 observations)
- original data: 0.028% (~ 0.353 observations)
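As a rough check of how the two figures on each line relate, the sketch below assumes that the expected number of re-identifications is the sum of the individual record risks and that the percentage is that sum divided by the number of observations. The helper names and the toy risk values are hypothetical; only the counts 0.194 and 0.353 and the sample size 1255 come from this report.

```python
def expected_reidentifications(risks):
    """Expected number of re-identified records: sum of individual risks."""
    return sum(risks)

def as_percentage(expected_count, n):
    """Expected re-identifications as a percentage of n observations."""
    return 100 * expected_count / n

# Hypothetical toy risks, chosen only to show the summation
toy_total = expected_reidentifications([0.5, 0.25, 0.25])

n_obs = 1255
modified_pct = round(as_percentage(0.194, n_obs), 3)
original_pct = round(as_percentage(0.353, n_obs), 3)
print(toy_total, modified_pct, original_pct)
```

Under these assumptions the rounded percentages agree with the reported 0.015% and 0.028%.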
10 combinations of categories with highest risks
| DISTRICT | URB_RUR | S2B_CROP_ID | risk | fk | Fk | hier_risk |
|---|---|---|---|---|---|---|
| KAILAHUN | Urban | Coffee | 0.010 | 6 | 115.0861 | 0.0103194 |
| KENEMA | Rural | Coffee | 0.001 | 7 | 1041.4632 | 0.0011966 |
| KENEMA | Rural | Other Fruits and Nuts Crops | 0.001 | 8 | 1049.2550 | 0.0030853 |
| BONTHE | Rural | Oil Palm | 0.001 | 5 | 1179.4662 | 0.0010587 |
| BOMBALI | Rural | Oil Palm | 0.001 | 6 | 1199.0135 | 0.0011202 |
| KARENE | Rural | Oil Palm | 0.001 | 6 | 1213.5774 | 0.0009878 |
| KARENE | NA | Oil Palm | 0.001 | 6 | 1213.5774 | 0.0009878 |
| BO | Rural | Other Fruits and Nuts Crops | 0.001 | 6 | 1232.4847 | 0.0009727 |
| PORTLOKO | NA | Oil Palm | 0.001 | 5 | 1347.9396 | 0.0009265 |
| PORTLOKO | Rural | Oil Palm | 0.001 | 5 | 1347.9396 | 0.0009265 |
Disclosure Risk Continuous Scaled Variables
The (distance-based) disclosure risk for continuous key variables is between 0.000% and 94.263% in the modified data.
In the original data, the risk is assumed to be approximately 100.000%.
Hierarchical risk
- modified data: 0.376 (0.030%)
- original data: 0.611 (0.049%)
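The hierarchical risk accounts for the household structure given by HOUSEHOLD_ID. A common formulation, assumed here rather than confirmed by this report, takes the household risk as the probability that at least one member is re-identified, treating the individual risks as independent. The sketch below shows that formula on hypothetical values and recomputes the reported percentages from the expected counts; `household_risk` is an illustrative name, not an sdcMicro function.

```python
import math

def household_risk(individual_risks):
    """Probability that at least one household member is re-identified,
    assuming independent individual risks: 1 - prod(1 - r_i)."""
    return 1 - math.prod(1 - r for r in individual_risks)

# Hypothetical household with three records, each with risk 0.001
toy_household = household_risk([0.001, 0.001, 0.001])

n_obs = 1255
modified_pct = round(100 * 0.376 / n_obs, 3)
original_pct = round(100 * 0.611 / n_obs, 3)
print(toy_household, modified_pct, original_pct)
```

Under this formulation a record's hierarchical risk is never smaller than its individual risk, which is consistent with rows in the table above where hier_risk exceeds the rounded risk column.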
Data Utility
Frequencies Categorical Key Variables:
Variable: DISTRICT
| Categories | Original data | Modified data |
|---|---|---|
| KAILAHUN | 556 | 556 |
| KENEMA | 279 | 279 |
| KONO | 296 | 296 |
| BOMBALI | 4 | 3 |
| FALABA | 4 | NA |
| KOINADUGU | 0 | NA |
| TONKOLILI | 33 | 32 |
| KAMBIA | 1 | NA |
| KARENE | 5 | 5 |
| PORTLOKO | 5 | 5 |
| BO | 12 | 12 |
| BONTHE | 2 | 2 |
| MOYAMBA | 39 | 39 |
| PUJEHUN | 11 | 11 |
| WESTERN RURAL | 0 | NA |
| WESTERN URBAN | 0 | NA |
| NA | 8 | 15 |
Variable: URB_RUR
| Categories | Original data | Modified data |
|---|---|---|
| Rural | 1102 | 1102 |
| Urban | 124 | 120 |
| NA | 29 | 33 |
Variable: S2B_CROP_ID
| Categories | Original data | Modified data |
|---|---|---|
| Rice | 0 | NA |
| Maize | 0 | NA |
| Millets | 0 | NA |
| Chilli Peper | 0 | NA |
| Cucumber | 0 | NA |
| Okra | 0 | NA |
| Sweet Peper | 0 | NA |
| Krain Krain | 0 | NA |
| Potato leaves | 0 | NA |
| Cocoa | 622 | 622 |
| Coffee | 104 | 104 |
| Kola | 34 | 34 |
| Banana | 0 | NA |
| Groundnut | 0 | NA |
| Soya Beans | 0 | NA |
| Sesame(benie) | 0 | NA |
| Oil Palm | 441 | 441 |
| Cassava | 0 | NA |
| Yams | 0 | NA |
| Broad beans | 0 | NA |
| Other Cereal Crops | 0 | NA |
| Other Vegetable Crops | 0 | NA |
| Other Fruits and Nuts Crops | 41 | 41 |
| Other Oil Seeds Crops | 11 | 11 |
| Other Tuber/Root Crops | 0 | NA |
| Other Leguminous Crops | 0 | NA |
| Other Industrial Crops | 0 | NA |
| NA | 2 | 2 |
Local Suppressions
The table below shows, for each categorical key variable, the number (1st row) and the percentage (2nd row) of suppressed cells.
| | DISTRICT | URB_RUR | S2B_CROP_ID |
|---|---|---|---|
| Number of suppressions | 7 | 4 | 0 |
| Percentage | 0.558 | 0.319 | 0.000 |
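The suppression counts can be cross-checked against the frequency tables in the Data Utility section, on the assumption that each suppressed cell becomes one additional NA in that variable. The sketch below takes the NA counts (original, modified) from those tables and recomputes both rows of the suppression table; the variable names in the code are illustrative.

```python
n_obs = 1255

# NA counts per variable as (original, modified), from the frequency tables
na_counts = {
    "DISTRICT": (8, 15),
    "URB_RUR": (29, 33),
    "S2B_CROP_ID": (2, 2),
}

# Each suppression is assumed to add exactly one NA to the variable
suppressions = {var: mod - orig for var, (orig, mod) in na_counts.items()}

# Share of suppressed cells per variable, in percent of all observations
percentages = {var: round(100 * s / n_obs, 3) for var, s in suppressions.items()}

for var in na_counts:
    print(var, suppressions[var], percentages[var])
```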
Data Utility of Continuous Scaled Key Variables
Univariate summary of variable S2BQ2_Qty_sold_kg
| | Original | Modified | Difference |
|---|---|---|---|
| Min. | 0.0080 | 0.0 | 0.0080 |
| 1st Qu. | 65.0000 | 65.0 | 0.0000 |
| Median | 130.0000 | 130.0 | 0.0000 |
| Mean | 341.9813 | 179.5 | 162.4813 |
| 3rd Qu. | 260.0000 | 260.0 | 0.0000 |
| Max. | 32175.0000 | 466.2 | 31708.8000 |
Univariate summary of variable S2BQ3B
| | Original | Modified | Difference |
|---|---|---|---|
| Min. | 1.732350e-02 | 0.0 | 1.732350e-02 |
| 1st Qu. | 3.940000e+02 | 394.0 | 0.000000e+00 |
| Median | 1.000000e+03 | 1000.0 | 0.000000e+00 |
| Mean | 4.209047e+03 | 1950.0 | 2.259047e+03 |
| 3rd Qu. | 2.487862e+03 | 2487.9 | -3.816160e-02 |
| Max. | 8.211570e+05 | 8331.4 | 8.128256e+05 |
Information Loss Criteria
- Criteria IL1: 3550.843%
- Difference of Eigenvalues in modified data: -255.575% (0.00% in original data)
Boxplot of Differences
R-Code
Session-Info
About the R-Version
- Version: R version 4.5.2 (2025-10-31 ucrt)
- Platform: x86_64-w64-mingw32
Locales
LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8
Attached base packages
stats | graphics | grDevices | utils | datasets | methods | base
Other attached packages
labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)
Packages loaded via Namespace (but not attached)
tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)
Disclaimer
R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).
Please use the issue tracker on GitHub to report any issues.
This report was generated on Wed, 01/04/2026 at 21:25:18.