Input Data
The data set consists of 4973 observations
Information on selected important (key) variables
- Categorical key variable(s): DISTRICT | URB_RUR | S0BQ3 | S5AQ1 | S6AQ1 | S7AQ1 | S0AQ15A | S0AQ16A
- Continuous key variable(s): S0BQ2
- Weight variable: HH_Final_weight
- householdID: HOUSEHOLD_ID
- strataVariable(s): not defined
Modifications
- Modifications on categorical key variables: TRUE
- Modifications on continuous key variables: FALSE
- Modifications using PRAM: FALSE
- Local suppressions: TRUE
Disclosure risk:
Frequency Analysis for Categorical Key Variables
Number of observations violating
- 2-Anonymity: 0 (original dataset: 103)
- 3-Anonymity: 0 (original dataset: 171)
Percentage of observations violating
- 2-Anonymity: 0.000% (original dataset: 2.071%)
- 3-Anonymity: 0.000% (original dataset: 3.439%)
Disclosure Risk for Categorical Variables
Expected Percentage of Reidentifications:
- modified data: 0.018% (~ 0.875 observations)
- original data: 0.088% (~ 4.371 observations)
10 combinations of categories with highest risks
| DISTRICT | URB_RUR | S0BQ3 | S5AQ1 | S6AQ1 | S7AQ1 | S0AQ15A | S0AQ16A | risk | fk | Fk | hier_risk |
|---|---|---|---|---|---|---|---|---|---|---|---|
| KAILAHUN | Rural | PARTNERSHIP/MULTIPLE HOLDING | NO | YES | YES | YES | NO | 0.008 | 3 | 183.7653 | 0.0080965 |
| KAILAHUN | NA | SINGLE/SOLE PROPRIETORSHIP | NO | YES | YES | NO | YES | 0.007 | 3 | 220.6618 | 0.0067518 |
| KAILAHUN | Rural | SINGLE/SOLE PROPRIETORSHIP | NO | NO | NA | NO | YES | 0.005 | 3 | 276.4649 | 0.0053964 |
| KAMBIA | NA | SINGLE/SOLE PROPRIETORSHIP | NO | YES | 0 | YES | NO | 0.005 | 3 | 281.6336 | 0.0052979 |
| MOYAMBA | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | 0 | YES | YES | 0.004 | 5 | 315.4564 | 0.0039469 |
| KAMBIA | NA | SINGLE/SOLE PROPRIETORSHIP | YES | NO | 0 | YES | YES | 0.003 | 3 | 460.0665 | 0.0032498 |
| KAILAHUN | Rural | SINGLE/SOLE PROPRIETORSHIP | NO | NA | YES | NO | YES | 0.003 | 5 | 456.7708 | 0.0027291 |
| PUJEHUN | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | 0 | YES | NO | 0.003 | 5 | 465.3433 | 0.0026790 |
| FALABA | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | YES | YES | YES | 0.002 | 5 | 522.5204 | 0.0023865 |
| KAMBIA | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | YES | YES | NO | 0.002 | 4 | 670.6210 | 0.0019843 |
Disclosure Risk Continuous Scaled Variables
The (distance-based) disclosure risk for continous key variables is between 0.000% and 100.000% in the modified data.
In the original data, the risk is assumed to be approximately 100.000%.
Hierarchical risk
- modified data: 0.875 (0.018%)
- original data: 4.371 (0.088%)
Data Utility
Frequencies Categorical Key Variables:
Variable: DISTRICT
| Categories | Original data | Modified data |
|---|---|---|
| KAILAHUN | 719 | 719 |
| KENEMA | 637 | 637 |
| KONO | 608 | 608 |
| BOMBALI | 177 | 177 |
| FALABA | 149 | 149 |
| KOINADUGU | 89 | 89 |
| TONKOLILI | 502 | 502 |
| KAMBIA | 320 | 320 |
| KARENE | 157 | 157 |
| PORTLOKO | 451 | 451 |
| BO | 408 | 408 |
| BONTHE | 118 | 118 |
| MOYAMBA | 296 | 296 |
| PUJEHUN | 286 | 286 |
| WESTERN RURAL | 56 | 56 |
| WESTERN URBAN | 0 | NA |
Variable: URB_RUR
| Categories | Original data | Modified data |
|---|---|---|
| Rural | 4115 | 4107 |
| Urban | 858 | 818 |
| NA | NA | 48 |
Variable: S0BQ3
| Categories | Original data | Modified data |
|---|---|---|
| SINGLE HOLDING | 4852 | NA |
| COOPERATIVES | 19 | 7 |
| PARTNERSHIP | 74 | NA |
| MULTIPLE HOLDING | 8 | NA |
| GOVERNMENT | 1 | NA |
| SOLE PROPRIETORSHIP | 19 | NA |
| OTHERS(SPECIFY) | 0 | NA |
| SINGLE/SOLE PROPRIETORSHIP | NA | 4871 |
| PARTNERSHIP/MULTIPLE HOLDING | NA | 49 |
| NA | NA | 46 |
Variable: S5AQ1
| Categories | Original data | Modified data |
|---|---|---|
| YES | 34 | 6 |
| NO | 4939 | 4937 |
| NA | NA | 30 |
Variable: S6AQ1
| Categories | Original data | Modified data |
|---|---|---|
| YES | 1498 | 1486 |
| NO | 3475 | 3469 |
| NA | NA | 18 |
Variable: S7AQ1
| Categories | Original data | Modified data |
|---|---|---|
| 0 | 2795 | 2792 |
| YES | 2178 | 2177 |
| NO | 0 | NA |
| NA | NA | 4 |
Variable: S0AQ15A
| Categories | Original data | Modified data |
|---|---|---|
| YES | 4927 | 4927 |
| NO | 46 | 26 |
| NA | NA | 20 |
Variable: S0AQ16A
| Categories | Original data | Modified data |
|---|---|---|
| NO | 1916 | 1916 |
| YES | 3057 | 3057 |
Local Suppressions
The table below shows for each categorical key variable the number (1st row) and the percentages (2nd row) of suppressed cells.
| DISTRICT | URB_RUR | S0BQ3 | S5AQ1 | S6AQ1 | S7AQ1 | S0AQ15A | S0AQ16A | |
|---|---|---|---|---|---|---|---|---|
| Number of Suppression | 0 | 48 | 46 | 30 | 18 | 4 | 20 | 0 |
| Percentage | 0.000 | 0.965 | 0.925 | 0.603 | 0.362 | 0.080 | 0.402 | 0.000 |
Data Utility of Continuous Scaled Key Variables
Univariate summary of variable S0BQ2
| Original | Modified | Difference | |
|---|---|---|---|
| Min. | 0.0000000 | 0 | 0.0000000 |
| 1st Qu. | 0.0000000 | 0 | 0.0000000 |
| Median | 0.0000000 | 0 | 0.0000000 |
| Mean | 0.0323748 | 0 | 0.0323748 |
| 3rd Qu. | 0.0000000 | 0 | 0.0000000 |
| Max. | 3.0000000 | 3 | 0.0000000 |
Information Loss Criteria
- Criteria IL1: 0.000%
- Difference of Eigenvalues in modified data: 0.000% (0.00% in original data)
Boxplot of Differences
R-Code
Session-Info
About the R-Version
- Version: R version 4.5.2 (2025-10-31 ucrt)
- Platform: x86_64-w64-mingw32
Locales
LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8
Attached base packages
stats | graphics | grDevices | utils | datasets | methods | base
Other attached packages
labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)
Packages loaded via Namespace (but not attached)
tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)
Disclaimer
R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).
Please use the issue-tracker on github to report any issues:
This report was generated on Thu, 09/04/2026 at 11:38:27.