Input Data
The data set consists of 4695 observations.
Information on selected important (key) variables
- Categorical key variable(s): DISTRICT | URB_RUR | S6A_ITEM_NAME
- Continuous key variable(s): Total_qty_collected_kg | Total_qty_consumed_kg | Total_qty_sold_kg | Value_qty_sold | Value_qty_collected
- Weight variable: HH_Final_weight
- householdID: HOUSEHOLD_ID
- strataVariable(s): not defined
Modifications
- Modifications on categorical key variables: FALSE
- Modifications on continuous key variables: TRUE
- Modifications using PRAM: FALSE
- Local suppressions: TRUE
Disclosure risk:
Frequency Analysis for Categorical Key Variables
Number of observations violating
- 2-Anonymity: 0 (original dataset: 0)
- 3-Anonymity: 0 (original dataset: 0)
Percentage of observations violating
- 2-Anonymity: 0.000% (original dataset: 0.000%)
- 3-Anonymity: 0.000% (original dataset: 0.000%)
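The counts above can be illustrated with a minimal sketch of a k-anonymity check: a record violates k-anonymity when its combination of categorical key variables occurs fewer than k times in the sample. The function name and the toy records below are hypothetical, and the sketch ignores missing values (NA), which sdcMicro may treat differently.

```python
from collections import Counter

def count_k_anonymity_violations(records, key_vars, k):
    """Count records whose key-variable combination occurs fewer than k times."""
    keys = [tuple(rec[v] for v in key_vars) for rec in records]
    freq = Counter(keys)  # sample frequency (fk) of each combination
    return sum(1 for key in keys if freq[key] < k)

# Hypothetical toy records; not taken from the survey file.
toy = [
    {"DISTRICT": "BO", "URB_RUR": "Rural", "S6A_ITEM_NAME": "Fishes"},
    {"DISTRICT": "BO", "URB_RUR": "Rural", "S6A_ITEM_NAME": "Fishes"},
    {"DISTRICT": "BO", "URB_RUR": "Rural", "S6A_ITEM_NAME": "Fishes"},
    {"DISTRICT": "KONO", "URB_RUR": "Urban", "S6A_ITEM_NAME": "Fishes"},
    {"DISTRICT": "KONO", "URB_RUR": "Urban", "S6A_ITEM_NAME": "Fishes"},
]
key_vars = ["DISTRICT", "URB_RUR", "S6A_ITEM_NAME"]

violations_2 = count_k_anonymity_violations(toy, key_vars, 2)
violations_3 = count_k_anonymity_violations(toy, key_vars, 3)
print(violations_2, violations_3)
```

In the toy data every combination occurs at least twice, so no record violates 2-anonymity, while the two KONO records fall short of 3-anonymity.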
Disclosure Risk for Categorical Variables
Expected Percentage of Reidentifications:
- modified data: 0.005% (~ 0.213 observations)
- original data: 0.005% (~ 0.213 observations)
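The percentage and the absolute figure are two views of the same quantity: the expected number of re-identifications divided by the number of observations. A short check using only the values reported here (the variable names are illustrative):

```python
n_obs = 4695            # observations in the data set
expected_reids = 0.213  # expected re-identifications reported above

pct = 100 * expected_reids / n_obs
print(f"{pct:.3f}%")
```

Rounded to three decimals, this gives the 0.005% shown for both the original and the modified data.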
10 combinations of categories with highest risks
| DISTRICT | URB_RUR | S6A_ITEM_NAME | risk | fk | Fk | hier_risk |
|---|---|---|---|---|---|---|
| KAMBIA | Urban | Fishes | 0.003 | 3 | 589.5226 | 0.0075946 |
| KAMBIA | Urban | Crustaceans(Crabs & Shrims) | 0.003 | 3 | 589.5226 | 0.0075946 |
| KAMBIA | Urban | Clams/Mollusks (Snales) | 0.003 | 3 | 589.5226 | 0.0075946 |
| PUJEHUN | Urban | Fishes | 0.001 | 5 | 1332.0859 | 0.0028099 |
| PUJEHUN | Urban | Crustaceans(Crabs & Shrims) | 0.001 | 5 | 1332.0859 | 0.0028099 |
| PUJEHUN | Urban | Clams/Mollusks (Snales) | 0.001 | 5 | 1332.0859 | 0.0028099 |
| FALABA | Urban | Fishes | 0.000 | 18 | 2167.0934 | 0.0014643 |
| FALABA | Urban | Crustaceans(Crabs & Shrims) | 0.000 | 18 | 2167.0934 | 0.0014643 |
| FALABA | Urban | Clams/Mollusks (Snales) | 0.000 | 18 | 2167.0934 | 0.0014643 |
| TONKOLILI | Urban | Fishes | 0.000 | 19 | 3445.3788 | 0.0009185 |
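One common way to aggregate record-level risks to a household is the probability that at least one of its records is re-identified, 1 - prod(1 - r_i). The sketch below assumes that this is what the hier_risk column reports, and further assumes that each household contributes three records with equal risk (one per item category). Both are assumptions, not statements from the report, and the function names are hypothetical. Under these assumptions, the per-record risk implied by hier_risk can be compared with the rounded risk column.

```python
def combined_household_risk(record_risks):
    """Probability that at least one record in the household is re-identified."""
    not_identified = 1.0
    for r in record_risks:
        not_identified *= 1.0 - r
    return 1.0 - not_identified

def per_record_risk(household_risk, n_records):
    """Invert r_h = 1 - (1 - r)**n for n records sharing the same risk r."""
    return 1.0 - (1.0 - household_risk) ** (1.0 / n_records)

RECORDS_PER_HOUSEHOLD = 3  # assumption: one row per item category

# hier_risk values copied from the table above
reported = {"KAMBIA": 0.0075946, "PUJEHUN": 0.0028099, "FALABA": 0.0014643}

implied = {
    district: per_record_risk(h, RECORDS_PER_HOUSEHOLD)
    for district, h in reported.items()
}
for district, r in implied.items():
    print(district, f"{r:.3f}")
```

The implied values round to 0.003, 0.001 and 0.000, in line with the risk column, which supports the assumptions without confirming them.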
Disclosure Risk Continuous Scaled Variables
The (distance-based) disclosure risk for continuous key variables is between 0.000% and 97.359% in the modified data.
In the original data, the risk is assumed to be approximately 100.000%.
Hierarchical risk
- modified data: 0.638 (0.014%)
- original data: 0.638 (0.014%)
Data Utility
Frequencies Categorical Key Variables:
Variable: DISTRICT
| Categories | Original data | Modified data |
|---|---|---|
| KAILAHUN | 885 | 885 |
| KENEMA | 672 | 672 |
| KONO | 768 | 768 |
| BOMBALI | 150 | 150 |
| FALABA | 234 | 234 |
| KOINADUGU | 90 | 90 |
| TONKOLILI | 375 | 375 |
| KAMBIA | 42 | 42 |
| KARENE | 60 | 60 |
| PORTLOKO | 291 | 291 |
| BO | 423 | 423 |
| BONTHE | 186 | 186 |
| MOYAMBA | 294 | 294 |
| PUJEHUN | 156 | 156 |
| WESTERN RURAL | 36 | 36 |
| WESTERN URBAN | 0 | NA |
| NA | 33 | 33 |
Variable: URB_RUR
| Categories | Original data | Modified data |
|---|---|---|
| Rural | 4083 | 4083 |
| Urban | 483 | 483 |
| NA | 129 | 129 |
Variable: S6A_ITEM_NAME
| Categories | Original data | Modified data |
|---|---|---|
| Clams/Mollusks (Snales) | 1565 | 1565 |
| Crustaceans(Crabs & Shrims) | 1565 | 1565 |
| Fishes | 1565 | 1565 |
Local Suppressions
The table below shows for each categorical key variable the number (1st row) and the percentages (2nd row) of suppressed cells.
| | DISTRICT | URB_RUR | S6A_ITEM_NAME |
|---|---|---|---|
| Number of suppressions | 0 | 0 | 0 |
| Percentage | 0.000 | 0.000 | 0.000 |
Data Utility of Continuous Scaled Key Variables
Univariate summary of variable Total_qty_collected_kg
| | Original | Modified | Difference |
|---|---|---|---|
| Min. | 9.000 | 9.0 | 0.000 |
| 1st Qu. | 130.440 | 130.4 | 0.040 |
| Median | 245.760 | 184.3 | 61.460 |
| Mean | 1271.294 | 142.5 | 1128.794 |
| 3rd Qu. | 986.880 | 184.3 | 802.580 |
| Max. | 17920.000 | 184.3 | 17735.700 |
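The Difference column in these summaries is the original value minus the modified value. The sketch below recomputes it for Total_qty_collected_kg from the figures in the table and also expresses the change in the mean as a share of the original mean. The variable names are illustrative.

```python
# (original, modified, reported difference) for Total_qty_collected_kg
rows = {
    "Min.": (9.000, 9.0, 0.000),
    "1st Qu.": (130.440, 130.4, 0.040),
    "Median": (245.760, 184.3, 61.460),
    "Mean": (1271.294, 142.5, 1128.794),
    "3rd Qu.": (986.880, 184.3, 802.580),
    "Max.": (17920.000, 184.3, 17735.700),
}

# Recompute each difference and compare it with the reported one.
recomputed = {name: orig - mod for name, (orig, mod, _) in rows.items()}
all_match = all(
    abs(recomputed[name] - reported) < 1e-6
    for name, (_, _, reported) in rows.items()
)

# Relative reduction of the mean, in percent of the original mean.
orig_mean, mod_mean, _ = rows["Mean"]
mean_reduction_pct = 100 * (orig_mean - mod_mean) / orig_mean

print(all_match, f"{mean_reduction_pct:.1f}%")
```

Reading the difference relative to the original value makes it easier to compare variables measured on different scales, such as quantities in kg and sales values.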
Univariate summary of variable Total_qty_consumed_kg
| | Original | Modified | Difference |
|---|---|---|---|
| Min. | 3.0000 | 3.0 | 0.0000 |
| 1st Qu. | 19.3200 | 19.3 | 0.0200 |
| Median | 60.0000 | 60.0 | 0.0000 |
| Mean | 350.6835 | 65.7 | 284.9835 |
| 3rd Qu. | 353.2800 | 106.2 | 247.0800 |
| Max. | 4480.0000 | 106.2 | 4373.8000 |
Univariate summary of variable Total_qty_sold_kg
| | Original | Modified | Difference |
|---|---|---|---|
| Min. | 3.0000 | 3.0 | 0.0000 |
| 1st Qu. | 89.0000 | 89.0 | 0.0000 |
| Median | 144.0000 | 144.0 | 0.0000 |
| Mean | 919.5426 | 185.5 | 734.0426 |
| 3rd Qu. | 583.6800 | 311.0 | 272.6800 |
| Max. | 13440.0000 | 311.0 | 13129.0000 |
Univariate summary of variable Value_qty_sold
| | Original | Modified | Difference |
|---|---|---|---|
| Min. | 150.00 | 150.0 | 0.00000 |
| 1st Qu. | 1080.00 | 1080.0 | 0.00000 |
| Median | 3000.00 | 3000.0 | 0.00000 |
| Mean | 32855.48 | 32769.4 | 86.07826 |
| 3rd Qu. | 18640.00 | 18640.0 | 0.00000 |
| Max. | 225000.00 | 223020.0 | 1980.00000 |
Univariate summary of variable Value_qty_collected
| | Original | Modified | Difference |
|---|---|---|---|
| Min. | 300.00 | 300.0 | 0.0000 |
| 1st Qu. | 1920.00 | 1920.0 | 0.0000 |
| Median | 4500.00 | 4500.0 | 0.0000 |
| Mean | 45522.24 | 44979.9 | 542.3406 |
| 3rd Qu. | 28960.00 | 28960.0 | 0.0000 |
| Max. | 337500.00 | 325026.0 | 12474.0000 |
Information Loss Criteria
- Criterion IL1: 11826.112%
- Difference of Eigenvalues in modified data: 307.676% (0.00% in original data)
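For orientation, the sketch below shows one textbook variant of an IL1-style information-loss measure: the absolute differences between original and modified values, scaled by sqrt(2) times the standard deviation of the original variable and summed. This formula is an assumption made for illustration. The exact scaling behind the percentage reported above is not stated in this report and may differ. The function name and the toy values are hypothetical.

```python
import math
import statistics

def il1_style_loss(original, modified):
    """Sum of |x - x'| scaled by sqrt(2) * sd(original), for one variable.

    Assumed textbook variant; not necessarily the formula behind the report.
    """
    if len(original) != len(modified):
        raise ValueError("original and modified must have the same length")
    scale = math.sqrt(2) * statistics.stdev(original)  # sample standard deviation
    return sum(abs(x - y) for x, y in zip(original, modified)) / scale

# Hypothetical toy values: the two largest observations are capped at 3.
toy_original = [1.0, 2.0, 3.0, 4.0, 5.0]
toy_modified = [1.0, 2.0, 3.0, 3.0, 3.0]

loss = il1_style_loss(toy_original, toy_modified)
unchanged = il1_style_loss(toy_original, toy_original)
print(loss, unchanged)
```

A measure of this kind is zero when the data are unchanged and grows with the size of the modifications relative to the spread of the original variable.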
Boxplot of Differences
R-Code
Session-Info
About the R-Version
- Version: R version 4.5.2 (2025-10-31 ucrt)
- Platform: x86_64-w64-mingw32
Locales
LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8
Attached base packages
stats | graphics | grDevices | utils | datasets | methods | base
Other attached packages
labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)
Packages loaded via Namespace (but not attached)
tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)
Disclaimer
R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).
Please use the issue tracker on GitHub to report any issues.
This report was generated on Wed, 08/04/2026 at 23:33:29.