SDC-Report_v1

Input Data

The data set consists of 45990 observations

Information on selected important (key) variables

  • Categorical key variable(s): DISTRICT | URB_RUR | S3A_PRODUCT_ID
  • Continuous key variable(s): S3AQ4C
  • Weight variable: HH_Final_weight
  • householdID: HOUSEHOLD_ID
  • strataVariable(s): not defined

Modifications

  • Modifications on categorical key variables: FALSE
  • Modifications on continuous key variables: TRUE
  • Modifications using PRAM: FALSE
  • Local suppressions: TRUE

Disclosure risk:

Frequency Analysis for Categorical Key Variables

Number of observations violating

  • 2-Anonymity: 0 (original dataset: 0)
  • 3-Anonymity: 0 (original dataset: 0)

Percentage of observations violating

  • 2-Anonymity: 0.000% (original dataset: 0.000%)
  • 3-Anonymity: 0.000% (original dataset: 0.000%)
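
As a rough illustration of what the counts above measure: an observation violates k-anonymity when its combination of categorical key values occurs fewer than k times in the data. The sketch below uses a small hypothetical set of records (not taken from this data set) and ignores sampling weights and missing values, which sdcMicro handles with its own rules.

```python
from collections import Counter

def count_k_violations(records, k):
    """Count records whose key combination occurs fewer than k times."""
    freq = Counter(records)
    return sum(1 for r in records if freq[r] < k)

# Hypothetical toy records: (DISTRICT, URB_RUR, S3A_PRODUCT_ID)
records = [
    ("BO", "Urban", "Palm wine"),
    ("BO", "Urban", "Palm wine"),
    ("BO", "Urban", "Palm wine"),
    ("KONO", "Rural", "Rice husk"),
    ("KONO", "Rural", "Rice husk"),
    ("KAMBIA", "Rural", "Rice straw"),
]

violations_2 = count_k_violations(records, 2)  # only the single KAMBIA record
violations_3 = count_k_violations(records, 3)  # KAMBIA plus both KONO records
print(violations_2, violations_3)
```

In the report, both counts are 0, meaning every key combination appears at least three times.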

Disclosure Risk for Categorical Variables

Expected Percentage of Reidentifications:

  • modified data: 0.001% (~ 0.642 observations)
  • original data: 0.001% (~ 0.642 observations)
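
The percentage above can be reproduced from the two figures the report gives, assuming it is simply the expected number of re-identifications divided by the number of observations. A minimal check:

```python
# Figures taken from the report
n_obs = 45990            # observations in the data set
expected_reids = 0.642   # expected number of re-identifications

pct = 100 * expected_reids / n_obs
print(f"{pct:.3f}%")
```

The unrounded value is about 0.0014%, which the report displays with three decimals.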

10 combinations of categories with highest risks

DISTRICT  URB_RUR  S3A_PRODUCT_ID       risk   fk  Fk         hier_risk
MOYAMBA   Urban    Cassava starch       0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Palm kernal chaffs   0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Rice straw           0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Rice husk            0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Palm kernal Nut oil  0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Palm wine            0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Palm kernal shell    0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Cacao shell          0.000  43  8380.417   0.0010988
MOYAMBA   Urban    Cacao liquid         0.000  43  8380.417   0.0010988
PUJEHUN   Urban    Cassava starch       0.000  42  11617.421  0.0007932

Disclosure Risk Continuous Scaled Variables

The (distance-based) disclosure risk for continuous key variables is between 0.000% and 99.972% in the modified data.

In the original data, the risk is assumed to be approximately 100.000%.
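
A distance-based risk of this kind can be sketched as follows: build a small interval around each modified value and count how often the original value still falls inside it. This is only an illustration under stated assumptions. The interval width (k times the standard deviation of the modified values), the value of k, and the toy data are all hypothetical, and the sketch is not sdcMicro's exact implementation.

```python
import statistics

def interval_risk(original, modified, k=0.01):
    """Share of records whose original value lies within
    +/- k * sd(modified) of the corresponding modified value."""
    half_width = k * statistics.stdev(modified)
    hits = sum(
        1 for x, xm in zip(original, modified)
        if abs(x - xm) <= half_width
    )
    return hits / len(original)

# Hypothetical values: only the largest one was changed
original = [1.0, 79.0, 200.0, 600.0, 129836.0]
modified = [1.0, 79.0, 200.0, 600.0, 6187.5]

risk = interval_risk(original, modified)
print(risk)
```

Unchanged values always fall inside their own interval, which is why the upper bound stays high when only a few extreme values are modified.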

Hierarchical risk

  • modified data: 5.777 (0.013%)
  • original data: 5.777 (0.013%)
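
The percentage in parentheses is consistent with dividing the expected number of re-identifications under the hierarchical (household) structure by the number of observations. The sketch below checks that arithmetic and, as an assumption not stated in the report, illustrates the commonly used household risk formula, 1 minus the product of (1 - individual risk) over household members, on hypothetical values.

```python
import math

def household_risk(individual_risks):
    """Probability that at least one household member is re-identified,
    assuming independent individual risks (illustrative formula)."""
    return 1.0 - math.prod(1.0 - r for r in individual_risks)

# Figures taken from the report
n_obs = 45990
expected_hier = 5.777
pct = 100 * expected_hier / n_obs
print(f"{pct:.3f}%")

# Hypothetical two-member household
hh = household_risk([0.1, 0.2])
print(hh)
```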

Data Utility

Frequencies Categorical Key Variables:

Variable: DISTRICT

Categories     Original data  Modified data
KAILAHUN       6624           6624
KENEMA         5751           5751
KONO           5697           5697
BOMBALI        1593           1593
FALABA         1332           1332
KOINADUGU      765            765
TONKOLILI      4518           4518
KAMBIA         2916           2916
KARENE         1395           1395
PORTLOKO       4149           4149
BO             3933           3933
BONTHE         990            990
MOYAMBA        2673           2673
PUJEHUN        2556           2556
WESTERN RURAL  612            612
WESTERN URBAN  0              NA
NA             486            486

Variable: URB_RUR

Categories  Original data  Modified data
Rural       37260          37260
Urban       7272           7272
NA          1458           1458

Variable: S3A_PRODUCT_ID

Categories           Original data  Modified data
Cacao liquid         5110           5110
Cacao shell          5110           5110
Cassava starch       5110           5110
Palm kernal chaffs   5110           5110
Palm kernal Nut oil  5110           5110
Palm kernal shell    5110           5110
Palm wine            5110           5110
Rice husk            5110           5110
Rice straw           5110           5110
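
As a quick consistency check, the original-data counts of each categorical key variable should add up to the 45990 observations reported under Input Data. A minimal sketch using the counts from the three frequency tables:

```python
# Original-data counts copied from the frequency tables
district = {
    "KAILAHUN": 6624, "KENEMA": 5751, "KONO": 5697, "BOMBALI": 1593,
    "FALABA": 1332, "KOINADUGU": 765, "TONKOLILI": 4518, "KAMBIA": 2916,
    "KARENE": 1395, "PORTLOKO": 4149, "BO": 3933, "BONTHE": 990,
    "MOYAMBA": 2673, "PUJEHUN": 2556, "WESTERN RURAL": 612,
    "WESTERN URBAN": 0, "NA": 486,
}
urb_rur = {"Rural": 37260, "Urban": 7272, "NA": 1458}
product_count = 9          # nine product categories
per_product = 5110         # each with the same count

totals = {
    "DISTRICT": sum(district.values()),
    "URB_RUR": sum(urb_rur.values()),
    "S3A_PRODUCT_ID": product_count * per_product,
}
print(totals)
```

All three totals match the number of observations, so the frequency tables account for every record, including missing values.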

Local Suppressions

The table below shows, for each categorical key variable, the number (1st row) and the percentage (2nd row) of suppressed cells.

                        DISTRICT  URB_RUR  S3A_PRODUCT_ID
Number of suppressions  0         0        0
Percentage              0.000     0.000    0.000

Data Utility of Continuous Scaled Key Variables

Univariate summary of variable S3AQ4C

Statistic  Original      Modified  Difference
Min.       1.00000       1.0       0.000000e+00
1st Qu.    79.45208      79.5      -4.791670e-02
Median     200.00000     200.0     0.000000e+00
Mean       2744.22640    798.2     1.946026e+03
3rd Qu.    600.00000     600.0     0.000000e+00
Max.       129836.06557  6187.5    1.236486e+05
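
The Difference column is Original minus Modified. The sketch below recomputes it from the displayed values. Small deviations from the reported differences are expected because the Original and Modified columns are shown rounded.

```python
# Values copied from the summary table (as displayed, i.e. rounded)
original = {
    "Min.": 1.0, "1st Qu.": 79.45208, "Median": 200.0,
    "Mean": 2744.22640, "3rd Qu.": 600.0, "Max.": 129836.06557,
}
modified = {
    "Min.": 1.0, "1st Qu.": 79.5, "Median": 200.0,
    "Mean": 798.2, "3rd Qu.": 600.0, "Max.": 6187.5,
}

diff = {name: original[name] - modified[name] for name in original}
for name, value in diff.items():
    print(f"{name:8s} {value: .6e}")
```

The minimum, median and third quartile are unchanged, while the mean and maximum drop sharply, so the modification affected mainly the upper tail of S3AQ4C.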

Information Loss Criteria

  • Criterion IL1: 933.657%
  • Difference of Eigenvalues in modified data: 0.000% (0.00% in original data)
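
For orientation, an IL1-type measure sums the absolute differences between original and modified values, scaled so that variables on different scales are comparable. The sketch below uses one published variant, scaling by the square root of 2 times the standard deviation of the original variable. The report does not state which scaling or normalisation produced the figure above, so this is an assumption, and the data are hypothetical.

```python
import math
import statistics

def il1_scaled(original, modified):
    """Sum of absolute differences, scaled by sqrt(2) * sd(original).
    Illustrative variant; not necessarily the formula behind the report."""
    sd = statistics.stdev(original)
    total = sum(abs(x - xm) for x, xm in zip(original, modified))
    return total / (math.sqrt(2) * sd)

# Hypothetical values: only the largest one was changed
original = [1.0, 2.0, 3.0, 4.0, 100.0]
modified = [1.0, 2.0, 3.0, 4.0, 10.0]

loss = il1_scaled(original, modified)
print(loss)
```

The measure is 0 when nothing is changed and grows with the size of the modifications, so a large value such as the one reported points to substantial changes in at least part of the distribution.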

Boxplot of Differences (figure not reproduced in this text version)

R-Code

Session-Info

About the R-Version

  • Version: R version 4.5.2 (2025-10-31 ucrt)
  • Platform: x86_64-w64-mingw32

Locales

LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8

Attached base packages

stats | graphics | grDevices | utils | datasets | methods | base

Other attached packages

labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)

Packages loaded via Namespace (but not attached)

tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)

Disclaimer

R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).

Please use the issue tracker on GitHub to report any issues.

This report was generated on Tue, 07/04/2026 at 23:03:25.