SDC-Report_v1

Input Data

The data set consists of 4973 observations.

Information on selected important (key) variables

  • Categorical key variable(s): DISTRICT | URB_RUR | S0BQ3 | S5AQ1 | S6AQ1 | S7AQ1 | S0AQ15A | S0AQ16A
  • Continuous key variable(s): S0BQ2
  • Weight variable: HH_Final_weight
  • householdID: HOUSEHOLD_ID
  • strataVariable(s): not defined

Modifications

  • Modifications on categorical key variables: TRUE
  • Modifications on continuous key variables: FALSE
  • Modifications using PRAM: FALSE
  • Local suppressions: TRUE

Disclosure Risk

Frequency Analysis for Categorical Key Variables

Number of observations violating

  • 2-Anonymity: 0 (original dataset: 103)
  • 3-Anonymity: 0 (original dataset: 171)

Percentage of observations violating

  • 2-Anonymity: 0.000% (original dataset: 2.071%)
  • 3-Anonymity: 0.000% (original dataset: 3.439%)
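The counts and percentages above are linked by division by the 4973 observations. The following is a minimal Python sketch, assuming that a record violates k-anonymity when its combination of categorical key values occurs fewer than k times in the sample. The function name and the toy records are hypothetical and are not taken from the report or from sdcMicro.

```python
from collections import Counter

def count_k_violations(records, k):
    """Count records whose key combination occurs fewer than k times."""
    freq = Counter(records)
    return sum(1 for r in records if freq[r] < k)

# Hypothetical toy records (DISTRICT, URB_RUR), for illustration only.
toy = [
    ("BO", "Rural"), ("BO", "Rural"), ("BO", "Urban"),
    ("KONO", "Rural"), ("KONO", "Rural"), ("KONO", "Rural"),
]

violations_2 = count_k_violations(toy, 2)  # the single ("BO", "Urban") record
violations_3 = count_k_violations(toy, 3)  # additionally the two ("BO", "Rural") records

# Recompute the percentages reported for the original dataset.
N_OBS = 4973
pct_2 = round(100 * 103 / N_OBS, 3)
pct_3 = round(100 * 171 / N_OBS, 3)
print(pct_2, pct_3)
```

The last two values agree with the 2.071% and 3.439% listed for the original dataset.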

Disclosure Risk for Categorical Variables

Expected Percentage of Reidentifications:

  • modified data: 0.018% (~ 0.875 observations)
  • original data: 0.088% (~ 4.371 observations)
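The percentages above follow from dividing the expected number of re-identifications by the 4973 observations. The sketch below assumes that the expected number is the sum of the per-record risks; the helper name and the toy risk values are hypothetical illustrations, not output of sdcMicro.

```python
N_OBS = 4973  # number of observations stated in the report

def expected_reid_pct(expected_count, n_obs):
    """Express an expected number of re-identifications as a percentage."""
    return round(100 * expected_count / n_obs, 3)

pct_modified = expected_reid_pct(0.875, N_OBS)
pct_original = expected_reid_pct(4.371, N_OBS)

# Assumption: the expected count is the sum of individual record risks.
# Hypothetical toy risks, for illustration only.
toy_risks = [0.008, 0.007, 0.005]
toy_expected = sum(toy_risks)

print(pct_modified, pct_original)
```

Both results match the rounded percentages given in the report (0.018% and 0.088%).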

10 combinations of categories with highest risks

DISTRICT | URB_RUR | S0BQ3 | S5AQ1 | S6AQ1 | S7AQ1 | S0AQ15A | S0AQ16A | risk | fk | Fk | hier_risk
KAILAHUN | Rural | PARTNERSHIP/MULTIPLE HOLDING | NO | YES | YES | YES | NO | 0.008 | 3 | 183.7653 | 0.0080965
KAILAHUN | NA | SINGLE/SOLE PROPRIETORSHIP | NO | YES | YES | NO | YES | 0.007 | 3 | 220.6618 | 0.0067518
KAILAHUN | Rural | SINGLE/SOLE PROPRIETORSHIP | NO | NO | NA | NO | YES | 0.005 | 3 | 276.4649 | 0.0053964
KAMBIA | NA | SINGLE/SOLE PROPRIETORSHIP | NO | YES | 0 | YES | NO | 0.005 | 3 | 281.6336 | 0.0052979
MOYAMBA | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | 0 | YES | YES | 0.004 | 5 | 315.4564 | 0.0039469
KAMBIA | NA | SINGLE/SOLE PROPRIETORSHIP | YES | NO | 0 | YES | YES | 0.003 | 3 | 460.0665 | 0.0032498
KAILAHUN | Rural | SINGLE/SOLE PROPRIETORSHIP | NO | NA | YES | NO | YES | 0.003 | 5 | 456.7708 | 0.0027291
PUJEHUN | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | 0 | YES | NO | 0.003 | 5 | 465.3433 | 0.0026790
FALABA | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | YES | YES | YES | 0.002 | 5 | 522.5204 | 0.0023865
KAMBIA | Urban | SINGLE/SOLE PROPRIETORSHIP | NO | NO | YES | YES | NO | 0.002 | 4 | 670.6210 | 0.0019843

Disclosure Risk Continuous Scaled Variables

The (distance-based) disclosure risk for continuous key variables is between 0.000% and 100.000% in the modified data.

In the original data, the risk is assumed to be approximately 100.000%.

Hierarchical risk

  • modified data: 0.875 (0.018%)
  • original data: 4.371 (0.088%)

Data Utility

Frequencies Categorical Key Variables:

Variable: DISTRICT

Categories | Original data | Modified data
KAILAHUN | 719 | 719
KENEMA | 637 | 637
KONO | 608 | 608
BOMBALI | 177 | 177
FALABA | 149 | 149
KOINADUGU | 89 | 89
TONKOLILI | 502 | 502
KAMBIA | 320 | 320
KARENE | 157 | 157
PORTLOKO | 451 | 451
BO | 408 | 408
BONTHE | 118 | 118
MOYAMBA | 296 | 296
PUJEHUN | 286 | 286
WESTERN RURAL | 56 | 56
WESTERN URBAN | 0 | NA

Variable: URB_RUR

Categories | Original data | Modified data
Rural | 4115 | 4107
Urban | 858 | 818
NA | NA | 48

Variable: S0BQ3

Categories | Original data | Modified data
SINGLE HOLDING | 4852 | NA
COOPERATIVES | 19 | 7
PARTNERSHIP | 74 | NA
MULTIPLE HOLDING | 8 | NA
GOVERNMENT | 1 | NA
SOLE PROPRIETORSHIP | 19 | NA
OTHERS(SPECIFY) | 0 | NA
SINGLE/SOLE PROPRIETORSHIP | NA | 4871
PARTNERSHIP/MULTIPLE HOLDING | NA | 49
NA | NA | 46

Variable: S5AQ1

Categories | Original data | Modified data
YES | 34 | 6
NO | 4939 | 4937
NA | NA | 30

Variable: S6AQ1

Categories | Original data | Modified data
YES | 1498 | 1486
NO | 3475 | 3469
NA | NA | 18

Variable: S7AQ1

Categories | Original data | Modified data
0 | 2795 | 2792
YES | 2178 | 2177
NO | 0 | NA
NA | NA | 4

Variable: S0AQ15A

Categories | Original data | Modified data
YES | 4927 | 4927
NO | 46 | 26
NA | NA | 20

Variable: S0AQ16A

Categories | Original data | Modified data
NO | 1916 | 1916
YES | 3057 | 3057

Local Suppressions

The table below shows for each categorical key variable the number (1st row) and the percentages (2nd row) of suppressed cells.

Variable | DISTRICT | URB_RUR | S0BQ3 | S5AQ1 | S6AQ1 | S7AQ1 | S0AQ15A | S0AQ16A
Number of suppressions | 0 | 48 | 46 | 30 | 18 | 4 | 20 | 0
Percentage | 0.000 | 0.965 | 0.925 | 0.603 | 0.362 | 0.080 | 0.402 | 0.000
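Each percentage in the table above is the number of suppressed cells divided by the 4973 observations. A short Python sketch of that arithmetic, using the counts from the table (the variable names are illustrative only):

```python
N_OBS = 4973  # number of observations stated in the report

# Suppressed cells per categorical key variable, as listed in the table.
suppressed = {
    "DISTRICT": 0, "URB_RUR": 48, "S0BQ3": 46, "S5AQ1": 30,
    "S6AQ1": 18, "S7AQ1": 4, "S0AQ15A": 20, "S0AQ16A": 0,
}

# Percentage of suppressed cells per variable, rounded as in the report.
pct_suppressed = {var: round(100 * n / N_OBS, 3) for var, n in suppressed.items()}
total_suppressed = sum(suppressed.values())

for var, pct in pct_suppressed.items():
    print(f"{var}: {pct:.3f}%")
```

The suppression counts also equal the NA counts in the "Modified data" column of the frequency tables, for example 48 for URB_RUR.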

Data Utility of Continuous Scaled Key Variables

Univariate summary of variable S0BQ2

Statistic | Original | Modified | Difference
Min. | 0.0000000 | 0 | 0.0000000
1st Qu. | 0.0000000 | 0 | 0.0000000
Median | 0.0000000 | 0 | 0.0000000
Mean | 0.0323748 | 0 | 0.0323748
3rd Qu. | 0.0000000 | 0 | 0.0000000
Max. | 3.0000000 | 3 | 0.0000000

Information Loss Criteria

  • Criteria IL1: 0.000%
  • Difference of Eigenvalues in modified data: 0.000% (0.00% in original data)
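The report does not state how IL1 is computed. As a rough illustration, the sketch below assumes one common IL1-type definition: the sum of absolute differences between original and modified values, scaled by the square root of 2 times the standard deviation of the original variable. Under that assumption an unmodified variable gives 0, which is consistent with the 0.000% reported here, since no continuous key variable was modified. The function name and the toy values are hypothetical.

```python
import math
import statistics

def il1s(original, modified):
    """Assumed IL1-type measure: scaled sum of absolute deviations."""
    sd = statistics.stdev(original)
    total_abs_diff = sum(abs(x - y) for x, y in zip(original, modified))
    return total_abs_diff / (math.sqrt(2) * sd)

# Hypothetical toy values, for illustration only.
original = [0.0, 0.0, 0.0, 1.0, 3.0]
perturbed = [0.0, 0.0, 0.0, 1.0, 2.0]

loss_unmodified = il1s(original, original)   # no change, so no information loss
loss_perturbed = il1s(original, perturbed)   # one value changed, so loss is positive
print(loss_unmodified, loss_perturbed)
```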

Boxplot of Differences

R-Code

Session-Info

About the R-Version

  • Version: R version 4.5.2 (2025-10-31 ucrt)
  • Platform: x86_64-w64-mingw32

Locales

LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8

Attached base packages

stats | graphics | grDevices | utils | datasets | methods | base

Other attached packages

labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)

Packages loaded via Namespace (but not attached)

tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)

Disclaimer

R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).

Please use the issue tracker on GitHub to report any issues:


This report was generated on Thu, 09/04/2026 at 11:38:27.