SDC-Report_v1

Input Data

The data set consists of 1088 observations

Information on selected important (key) variables

  • Categorical key variable(s): DISTRICT | URB_RUR | S4E_LIVESTOCK_ID
  • Continuous key variable(s): S4EQ7A | S4EQ7C
  • Weight variable: HH_Final_weight
  • householdID: HOUSEHOLD_ID
  • strataVariable(s): not defined

Modifications

  • Modifications on categorical key variables: FALSE
  • Modifications on continuous key variables: TRUE
  • Modifications using PRAM: FALSE
  • Local suppressions: TRUE

Disclosure risk:

Frequency Analysis for Categorical Key Variables

Number of observations violating

  • 2-Anonymity: 0 (original dataset: 1)
  • 3-Anonymity: 0 (original dataset: 5)

Percentage of observations violating

  • 2-Anonymity: 0.000% (original dataset: 0.092%)
  • 3-Anonymity: 0.000% (original dataset: 0.460%)

Disclosure Risk for Categorical Variables

Expected Percentage of Reidentifications:

  • modified data: 0.008% (~ 0.089 observations)
  • original data: 0.010% (~ 0.106 observations)

10 combinations of categories with highest risks

DISTRICT URB_RUR S4E_LIVESTOCK_ID risk fk Fk hier_risk
KAMBIA Urban Other - Ducks, Geese, Guineafowl 0.001 4 1082.974 0.0013622
KAILAHUN Urban Other - Ducks, Geese, Guineafowl 0.001 4 1194.113 0.0011153
FALABA Urban Chicken - hens/layers 0.001 4 1672.503 0.0007966
BOMBALI Urban Chicken - hens/layers 0.001 4 1711.431 0.0007785
PUJEHUN Rural Other - Ducks, Geese, Guineafowl 0.001 5 1999.404 0.0006736
KONO Rural Other - Ducks, Geese, Guineafowl 0.001 5 2169.534 0.0006167
KARENE NA Other - Ducks, Geese, Guineafowl 0.001 6 2331.101 0.0006872
KARENE Rural Other - Ducks, Geese, Guineafowl 0.001 6 2331.101 0.0006404
MOYAMBA Urban Chicken - hens/layers 0.000 15 2418.352 0.0004428
PORTLOKO Rural Other - Ducks, Geese, Guineafowl 0.000 12 2984.326 0.0003654

Disclosure Risk Continuous Scaled Variables

The (distance-based) disclosure risk for continous key variables is between 0.000% and 100.000% in the modified data.

In the original data, the risk is assumed to be approximately 100.000%.

Hierarchical risk

  • modified data: 0.115 (0.011%)
  • original data: 0.149 (0.014%)

Data Utility

Frequencies Categorical Key Variables:

Variable: DISTRICT

Categories Original data Modified data
KAILAHUN 84 84
KENEMA 89 89
KONO 92 92
BOMBALI 11 11
FALABA 5 5
KOINADUGU 2 2
TONKOLILI 86 86
KAMBIA 118 118
KARENE 31 31
PORTLOKO 109 109
BO 182 182
BONTHE 60 60
MOYAMBA 135 135
PUJEHUN 54 54
WESTERN RURAL 9 9
WESTERN URBAN 0 NA
NA 21 21

Variable: URB_RUR

Categories Original data Modified data
Rural 918 918
Urban 124 119
NA 46 51

Variable: S4E_LIVESTOCK_ID

Categories Original data Modified data
Chicken - hens/layers 997 997
Other - Ducks, Geese, Guineafowl 91 91

Local Suppressions

The table below shows for each categorical key variable the number (1st row) and the percentages (2nd row) of suppressed cells.

DISTRICT URB_RUR S4E_LIVESTOCK_ID
Number of Suppression 0 5 0
Percentage 0.000 0.460 0.000

Data Utility of Continuous Scaled Key Variables

Univariate summary of variable S4EQ7A

Original Modified Difference
Min. 1.000 1.0 0.000
1st Qu. 2.000 2.0 0.000
Median 7.000 7.0 0.000
Mean 14.875 14.8 0.075
3rd Qu. 18.500 18.5 0.000
Max. 50.000 49.2 0.800

Univariate summary of variable S4EQ7C

Original Modified Difference
Min. 2000.000 2000.0 0.0000000
1st Qu. 3998.167 3998.2 -0.0333333
Median 13993.583 13993.6 -0.0166667
Mean 29736.479 29526.6 209.8791667
3rd Qu. 36983.042 36983.0 0.0416667
Max. 99954.167 98274.9 1679.2666667

Information Loss Criteria

  • Criteria IL1: 3.360%
  • Difference of Eigenvalues in modified data: 2.371% (0.00% in original data)

Boxplot of Differences

R-Code

Session-Info

About the R-Version

  • Version: R version 4.5.2 (2025-10-31 ucrt)
  • Platform: x86_64-w64-mingw32

Locales

LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8

Attached base packages

stats | graphics | grDevices | utils | datasets | methods | base

Other attached packages

labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)

Packages loaded via Namespace (but not attached)

tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)

Disclaimer

R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).

Please use the issue-tracker on github to report any issues:


This report was generated on Tue, 07/04/2026 at 23:38:04.