SDC-Report_v1

Input Data

The data set consists of 47 observations

Information on selected important (key) variables

  • Categorical key variable(s): DISTRICT | URB_RUR | S4D_LIVESTOCK_ID
  • Continuous key variable(s): S4DQ3 | S4DQ7A | S4DQ7C | S4DQ8
  • Weight variable: HH_Final_weight
  • householdID: HOUSEHOLD_ID
  • strataVariable(s): not defined

Modifications

  • Modifications on categorical key variables: FALSE
  • Modifications on continuous key variables: TRUE
  • Modifications using PRAM: FALSE
  • Local suppressions: FALSE

Disclosure risk:

Frequency Analysis for Categorical Key Variables

Number of observations violating

  • 2-Anonymity: 0 (original dataset: 0)
  • 3-Anonymity: 0 (original dataset: 0)

Percentage of observations violating

  • 2-Anonymity: 0.000% (original dataset: 0.000%)
  • 3-Anonymity: 0.000% (original dataset: 0.000%)

Disclosure Risk for Categorical Variables

Expected Percentage of Reidentifications:

  • modified data: 0.124% (~ 0.058 observations)
  • original data: 0.124% (~ 0.058 observations)

10 combinations of categories with highest risks

DISTRICT URB_RUR S4D_LIVESTOCK_ID risk fk Fk hier_risk
PORTLOKO Urban Cows 0.007 3 222.4450 0.0066981
KOINADUGU Rural Cows 0.003 3 536.8566 0.0027863
FALABA Urban Cows 0.002 5 509.8208 0.0024458
KAMBIA Rural Cows 0.002 5 755.8718 0.0016510
WESTERN RURAL NA Cows 0.001 3 1161.5506 0.0012897
KONO Rural Cows 0.001 7 1098.8913 0.0010606
KARENE Urban Cows 0.001 4 1874.0491 0.0007110
FALABA Rural Cows 0.000 9 2327.3599 0.0004831
PORTLOKO Rural Cows 0.000 13 2253.6341 0.0004805
KARENE Rural Cows 0.000 9 2577.5857 0.0004363

Disclosure Risk Continuous Scaled Variables

The (distance-based) disclosure risk for continous key variables is between 0.000% and 95.745% in the modified data.

In the original data, the risk is assumed to be approximately 100.000%.

Hierarchical risk

  • modified data: 0.058 (0.124%)
  • original data: 0.058 (0.124%)

Data Utility

Frequencies Categorical Key Variables:

Variable: DISTRICT

Categories Original data Modified data
KAILAHUN 0 NA
KENEMA 0 NA
KONO 5 5
BOMBALI 0 NA
FALABA 11 11
KOINADUGU 1 1
TONKOLILI 0 NA
KAMBIA 3 3
KARENE 10 10
PORTLOKO 14 14
BO 0 NA
BONTHE 0 NA
MOYAMBA 0 NA
PUJEHUN 0 NA
WESTERN RURAL 1 1
WESTERN URBAN 0 NA
NA 2 2

Variable: URB_RUR

Categories Original data Modified data
Rural 34 34
Urban 10 10
NA 3 3

Variable: S4D_LIVESTOCK_ID

Categories Original data Modified data
Cows 47 47

Data Utility of Continuous Scaled Key Variables

Univariate summary of variable S4DQ3

Original Modified Difference
Min. 2.00 2.0 0.00
1st Qu. 2.75 2.8 -0.05
Median 4.50 4.5 0.00
Mean 5.25 5.2 0.05
3rd Qu. 7.00 6.9 0.10
Max. 10.00 9.7 0.30

Univariate summary of variable S4DQ7A

Original Modified Difference
Min. 1.00 1.0 0.00
1st Qu. 3.25 3.2 0.05
Median 4.50 4.5 0.00
Mean 6.50 6.4 0.10
3rd Qu. 7.75 7.7 0.05
Max. 16.00 15.7 0.30

Univariate summary of variable S4DQ7C

Original Modified Difference
Min. 100 100.0 0.0
1st Qu. 145 145.0 0.0
Median 180 180.0 0.0
Mean 190 189.2 0.8
3rd Qu. 225 224.2 0.8
Max. 300 297.0 3.0

Univariate summary of variable S4DQ8

Original Modified Difference
Min. 0.00 0.0 0.00
1st Qu. 0.75 0.8 -0.05
Median 1.00 1.0 0.00
Mean 1.75 1.7 0.05
3rd Qu. 2.00 1.9 0.10
Max. 5.00 4.7 0.30

Information Loss Criteria

  • Criteria IL1: 12.663%
  • Difference of Eigenvalues in modified data: 5.030% (0.00% in original data)

Boxplot of Differences

R-Code

Session-Info

About the R-Version

  • Version: R version 4.5.2 (2025-10-31 ucrt)
  • Platform: x86_64-w64-mingw32

Locales

LC_COLLATE=English_United States.utf8 | LC_CTYPE=English_United States.utf8 | LC_MONETARY=English_United States.utf8 | LC_NUMERIC=C | LC_TIME=English_United States.utf8

Attached base packages

stats | graphics | grDevices | utils | datasets | methods | base

Other attached packages

labelled (2.16.0) | questionr (0.8.1) | haven (2.5.5) | readxl (1.4.5) | tidyr (1.3.1) | dplyr (1.1.4) | agrisvyr (0.2.0) | sdcMicro (5.7.9)

Packages loaded via Namespace (but not attached)

tidyselect (1.2.1) | viridisLite (0.4.2) | farver (2.1.2) | R.utils (2.13.0) | S7 (0.2.1) | fastmap (1.2.0) | promises (1.5.0) | digest (0.6.38) | mime (0.13) | lifecycle (1.0.4) | cluster (2.1.8.1) | magrittr (2.0.4) | compiler (4.5.2) | rlang (1.1.6) | sass (0.4.10) | tools (4.5.2) | utf8 (1.2.6) | yaml (2.3.10) | data.table (1.17.8) | knitr (1.50) | askpass (1.2.1) | htmlwidgets (1.6.4) | plyr (1.8.9) | xml2 (1.4.1) | RColorBrewer (1.1-3) | miniUI (0.1.2) | withr (3.0.2) | purrr (1.2.0) | R.oo (1.27.1) | grid (4.5.2) | xtable (1.8-4) | data.tree (1.2.0) | ggplot2 (4.0.1) | scales (1.4.0) | MASS (7.3-65) | cli (3.6.5) | rmarkdown (2.30) | crayon (1.5.3) | generics (0.1.4) | otel (0.2.0) | rstudioapi (0.17.1) | robustbase (0.99-6) | tzdb (0.5.0) | cachem (1.1.0) | stringr (1.6.0) | rhandsontable (0.3.8) | cellranger (1.1.0) | vctrs (0.6.5) | jsonlite (2.0.0) | carData (3.0-5) | hms (1.1.4) | systemfonts (1.3.1) | jquerylib (0.1.4) | shinyBS (0.61.1) | glue (1.8.0) | DEoptimR (1.1-4) | DT (0.34.0) | stringi (1.8.7) | gtable (0.3.6) | later (1.4.4) | tibble (3.3.0) | pillar (1.11.1) | htmltools (0.5.8.1) | openssl (2.3.4) | R6 (2.6.1) | textshaping (1.0.4) | evaluate (1.0.5) | shiny (1.11.1) | kableExtra (1.4.0) | readr (2.1.6) | highr (0.11) | R.methodsS3 (1.8.2) | openxlsx (4.2.8.1) | renv (1.1.5) | httpuv (1.6.16) | bslib (0.9.0) | Rcpp (1.1.0) | zip (2.3.3) | svglite (2.2.2) | xfun (0.54) | fs (1.6.6) | forcats (1.0.1) | usethis (3.2.1) | getPass (0.2-4) | prettydoc (0.4.1) | pkgconfig (2.0.3)

Disclaimer

R-Package sdcMicro is developed and maintained by Statistics Austria (www.statistik.at).

Please use the issue-tracker on github to report any issues:


This report was generated on Tue, 07/04/2026 at 23:37:11.